Sentence segmentation accuracy #157
Unanswered
rshahrabani
asked this question in
Q&A
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
mic.txt
I am using the latest build of Catalyst for sentence segmentation. I have used it on the attached file but the accuracy seems to be off. For example, the following was broken into 2 separate sentences (see General Instruction A.2. below):
June 7, 2021 MACQUARIE INFRASTRUCTURE CORPORATION (Exact name of Registrant as specified in its charter) (212) 231-1000 (Registrant's telephone number, including area code) N.A. (Former name or former address, if changes since last report) Check the appropriate box below if the Form 8-k filing is intended to simultaneously satisfy the filing obligation of the registrant under any of the following provisions (see General Instruction A.2.
below): Β¨ Written communications pursuant to Rule 425 under the Securities Act (17 CFR 230.425) x Soliciting material pursuant to Rule 14a-12 under the Exchange Act (17 CFR 240.14a-12) Β¨ Pre-commencement communications pursuant to Rule 14d-2(b) under the Exchange Act (17 CFR 240.14d-2(b)) Β¨ Pre-commencement communications pursuant to Rule 13e-4(c) under the Exchange Act (17 CFR 240.13e-4(c)) Securities registered pursuant to Section 12(b) of the Act: Indicate by check mark whether the registrant is an emerging growth company as defined in Rule 405 of the Securities Act of 1933 (-230.405 of this chapter) or Rule 12b-2 of the Securities Exchange Act of 1934 (-240.12b-2 of this chapter).
I am using the code:
Is there any way to improve the accuracy of this sentence segmentation?
Beta Was this translation helpful? Give feedback.
All reactions