Start a new topic

QES: Change in segmentation?

 Hi,


after the last but one update I noticed CF changed the way it makes segmentation. Despite the segmentation rule is set to "sentence" it often fails to begin a new segment with the following sentence.

Is it an intentional change to this or just a bug?


                                 Regards


                               Wojciech


Nothing has been changed regarding the segmentation in the 2017 updates.

Hi,
the problem described above persists.
I have segmentation set to "Sentence" but the programme segments the text by paragraphs. I don't know how to fix it.
Another question - how to add to CT a regular expression for email addresses?

                                     Regards
                                     Wojciech

 

If your project was not created/segmented in another tool, the Sentence segmentation should work fine. The segmentation process takes place when you create a new project. You cannot change it (re-segment) afterwards. 

Could it be that in your source text, the punctuation marks are followed by non-breaking spaces?

Just a guess.

 

Maybe you are right. It occurs in OCR word documents.
Thanks for your advice.

                                                     Regards
                                              Wojciech

 

Login to post a comment