Start a new topic

REQ: Split segments at Headings for ODT files

Hello,


With the same segmentation rules (Tried Default Rules.srx, for example), MS Word (DOCX) files are segmented at headings, while ODT files have the segments joined, with a tag and no space between phrases.


This makes working in ODT files quite harder/less convenient.


Is there a segmentation rule that can force segmentation after headings (they get the paragraph end sign) for OFT files, or, could this be fixed at file filter level?


Attaching screenshots.


Thank you,




odt test.png
(13.5 KB)
docx test.png
(14.5 KB)

Can you submit a ticket with a sample ODT file attached?

Sure Igor, I've submitted a ticket with a few ODT/DOCX files to help reproduce the issue.


Thanks!


Jean

Login to post a comment