Start a new topic

Sometimes you get a TMX file that's segmented per paragraph

Sometimes you get a TMX file that's segmented per paragraph. If you want to split these TUs per sentence, you'll get an empty target sentence.


CafeTran already has a segmenting algorithm for sentences, so why not use it to catch the first target sentence of a segment with multiple sentences too?


Or even better: allow resegmenting of paragraph-wise segmented TMX files automatically, based on the sentence segmenting algorithm?


4B8RPmLpRkWFG1tIVoH4myKt0G-mgcjqDw.png


Xg69dVXzSxa9pOnvcjytNss55cJTPQfdnQ.png


Login to post a comment