Start a new topic

Dangerous Unicode line separator

Today, I happened to find that a Unicode line separator (0X2028) is treated as a white space by CT (regular Docx filter). The document was created by copying and pasting a PDF file.


memoQ:



CafeTran:




By the way, this code is reported to be dangerous as it can be used for phishing, and is banned in web browsers. An online source says that FireFox displays it as a white space (ignores it), whereas Chrome displays it as an LSEP box like memoQ's.


Interesting.

Login to post a comment