When a document is scanned it can optionally convert %XX to chars. If you find documents are getting past the phrase filtering due to encoding then enable. However this can break Big5 and other 16-bit texts.