

This is actually beyond the capabilities of current AI classification systems. A human would have to specifically notice, in the raw data, that someone is doing this and write the Perl script themselves. The odds of humans noticing and correcting it are also proportional to how popular the writing quirk is.
Yeah, exactly. Even if a word or two is unclassifiable, the sentence as a whole might still contain enough info to be usable.