From Maurice Kinal@1:153/7001 to Ozz Nixon on Monday, April 29, 2019 15:12:54
Hey Ozz!
I was wrong about the 0x8d's being stripped. I just looked in the raw pkt and now see that they are being substituted with a \r (aka 0x0d) and a space. Thus
the utf8 character c38d becomes c30d20 which is not a utf8 character given the two trailing ascii bytes neither of which are valid utf8 trailing bytes as well
as the fact that 0xc3 can only have one valid trailing byte ranging from 0x80 -
0xbf. Without thinking qabout it too hard I figure around 28 16 bit utf8 characters will be rendered invalid.
Life is good,
Maurice
... Don't cry for me I have vi.
--- GNU bash, version 5.0.7(1)-release (x86_64-pc-linux-gnu)
* Origin: Little Mikey's Brain - Ladysmith BC, Canada (1:153/7001)