[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Re: Invalid UTF-8 byte? (was: Re: utf)


On Mon, 2 Apr 2018 08:37:54 -0400
rhkramer@gmail.com wrote:

> A few weeks ago, I was looking for a byte that, in UTF-8, would be a
> totally invalid byte (not an invalid sequence of bytes).  At the time,
> I tried some googling, but it looked rather hopeless (maybe it was my
> googling that was hopeless).
> I know that your statement does not imply there is such a byte, but
> maybe you (or someone else reading this) know(s)?
> (The reason I wanted such a byte was to use it as a record separator in
> a set of text files (that I use as an askSam "workalike" (or
> "worksimilar") so that I could use msort (which depends on a 1 byte
> record separator to --separate the records ;-) while sorting.)  (Some
> of the files already include UTF-8, and, in the future, I anticpate all
> will be in UTFF-8.)

maybe you could use the null byte?



.-.. .. ...- .   .-.. --- -. --.   .- -. -..   .--. .-. --- ... .--. . .-.

War is never imperative.
		-- McCoy, "Balance of Terror", stardate 1709.2

Reply to: