Word is not text

Note to self: next time you have occasion to commit a Word document to CVS (dog forbid), make sure you tell it that it’s a binary file, not text.

I foolishly let Eclipse decide for me… and it picked text. Sigh. So a week after I had left the project, I get a call from my ex-manager complaining that certain documents could not be opened. Since one of the last things I did before leaving the job was to wipe my PC’s hard drive, I was a little concerned.

But a quick inspection of the “corrupt” Word document confirmed my suspicion. Every newline (0x0a) character was indeed preceed by a carriage return (0x0d). DOS line breaks! Thanks for that one, Bill!

Anyway, this small piece of Perl hackery later, and the document opened fine (in OpenOffice anyway).