When I try to read the training data into my c++ program, I'm running into a problem at the end of file. For the InitialTrainingSet 2012_11_12 flighthistory.csv, the actual data (including '\n' and spaces) is exactly 7784526 characters long. However, if I keep reading until I hit an end of file character (== '\0'), I end up reading 7809940 characters, where the extra characters are gibberish.
My text editor, too, insists that the file is 7809940 characters large, but a direct character count yields only 7784526 characters.
Anyone know what's going on?



Flagging is a way of notifying administrators that this message contents inappropriate or abusive content. Are you sure this forum post qualifies?

with —