[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: CDROM Data.



> 
> Did one of you say you had converted the fixed length files to delimited ?
> What sizes did you end up with ?
> 
I did.
I only did it with the apt file.

I posted the info, and the perl code I used.
I never heard a word form anyone, and have not done anything more with
the code since.

The apt file was split into 3 files.
The reason for that, is almost all the these files have more than one
record type in them, the database guy in me couldn't swallow that.

The apt file contains general airport info, runway info for each runway
and additional remarks. there are variable numbers of the last 2 types.

Nothing was removed from the files, data or field-wise.
Nothing was done to encode or compress the data.
IE things like "Airframe Service: "MINOR or MINOR or NONE""
could easily be encoded in 4 bits, rather than 5 chars.

The short info on the file sizes is this:
Original File:
102,171,238 Feb  2 13:05 apt.txt

Straight delimited text:
9,184,999 Feb  4 11:03 apt.apt.delimited
4,766,349 Feb  4 11:03 apt.rmk.delimited
3,934,793 Feb  4 11:03 apt.rwy.delimited

gzip *.delimited:
2,198,942 Feb  4 11:03 apt.apt.delimited.gz
1,296,794 Feb  4 11:03 apt.rmk.delimited.gz
  779,955 Feb  4 11:03 apt.rwy.delimited.gz
 

Marc

-
Archives of linux-aviation: http://mail.nl.linux.org/lists/linux-aviation/
To unsubscribe: send the command "unsubscribe linux-aviation" in the body
of a mail message to <Majordomo@mail.nl.linux.org>.