M
Mark Jerde
(If these are the wrong groups please suggest the right one(s). Thanks.)
I need to come up with a way to test potentially thousands of data (files /
records / streams) to determine if they match one of about thirty defined
data formats. If a record partially matches one of the formats I need to
log why it failed.
The formats are byte-oriented. Byte 0 is the type, byte 1 is the subtype,
bytes 2-5 give the total record length, etc. There are two wrinkles.
First, some of the formats allow 1..n subrecords, like a person listing her
home phone, cell phone, fax number, ICQ #, the dog's cell phone, etc.
Second, some of the formats allow other formats to be wholly contained in
them, like an "inventory" format being made up of many separate items of
different "item" format types.
In the history of computers this *can't* be the first need for this kind of
program. ;-) New formats are approved periodically so hard-coding
everything in C# or VB.NET is a sub-optimal solution. ISTM it should be
possible write the permissible format "rules" in (XML / ASN.1 / RegEx /
etc.), present the rules to a tried and true program, and smash data files
against the program all day long.
Suggestions? Windows preferred but not required.
Thanks.
-- Mark
I need to come up with a way to test potentially thousands of data (files /
records / streams) to determine if they match one of about thirty defined
data formats. If a record partially matches one of the formats I need to
log why it failed.
The formats are byte-oriented. Byte 0 is the type, byte 1 is the subtype,
bytes 2-5 give the total record length, etc. There are two wrinkles.
First, some of the formats allow 1..n subrecords, like a person listing her
home phone, cell phone, fax number, ICQ #, the dog's cell phone, etc.
Second, some of the formats allow other formats to be wholly contained in
them, like an "inventory" format being made up of many separate items of
different "item" format types.
In the history of computers this *can't* be the first need for this kind of
program. ;-) New formats are approved periodically so hard-coding
everything in C# or VB.NET is a sub-optimal solution. ISTM it should be
possible write the permissible format "rules" in (XML / ASN.1 / RegEx /
etc.), present the rules to a tried and true program, and smash data files
against the program all day long.
Suggestions? Windows preferred but not required.
Thanks.
-- Mark