Hi all,
I'm not sure how to start programming the following case:
I have several CSV files formatted as follows:
identifier / last update / content ....
I would like to merge all of them into a single file. If an identifier is found in several files (or several times in one file), I want to keep only the line with the latest "last update".
What do you think would be the best algorithm for this?
Thanks,
AL.
Example:
file 1:
id1 / 051108 / blabla
id2 / 051107 / blabla
id3 / 051107 / blabla
id4 / 051108 / blabla
file 2:
id5 / 051108 / blabla
id1 / 051105 / blabla
id1 / 051104 / blabla
....
What I would like at the end is a single file with:
id1 / 051108 / blabla
id2 / 051107 / blabla
id3 / 051107 / blabla
id4 / 051108 / blabla
id5 / 051108 / blabla
--> each identifier appears only once, with its most recent update.
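
One possible starting point, as a minimal sketch rather than a definitive solution: read every file line by line and keep, in a dictionary keyed by identifier, only the line with the greatest "last update" seen so far. The sketch assumes the fields are separated by " / " as in the example above, and that the date is a fixed-width string that sorts chronologically when compared as text (for instance YYMMDD). If the dates use another layout, they would need to be parsed before comparing. The names merge_latest and write_merged are made up for illustration.

```python
def merge_latest(paths, sep=" / "):
    """Return {identifier: (last_update, full_line)}, keeping the newest line per identifier.

    Assumption: last_update is a fixed-width string that sorts
    chronologically as plain text (e.g. YYMMDD).
    """
    latest = {}
    for path in paths:
        with open(path, encoding="utf-8") as handle:
            for raw in handle:
                line = raw.strip()
                parts = line.split(sep, 2)
                if len(parts) < 2:
                    continue  # skip blank or malformed lines
                ident, updated = parts[0].strip(), parts[1].strip()
                # Keep this line only if the identifier is new or the date is later.
                if ident not in latest or updated > latest[ident][0]:
                    latest[ident] = (updated, line)
    return latest


def write_merged(latest, out_path):
    """Write one line per identifier, sorted by identifier."""
    with open(out_path, "w", encoding="utf-8") as out:
        for ident in sorted(latest):
            out.write(latest[ident][1] + "\n")
```

It could be called as write_merged(merge_latest(["file1.csv", "file2.csv"]), "merged.txt"), where the file names are placeholders. Each input line is read once, and memory use grows with the number of distinct identifiers, not with the total number of lines.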