Compare two text columns and find the closest matching pair of cells

betty77

I have a database with two columns I need to match up:

column 1                      column 2
(book summary1)               (book summary2)

book 1 summary1               book 1 summary2 first part
book 2 summary1               book 1 summary2 last part
book 3 summary1 first part    book 2 summary2
book 3 summary1 last part     book 3 summary2
...

As you can see, each book's summary1 does not line up with its
summary2, because the summaries are split across multiple rows. My task
is to match each book's summary1 with its summary2, and if summary1 or
summary2 spans multiple rows, I need to merge those rows into one.

Summary1 is like a rephrasing of summary2. They talk about the same
thing but use different wording and sentence division. Also, there is
no unique identifier in the two columns, such as a book ISBN number or
title. I plan to count the number of words that the two summary columns
have in common to find the closest match, and then perform a manual
check to confirm.

Can anyone help me write an Excel program to compare the two columns
and count the number of words they have in common? Thanks a million.
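
For reference, here is a rough sketch of the word-counting idea described above. It is written in Python rather than as an Excel macro, so it assumes the two columns have been exported to plain lists of strings. The function names (`words`, `common_word_count`, `closest_matches`) and the sample summaries are made up for illustration, and it only pairs single cells; it does not try to merge summaries that are split across rows.

```python
import re

def words(text):
    """Lowercase a summary and return its set of distinct words."""
    return set(re.findall(r"[a-z0-9']+", text.lower()))

def common_word_count(a, b):
    """Count the distinct words that appear in both summaries."""
    return len(words(a) & words(b))

def closest_matches(column1, column2):
    """For each cell in column1, find the column2 cell sharing the most words.

    Returns (column1 index, best column2 index, shared word count) tuples.
    Assumes column2 is not empty; ties go to the earliest column2 cell.
    """
    results = []
    for i, left in enumerate(column1):
        scores = [common_word_count(left, right) for right in column2]
        best = max(range(len(scores)), key=scores.__getitem__)
        results.append((i, best, scores[best]))
    return results

# Hypothetical sample data, not taken from the real database.
column1 = [
    "A young wizard attends a school of magic",
    "A whale hunt ends in disaster for the captain",
]
column2 = [
    "The captain pursues a white whale",
    "At a school for magic a boy learns he is a wizard",
]

for row, match, shared in closest_matches(column1, column2):
    print(f"column 1 row {row} -> column 2 row {match} ({shared} shared words)")
```

Very common words such as "a" and "the" count toward the score here, so it may help to drop them before comparing. The manual check is still needed, since a high shared-word count is only a hint that two cells belong to the same book.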