DotLucene question: tighten search results.

Peter Rilling

If anyone has experience with DotLucene, then this question might be right
up your alley.

I have two lists of music titles, each from a different source. I am
trying to determine possible matches so that I can associate them. I know that
any association on text will not be perfect, but I am interested in the
probability of two titles being the same.

Using DotLucene, I have created an index from one set and am enumerating the
second, performing a search for each title. I get back a set of hits, but the
results are not very precise. Maybe it is because titles do not give much
content to search on. I am looking for a way to make the results more
strict, or to take word position and proximity into account when calculating the
score.

For example, the title "Give In To Me" is matching 100% with the title
"Heaven Give Me World". Likewise, "You Rock My World (Dance Mix)" is 100% a
match for "How's My World Treatin' You".

Can I tighten the search system so that it is more strict?
 
You can add proximity to the words you are searching for: put them in
quotes as a phrase and append ~x, where x is the maximum distance between
words.
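
As a rough sketch of what that looks like, here is a small Python helper that turns a title into a query string in Lucene's query parser syntax. The field name "title" and the helper name are my own assumptions, not something from your index, and the punctuation stripping is only a guess at what your analyzer does.

```python
import re

def proximity_query(title, slop, field="title"):
    """Build a phrase query string with a proximity (slop) suffix.

    "title" is a hypothetical field name; use whatever field you indexed.
    """
    # Lowercase and keep only word-like tokens so that characters such as
    # parentheses, which are special in the query syntax, do not leak in.
    words = re.findall(r"[a-z0-9']+", title.lower())
    phrase = " ".join(words)
    return f'{field}:"{phrase}"~{slop}'

print(proximity_query("Give In To Me", 2))
# title:"give in to me"~2
```

A small x keeps the words close together and roughly in order, so a title that only shares a couple of scattered words should no longer come back as a hit.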

Also remember to check your stop word list, as those words will be
excluded from the index and so may skew the results.
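
To see why this matters for short titles, here is a small Python sketch that mimics stop word removal. The stop word set below is an assumption on my part (it is modelled on the usual English defaults), so check it against the list your analyzer really uses.

```python
import re

# Assumed stop word list, for illustration only.
STOP_WORDS = {
    "a", "an", "and", "are", "as", "at", "be", "but", "by", "for", "if",
    "in", "into", "is", "it", "no", "not", "of", "on", "or", "such", "that",
    "the", "their", "then", "there", "these", "they", "this", "to", "was",
    "will", "with",
}

def indexed_terms(title, stop_words=STOP_WORDS):
    """Return the lowercased words of a title that survive stop word removal."""
    words = re.findall(r"[a-z0-9']+", title.lower())
    return [w for w in words if w not in stop_words]

print(indexed_terms("Give In To Me"))         # ['give', 'me']
print(indexed_terms("Heaven Give Me World"))  # ['heaven', 'give', 'me', 'world']
```

With a list like this, "Give In To Me" shrinks to just "give" and "me", and both of those words appear in "Heaven Give Me World", which may be part of why the two titles score so highly against each other.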
 
Also check that the query is using AND rather than OR for the search.
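
Here is a minimal Python sketch of the difference, assuming a hypothetical field called "title". In the query syntax a leading + marks a term as required, which has the same effect as joining the terms with AND. The two match functions are not Lucene calls, they only imitate "all terms" versus "any term" matching on plain word sets.

```python
import re

def tokenize(title):
    """Lowercase a title and split it into word-like tokens."""
    return re.findall(r"[a-z0-9']+", title.lower())

def required_terms_query(title, field="title"):
    """Build a query string in which every term is required (+term)."""
    return " ".join(f"+{field}:{word}" for word in tokenize(title))

def matches_any(query_title, candidate_title):
    """OR-style matching: at least one word in common."""
    return bool(set(tokenize(query_title)) & set(tokenize(candidate_title)))

def matches_all(query_title, candidate_title):
    """AND-style matching: every query word appears in the candidate."""
    return set(tokenize(query_title)) <= set(tokenize(candidate_title))

query = "You Rock My World (Dance Mix)"
candidate = "How's My World Treatin' You"

print(required_terms_query(query))
# +title:you +title:rock +title:my +title:world +title:dance +title:mix
print(matches_any(query, candidate))  # True
print(matches_all(query, candidate))  # False
```

Bear in mind that requiring every term is quite strict: a title with an extra "(Dance Mix)" on one side would no longer match the plain title on the other, so you may want to strip such suffixes before searching.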
 
