Domain Name Matching Algorithms


I have a list of companies and want to match the domains fetched using Google search to the ones that belong to the same company. Are there any existing algorithms available for this use case (ones that are legally allowed to be used in a commercial project too)?

E.g., I have Internet Movie Database as the company name, and Google returns me results, out of which the valid ones are internetmoviedatabase, internet-movie-database, the-internet-movie-database, theinternetmoviedatabase, internetmovies, internet-movies, imd, imdb. (Note: I have excluded the TLDs from the list to make the question simpler.)

It sounds like you are looking for an approximate string matching algorithm. I'm not sure whether you are looking for an algorithm or an implementation.

There is a related question on here: string matching algorithm

One possible solution is to use the Levenshtein distance: http://en.wikipedia.org/wiki/levenshtein_distance
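
As a rough illustration of how Levenshtein distance could be applied to the example above, here is a minimal Python sketch. The helper names (`levenshtein`, `normalize`, `similarity`) and the idea of stripping non-alphanumeric characters before comparing are assumptions made for this example, not something the question specifies.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance counting insertions, deletions and substitutions."""
    if len(a) < len(b):
        a, b = b, a  # keep the inner row as short as possible
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # delete ch_a
                current[j - 1] + 1,      # insert ch_b
                previous[j - 1] + cost,  # substitute (or keep if equal)
            ))
        previous = current
    return previous[-1]


def normalize(name: str) -> str:
    """Hypothetical cleanup step: lowercase, keep only letters and digits."""
    return "".join(ch for ch in name.lower() if ch.isalnum())


def similarity(company: str, domain: str) -> float:
    """Score in [0, 1]; 1.0 means identical after normalization."""
    a, b = normalize(company), normalize(domain)
    longest = max(len(a), len(b))
    if longest == 0:
        return 1.0
    return 1.0 - levenshtein(a, b) / longest


if __name__ == "__main__":
    company = "Internet Movie Database"
    for domain in ["internet-movie-database", "the-internet-movie-database",
                   "internetmovies", "imdb"]:
        print(domain, round(similarity(company, domain), 3))
```

Note that edit distance alone scores abbreviations such as imd or imdb very low against the full company name, so those would probably need a separate rule (for example, comparing against the initials of the company name) and a threshold chosen by experiment.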

If you are looking for an implementation, the first result when I google "approximate string matching c++" is: http://www.chokkan.org/software/simstring/

Good luck!

