Google Refine is an elegant tool for data cleaning. One of its most powerful features is the ability to call “Reconciliation Services” to help clean data, for example by matching names to external identifiers. Google Refine comes with the ability to use Freebase reconciliation services, but you can also add external services. Inspired by this, I’ve started to implement services to reconcile taxonomic names.
A timely and comprehensive article by Rod Page. Read more on iPhylo. Thanks, Rod!
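
For readers new to reconciliation, here is a minimal Python sketch of the kind of exchange involved: a batch of names goes out, and each comes back with scored candidate identifiers. The query and response shapes follow the general pattern of Refine-style reconciliation batches as I understand them. The function names, the `example:` identifiers, the scores, and the sample response are all invented for illustration. This is not Rod's implementation, and no real service is contacted.

```python
import json


def build_queries(names, limit=3):
    """Build a batch query object keyed q0, q1, ... (one entry per name)."""
    return {
        f"q{i}": {"query": name, "limit": limit}
        for i, name in enumerate(names)
    }


def best_matches(names, response):
    """Map each input name to the id of its top-scoring candidate, or None."""
    matches = {}
    for i, name in enumerate(names):
        candidates = response.get(f"q{i}", {}).get("result", [])
        best = max(candidates, key=lambda c: c.get("score", 0), default=None)
        matches[name] = best["id"] if best else None
    return matches


# The second name is deliberately misspelled to mimic dirty data.
names = ["Homo sapiens", "Pan troglodites"]
queries = build_queries(names)
print(json.dumps(queries, indent=2))

# Hypothetical response: ids and scores are made up, not from any real service.
sample_response = {
    "q0": {"result": [
        {"id": "example:1", "name": "Homo sapiens", "score": 100.0, "match": True},
    ]},
    "q1": {"result": [
        {"id": "example:3", "name": "Pan paniscus", "score": 40.0, "match": False},
        {"id": "example:2", "name": "Pan troglodytes", "score": 87.5, "match": False},
    ]},
}

resolved = best_matches(names, sample_response)
print(resolved)
```

In practice the candidate list would come from whichever service is registered in Refine, and a low score like the one for the misspelled name would usually be reviewed by hand rather than accepted automatically.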