Open source data integration vendor Talend has unveiled a tool aimed at scrubbing dirty data from corporate information repositories.
Talend Data Quality, which will be available free under a GPL license, ferrets out such errors as duplicate names and address, and improperly configured data including phone numbers. At its most basic, the software can ensure a person's phone number is correct and has the required number of digits; check that zip codes match the cities contained in an address entry; and consolidate entries that have names, nicknames or abbreviations that apply to the same person.
"Data quality goes way beyond name and address issues," says Yves De Montcheuil, vice president of worldwide marketing for Talend. "Those are the most prevalent. But if you have a product catalog, you need to ensure product descriptions are correct and that the price makes sense."
He says clean data is key when integrating information across systems because mis-information can propagate fast not only internally but to partners.
Talend is coupling its Data Quality tool with the Talend Open Profiler software it released in June. The Profiler can look inside a database and pinpoint problems. The tools can be used together or separately. In addition, both tools work in harmony with Talend's Integration Suite.
The company also has been forging relationships with major vendors, including a partnership with Microsoft earlier this year.
Data Quality is a graphical tool that lets users drag and drop components onto a process map. The components describe such tasks as reformatting an address, checking an address against the U.S. Postal Service database, pulling data from specific repositories, adding longitude and latitude to customer records to provide navigation help to delivery drivers, or pulling data from a credit bureau. An SDK lets users create their own components.
Once the process is complete, Talend Data Quality generates an executable code in Java or Perl that can be installed in multiple places on the network and close to data sources.
Talend plans to deliver Data Quality at the end of September and will offer tech support and other services via a subscription that starts at US$15,000 per year.
Latest on Business
- Sun banks on open source for its survival
- Novell's FOSSA architecture document out by December
- Open sourcing code may improve transparency on Wall Street
- Audits, open source needed with e-voting
- Free software: It's about the money
- Is Sun Solaris on its deathbed?
- At 10, Google reiterates commitment to CIOs
- Analysts: Google spreading itself too thin
- As Google turns 10, enterprise success in question
- 10 open source companies to watch
Open Source Essentials
- Microsoft 'interested' in open source browser: Ballmer
- Flying high with open source
- Open sourcing code may improve transparency on Wall Street
- Problem-solvers hunt open-source solutions
- Open source advocates hail appeals court ruling
- Open-source software a security risk, study claims
- Insurance company bets health on open source
- Open source stack solid foundation for All Homes
- 20 great Windows open source projects you should get to know
- Via releases laptop design as open source
TechWorld Jobs (beta)
Recent Jobs
TechWorld Blogs
-

TalkingTech
The view from the top of IT with TechWorld Editor Rodney Gedda
-

Entrenched
Cooking up better code, IDG's developers reveal some of their secrets
-

Broadband Voice
Darren Pauli digs in from the front line of Australia's broadband battleground
Recent blog posts
- An open storage stack? I like the sound of that
- The mobile clone wars: fighting for a better phone experience
- Stopping the "Clean Feed"
- Identifying web platforms
- Clean Feed ‘not technically possible’
- No Clean Feed - well duh!
- Conroy's content cops still on the cards
- Will open source ruin the economy? Please help
- Linux kernel 2.6.27 is out!
- Falling off the ob_start stack
Recent comments
- Hello this is Brianna
5 hours 5 min ago - Turn any PC into a media center
18 hours 57 min ago - How About the Correct Title?
1 day 10 hours ago - who are you kidding?
1 day 15 hours ago - Seriously, how much did they pay for this advertisement
3 days 6 hours ago - SF Bay Area - free Seminar on Enterprise Cloud Computing
3 days 9 hours ago - video conferening but not telepresence...
3 days 16 hours ago - SAMSUNG OLED 40" TECHNOLOGY
4 days 1 hour ago - What was the question again, oh well this was prepared earlier
6 days 8 hours ago - Worldwide broadband prices continue to drop which means ? in AU
6 days 8 hours ago - Not a Problem Here in Australia and New Zealand
1 week 1 day ago - Clear the air
1 week 2 days ago - Tabbed browsing, Quick Find,
1 week 5 days ago - Microsoft details plans for new social bookmarking tool
1 week 6 days ago - There is a 3rd party tool
2 weeks 1 day ago - Demise of Windows
2 weeks 1 day ago - new OS
2 weeks 1 day ago - Re: Favicon
2 weeks 2 days ago - Multi Camera Kino
2 weeks 2 days ago - Favicon
2 weeks 3 days ago



