In Pictures: 9 open source Big Data technologies to watch
Apache Hadoop
Apache Hadoop is an open source software framework for data-intensive distributed applications originally created by Doug Cutting to support his work on Nutch, an open source Web search engine. To meet Nutch's multimachine processing requirements, Cutting implemented a MapReduce facility and a distributed file system that together became Hadoop. He named it after his son's toy elephant. Through MapReduce, Hadoop distributes Big Data in pieces over a series of nodes running on commodity hardware. Hadoop is now among the most popular technologies for storing the structured, semi-structured and unstructured data that comprise Big Data. Hadoop is available under the Apache License 2.0.
Latest News
- Are we ready for a mobile-first world?
- Smartphone chips could replace server processors in HPC, researchers say
- Google pursuing broad wireless project for emerging markets, report says
- Schnucks wants federal court to handle data breach lawsuit
- Proposed law would make reprogramming cellphone IDs a crime
- Puppet gets a more expressive configuration language
- iPhone 6 rumour rollup for the week ending May 24
- iPad 5 rumour rollup for the week ending May 23
- Is Google trying to swipe Waze from under Facebook's nose?
- Public cloud shakeup: VMware in, Dell out, and OpenStack in limbo?
- Reports: FTC examining Google's display ads
- Researchers warn of increased Zeus malware activity this year
- Google, Facebook launch bidding war for Waze
- Google ready to start shipping Glass to #ifihadglass Explorers
- Vanishing into thin [MacBook] Air: Shortages signal WWDC refresh























