Continuing with their Open Source tradition, Psychsoftpc, a Quincy, MA computer manufacturer, is now offering their Linux Psychlone Clusters with Apache Hadoop. Hadoop is a Java-based programming framework that supports the processing of large data sets in a distributed computing environment.
|
|
Psychsoftpc offers Hadoop Cluster Solution
Continuing with their Open Source tradition, Psychsoftpc, a Quincy, MA computer manufacturer, is now offering their Linux Psychlone Clusters with Apache Hadoop. Hadoop is a Java-based programming framework that supports the processing of large data sets in a distributed computing environment. Hadoop is based on Google's MapReduce algorithm, which breaks an application down into numerous small parts. Any of these parts (also called fragments or blocks) can be run on any node in the cluster. Hadoop makes it possible to run applications on systems with thousands of nodes involving thousands of terabytes. A distributed file system (DFS) facilitates rapid data transfer rates among nodes and allows the system to continue operating uninterrupted in case of a node failure. The Psychsoftpc Hadoop offering includes Cloudera's Distribution Including Apache Hadoop or CDH. CDH delivers a streamlined path for putting Apache Hadoop to work solving business problems in production. Ideal for enterprises seeking an integrated, tested Apache Hadoop-based system without proprietary vendor lock-in, CDH is the bridge between the experience of organizations using Hadoop in production and the continuous stream of innovations from the Apache community.
Hadoop is widely used across industries including finance, entertainment, government, health care, media, technology, telecom, and other industries with big data analytic requirements. Let's face it, Hadoop isn't for everyone. It is a technology designed for very large data sets. But if you have to analyze terabytes of data, Hadoop is the open source solution. One company that's using Hadoop successfully is Twitter, which needs to use clusters for its data. The amount of data it stores every day is too great to be reliably written to a traditional hard drive. Twitter is working with 12 terabytes of new data per day. Twitter has also found that SQL wasn't efficient enough to do analytics at the scale the company needs. Thus, they turned to Hadoop. Facebook also uses Hadoop to manage the 40 billion photos it stores. "It's how Facebook figures out how closely you are linked to every other person," said Jeff Hammerbacher. Other industries that could benefit from Hadoop are biotech, oil and gas, retail and insurance, any industry that consistently creates and analyzes huge amounts of data.
Psychsoftpc has been offering Linux, Open Source and Cluster solutions since the late 1990's and has been a staunch supporter of the Linux and Open Source communities. Founded in 1987 as an Artificial Intelligence Natural Language Software company, Psychsoftpc has grown too offer high end computer hardware to the DOD, NASA, NIH, the private sector and colleges and universities throughout the United States. Psychsoftpc is based in Quincy, MA.
Psychsoftpc can be found at http://www.psychsoftpc.com and can be reached at 617-471-8733.
|