The exponential growth in data volumes has generally translated into equally daunting e-discovery collection challenges. Using traditional methods to process voluminous data under strict production deadlines can bury organizations in an avalanche of big data. Apache Hadoop, a hyped open-source software framework allowing distributed processing of large data sets, can transform this avalanche, to further the analogy, into well-manicured ski slopes. But it is not perfect.
Hadoop can multiply computing capacity
In e-discovery cases with huge data volumes, predictive coding and other advanced analytics can be crucial to manage the necessary review and discovery scheduling order. Both the volumes and the level of computing required with this kind of analysis of big data have prompted a need for greater computing capacity.
Customary methods, however, are not adequate. Installing larger, more powerful computers can be costly and insufficient to keep pace and provide processing scalability. Hadoop, on the other hand, can be run on inexpensive commodity servers, which are manufactured by multiple vendors and have open standards. New servers can easily be added to the Hadoop cluster while problems on other servers are fixed without halting the entire process.
To Continue Reading: Click Here
-----------------------------------------------------
Source: ACEDS
By: John Addington
Hadoop can multiply computing capacity
In e-discovery cases with huge data volumes, predictive coding and other advanced analytics can be crucial to manage the necessary review and discovery scheduling order. Both the volumes and the level of computing required with this kind of analysis of big data have prompted a need for greater computing capacity.
Customary methods, however, are not adequate. Installing larger, more powerful computers can be costly and insufficient to keep pace and provide processing scalability. Hadoop, on the other hand, can be run on inexpensive commodity servers, which are manufactured by multiple vendors and have open standards. New servers can easily be added to the Hadoop cluster while problems on other servers are fixed without halting the entire process.
To Continue Reading: Click Here
-----------------------------------------------------
Source: ACEDS
By: John Addington

1 comment:
Nice Post by edd. It contain useful information.The objective of Cloud Computing to provide the high performance computing technology traditionally available to governmental authorities and military, within the reach of individual and small businesses. It is equally tempting to large enterprises especially because of its cost efficiency.
Forum Posting
Post a Comment