Syncsort Contributes Ground-Breaking Enhancements to Apache Hadoop
Commit of New Feature is Major Milestone in Boosting Hadoop’s Big Data Integration Capabilities
WOODCLIFF LAKE, N.J. – February 25, 2013
Syncsort, a global leader in high-performance data integration solutions, today announced a milestone contribution in its ongoing commitment to the open source community, with a new feature that strengthens Apache Hadoop’s Big Data integration & ETL capabilities.
The new feature is now committed to Apache Hadoop 2.0.3-alpha
and has received broad-based support from leading Hadoop organizations. The key improvement is a new feature that allows external sort implementations within the Hadoop MapReduce framework, helping organizations to accelerate development, build complex ETL flows and MapReduce jobs without coding and seamlessly optimize Hadoop. The patch also simplifies use cases that are currently challenging in MapReduce so they can be implemented faster and more efficiently.
“Hadoop is a rapidly evolving ecosystem that is emerging as the operating system for Big Data,” said Josh Rogers, senior vice president, data integration business, Syncsort. “Our focus is to help build out Hadoop’s data integration & ETL capabilities, removing barriers that undermine its potential and helping organizations ramp-up their Big Data initiatives.”
Syncsort has worked with the Apache Hadoop community on enhancements and fixes and will continue to collaborate on future projects. The additional flexibility provided by the new feature will help the emerging ecosystem as well as current Hadoop users tackle a broader set of use cases for Big Data analytics. In addition, Syncsort will leverage the feature by delivering a pluggable version of its leading high-performance sort solution, DMExpress®
this spring, which is currently in beta test with select customers.
At the O’Reilly Strata conference
in Santa Clara, California this week, in booth #900, Syncsort will highlight how the feature helps Hadoop MapReduce users. Syncsort will also demonstrate how DMExpress can help organizations have the most current, accurate data available for business analysis, while reducing the cost and complexity of processing increasingly large amounts of data.
For more information about the feature, read our blog at blog.syncsort.com
. To learn how Syncsort helps organizations reduce data integration TCO and complexity, visit bit.ly/12HrJZu
to get our white paper.
About Syncsort’s Data Integration Business
Syncsort provides data-intensive organizations across the big data continuum with a smarter way to collect and process the ever-expanding data avalanche. With thousands of deployments across all major platforms, including mainframe, Syncsort helps customers around the world to overcome the architectural limits of today’s ETL and Hadoop environments, empowering their organizations to drive better business outcomes in less time, with less resources and lower TCO.
Director, Corporate Communications
Smart Connections PR