Syncsort, a global leader in Big Iron to Big Data solutions, today announced new open metadata management capabilities in its DMX-h data integration software that, along with its seamless integration with Cloudera Navigator, make Big Data governance easier. Syncsort’s multi-award-winning DMX-h provides organizations with a single interface for accessing and integrating all enterprise data, including data from IBM z mainframes, and the flexibility to use the metadata repository that best meets their needs, on-premise and in the cloud.
“Organizations are finding that keeping track of data, data security, data access, and regulatory compliance is more critical and more challenging than ever before,” said Tendu Yogurtcu, General Manager of Syncsort’s Big Data business. “Data governance in Hadoop — including auditing, lineage, and metadata management — requires a scalable approach that is easy to interoperate across multiple platforms, on-premise and in the cloud. Syncsort continues to work closely with Cloudera to deliver a best-in-class solution for this challenge.”
Because Syncsort DMX-h runs natively in Hadoop, it integrates seamlessly with Cloudera Navigator, allowing users to search for DMX-h jobs across a unified metadata repository and view data lineage within the Navigator user interface out of the box.
“As a long-time Cloudera partner and contributor to big data innovation including open source projects like Hadoop, Sqoop and Spark, Syncsort continues to innovate in support for Cloudera’s Data Governance leadership,” said Tim Stevens, vice president, Business and Corporate Development, Cloudera. “Our joint customers are excited that Syncsort’s data integration solution supports Big Data governance by tracking data lineage from legacy sources such as mainframes and databases to new enterprise data hubs, and shares that information with Cloudera Navigator.”
New Open Metadata Management Capabilities Support Big Data Governance with Unprecedented Flexibility and Choice
In new capabilities delivered this month, Syncsort extends metadata management by publishing DMX-h metadata, including data flow definitions and lineage tracking information, in an open format that can be easily consumed by and integrated with any metadata repository. This metadata keeps track of processing done inside or outside a distributed computing cluster.
Rather than locking users into a proprietary metadata format, Syncsort’s latest capabilities employ a file-based metadata approach and integrate seamlessly with Hadoop-based metadata repositories such as HCatalog and HDFS. With the new capabilities, DMX-h exports cross-platform metadata in an open, comma-delimited (.csv) text format that any system can import. This data can be loaded into databases or other metadata repositories, used in governance audit reports, and analyzed in third-party tools. Organizations can leverage Syncsort’s differentiated approach to integrate their Big Data lineage information with the metadata management tool of their choice and transition from existing in-house tools to Hadoop-based metadata management tools such as Cloudera Navigator at their own pace.
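As a rough illustration of what loading such an export into a database might look like, the sketch below reads comma-delimited lineage rows into SQLite using only Python's standard library. The column names (job_name, source, target), the sample rows, and the helper functions are hypothetical; the actual layout of the DMX-h export is not described in this announcement.

```python
import csv
import io
import sqlite3

# Hypothetical sample data. The real DMX-h export columns are not
# specified in this announcement; these three are assumed for illustration.
SAMPLE_CSV = """job_name,source,target
load_customers,mainframe:CUST.VSAM,hdfs:/data/customers
load_orders,db2:ORDERS,hdfs:/data/orders
join_orders,hdfs:/data/customers,hdfs:/data/enriched
join_orders,hdfs:/data/orders,hdfs:/data/enriched
"""


def load_lineage(conn, csv_text):
    """Insert lineage rows from CSV text into a 'lineage' table; return the row count."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS lineage "
        "(job_name TEXT, source TEXT, target TEXT)"
    )
    reader = csv.DictReader(io.StringIO(csv_text))
    rows = [(r["job_name"], r["source"], r["target"]) for r in reader]
    conn.executemany("INSERT INTO lineage VALUES (?, ?, ?)", rows)
    conn.commit()
    return len(rows)


def upstream_sources(conn, target):
    """Return the direct sources that feed a given target, sorted by name."""
    cur = conn.execute(
        "SELECT DISTINCT source FROM lineage WHERE target = ? ORDER BY source",
        (target,),
    )
    return [row[0] for row in cur]


# An in-memory database stands in for whatever repository an organization uses.
conn = sqlite3.connect(":memory:")
count = load_lineage(conn, SAMPLE_CSV)
sources = upstream_sources(conn, "hdfs:/data/enriched")
print(count, sources)
```

Once the rows are in a table, a governance audit report becomes an ordinary SQL query, which is the practical appeal of an open text format over a proprietary one.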
DMX-h can be deployed via Cloudera Manager and supports Hadoop-based security protocols, such as Kerberos for data security and privacy, and Apache Sentry for fine-grained data access control.
The new Big Data governance capabilities are now available in DMX-h v9. The release allows organizations to securely access and integrate batch and real-time data streams from multiple enterprise data sources, including Kafka, mainframes, relational databases, and unstructured sources, in the same data pipeline feeding Hadoop and Spark. Customers can then leverage advanced analytics on that data for competitive advantage, on-premise and in the cloud.