Sign up FAST! Login

Stinger - the next thing in Big Data architecture

Informatica and Big Data Innovations for the Modern Data Architecture


In this partner guest blog, John Haddad, senior director of product marketing at Infomatica, explains and enumerates how’s key innovations in Enterprise SQL at Hadoop scale will augment Informatica’s Big Data Edition integration with Hortonworks’ Modern Data Architecture.

Informatica is excited about the new innovations Hortonworks is including in the project such as Hive transactions, Hive-Spark integration, and sub-second queries. The Informatica Big Data Edition helps our customers take advantage of these new innovations without having to rebuild their data pipelines for Big Data analytics. This is achieved through a visual development environment and metadata driven approach built on the Informatica Vibe virtual data machine that runs natively in Hadoop. For example:

  • We expect that the enhancements to Tez, block based processing, and support for Spark will improve performance for specific data integration and data quality transformations.
  • The Introduction of ACID support with Stinger will simplify the implementation of update strategies on Hadoop.
  • The Sub-second SQL queries on Hadoop will make it easier for analytics teams to do visualizations and data analysis.

These are just some key benefits as the Stinger project continues to evolve. Our joint customers expect that the corporate IT standards for governance, security, and reliability will extend throughout the Modern Data Architectureas they look to complement their existing information management infrastructure with Hadoop. Transaction-level support (i.e. ACID), sub-second queries, and continuous performance improvements are major milestones for applications running on Hadoop such as the Informatica Big Data Edition. Informatica running on Hortonworks ensures our customers can implement a Modern Data Architecture (see Figure 1) that supports the demands of enterprise-ready deployments with regards to performance, scalability, security, and governance.


Figure 1: Informatica + Hortonworks reference architecture for data integration and data quality to support the demands of of enterprise-ready deployments.

Stashed in:

To save this post, select a stash from drop-down menu or type in a new one:

I'm so behind I didn't realize there's an Apache MapReduce.

You May Also Like: