At Strata + Hadoop World San Jose today, Microsoft is announcing several updates to its data platform products, including DocumentDB and HDInsight. DocumentDB is a globally distributed database service that allows developers to build planet-scale applications, and it is one of the fastest-growing services on Azure.
Microsoft is today announcing the availability of a Spark connector for DocumentDB. The new connector enables real-time data science and exploration. It takes advantage of DocumentDB’s native managed indexes and supports updateable columns, push-down predicate filtering, and advanced analytics for data science against fast-changing, globally distributed data. Microsoft is also announcing today the general availability of DocumentDB’s API for MongoDB, which allows apps built on MongoDB to seamlessly target DocumentDB while continuing to use their MongoDB client drivers and toolchain.
Microsoft is also announcing that the latest Hortonworks Data Platform (HDP) 2.6 will be continuously available on HDInsight, even before its on-premises release. “At Hortonworks we have seen more and more Hadoop-related workloads and applications move to the cloud. Starting in HDP 2.6, we are adopting a ‘Cloud First’ strategy in which our platform will be available on our cloud platforms – Azure HDInsight at the same time or even before it is available on traditional on-premises settings. With this in mind, we are very excited that Microsoft and Hortonworks will empower Azure HDInsight customers to be the first to benefit from our HDP 2.6 innovation in the near future,” said Arun Murthy, co-founder, Hortonworks.
Microsoft is also updating Apache Spark for Azure HDInsight with new capabilities for real-time streaming solutions: Spark integration with Azure Event Hubs, and support for the structured streaming connector for Kafka on HDInsight.