.NET for Apache Spark hits v1.0 

2 min read

About two years ago, we heard an increasing demand from the .NET community for an easier way to build big data applications with .NET, outside of needing to learn Scala or Python. Thus, in a collaboration between Azure Data and .NET teams, we started the .NET for Apache® Spark™ open source project. Today, we are Read more

How to process streams of data with Apache Kafka and Spark 

23 min read

Data is produced every second, it comes from millions of sources and is constantly growing. Have you ever thought how much data you personally are generating every day? Data: direct result of our actions There’s data generated as a direct result of our actions and activities: Browsing twitter Using mobile apps Performing financial transactions Using Read more

3 Comments

Azure HDInsight 3.6, now generally available 

1 min read

This week at the DataWorks Summit, Microsoft announced the general availability of Azure HDInsight 3.6, backed by Microsoft’s enterprise-grade SLA. HDInsight 3.6 brings updates to various open source components in the Apache Hadoop and Spark ecosystem to the cloud, allowing customers to deploy them easily and run them reliably on an enterprise-grade platform. “HDInsight 3.6 Read more

Updates to HDInsight tools for IntelliJ and Eclipse now available 

1 min read

HDInsight is Microsoft Azure’s managed Hadoop-as-a-service. It is the only fully-managed cloud Hadoop offering that provides optimized open source analytic clusters for Spark, Hive, MapReduce, HBase, Storm, Kafka, and R Server – all backed by a 99.9% SLA. Each of these big data technologies and ISV applications are easily deployable as managed clusters with enterprise-level Read more

Zoom on the InfoQ week, featuring BizSpark Plus startups! 

1 min read

Four BizSpark Plus startups were interviewed by IT publication InfoQ France, following their participation at Microsoft Experiences conference last October. These startups have chosen Microsoft Azure for their technical architecture and to support their open source technologies. Discover their stories and their technological choices in the following articles (note: links are in French): Realytics.io: Hadoop, Read more

Now available in preview: R Server inside Azure HDInsight 

1 min read

This week at Strata + Hadoop World, Microsoft announced the availability of R Server inside Azure HDInsight, Microsoft’s managed Hadoop-as-a-service part of Azure Data Lake. R is a very popular programming language that helps millions of data scientists solve their most challenging problems in fields ranging from computational biology to quantitative marketing. R Server for Read more

SocialDice: Revolutionizing talent recruitment in the the Middle East 

2 min read

For rapidly-growing, Middle Eastern small- to medium-sized businesses, hiring the best employees is a challenge. Both busy employers and ambitious talent find the existing job boards slow and cumbersome. In response, start-up SocialDice has made the process easy, intelligent, and fun by aggregating regional job boards in one place and using a proprietary smart algorithm Read more