Data Engineering
This certification course is intended for data engineers, data architects, data scientists, and data developers who implement big data engineering workflows on HDInsight. Candidates for the exam should have relevant work experience with big data analytics solutions and be familiar with the features and capabilities of batch, real-time, and interactive data processing.

70-775 Perform Data Engineering on Microsoft HDInsight
Course Content
Administer and Provision HDInsight Clusters
- Deploy HDInsight clusters
- Deploy and secure multi-user HDInsight clusters
- Ingest data for batch and interactive processing
- Configure HDInsight clusters
- Manage and debug HDInsight jobs
Implement Big Data Batch Processing Solutions
- Implement batch solutions with Hive and Apache Pig
- Design batch ETL solutions for big data with Spark
- Operationalize Hadoop and Spark
Implement Big Data Interactive Processing Solutions
- Implement interactive queries for big data with Spark SQL
- Perform exploratory data analysis by using Spark SQL
- Implement interactive queries for big data with Interactive Hive
- Perform exploratory data analysis by using Hive
- Perform interactive processing by using Apache Phoenix on HBase
Implement Big Data Real-Time Processing Solutions
- Create Spark Streaming applications using the DStream API
- Create Spark Structured Streaming applications
- Develop big data real-time processing solutions with Apache Storm
- Build solutions that use Kafka
- Build solutions that use HBase