ETL

Azure Synapse

Azure Synapse Full Course

Azure Synapse Analytics (ASA) is changing the way we work with data services in Azure. The ASA workspace combines the core technologies required for data warehousing, Big Data Analytics and Data Science. In this Learn with the Nerds event, Mitchell Pearson will teach you how you can use Synapse Analytics to solve the paradox of […]

Read More
Azure Synapse

Data Science and Predictive Analytics with Azure Synapse

Discover new Azure Synapse features to integrate predictive analytics capabilities into your organization—using both code-free and code-first options for AI/ML.

Read More
Databricks

How Databricks Leverages Auto Loader to Ingest Millions of Files an Hour

Continuously and incrementally ingesting data as it arrives in cloud storage has become a common workflow in our customers’ ETL pipelines. However, managing this workflow is rife with challenges, such as scalable and efficient file discovery, schema inference and evolution, and fault tolerance with exactly-once guarantees. Auto Loader is a new Structured Streaming source in […]

Read More
Big Data Data Databricks

Empowering Zillow’s Developers with Self-Service ETL

Databricks  shows how their tech empowers Zillow’s developers via self-service ETL. These tools abstract away the orchestration, deployment, and Apache Spark processing implementation from their respective users. In this talk, Zillow engineers discuss two internal platforms they created to address the specific needs of two distinct user groups: data analysts and data producers. Each platform […]

Read More
Azure Data

The Modern Data Warehouse in Azure – Data Processing

In this video, Chris Seferlis continues discussing the Modern Data Platform in Azure with Part 3: Data Processing. Tools Discusssed: Azure Data Factory Data Flows – https://docs.microsoft.com/en-us/azure/data-factory/concepts-data-flow-overview Azure Databricks – https://azure.microsoft.com/en-us/services/databricks/ Azure HDInsight – https://azure.microsoft.com/en-us/services/hdinsight/ Azure Marketplace – https://azuremarketplace.microsoft.com/en-us/marketplace/

Read More
Azure Big Data Databricks

How to Build a Cloud Data Platform with Databricks Part 2 – ETL Processing

Learn how to use Apache Spark and Delta Lake on Databricks to perform ETL processing, manage late arriving data, and repair corrupted data. Companies look to support both business analytics and machine learning initiatives within their organization, but often face challenges with complex operations, proprietary technologies, and unreliable data.

Read More
Azure Big Data

Introduction to the Modern Data Warehouse in Azure

Chris Seferlis will be publishing on the modern data warehouse in Azure. Here he starts with an overview of the stages of a data warehouse and the implications of ELT vs ETL as we move from sources, ingestion, storage, transformation, staging and presentation. Learn more: https://azure.microsoft.com/en-in/solutions/architecture/modern-data-warehouse/

Read More
Azure SQL Server

Azure Synapse Analytics – Next-gen Azure SQL Data Warehouse

Azure Synapse is a limitless analytics service that brings together enterprise data warehousing and Big Data analytics. It gives you the freedom to query data on your terms, using either serverless on-demand or provisioned resources—at scale. Azure Synapse brings these two worlds together with a unified experience to ingest, prepare, manage, and serve data for […]

Read More
Azure Data Data Science

Kappa vs Lambda Architecture

Chris Seferlis describes some key differences between the Kappa and Lambda Architectures, advantages and disadvantages of each, and why you might choose one over the other on the Azure platform.

Read More