Big Data

Big Data, Data

What is Data Lineage?

Your business decisions are driven by data. But are you sure you can trust your data? In this video, Scott Buckles explains the importance of understanding and tracking the lineage of data, drawing parallels to the trust placed in the food supply chain. With automated data lineage tools, you can get real-time insight into data […]

Read More
Big Data

Ray Summit 2023: Day 1 Keynote

Ray is the most popular open source framework for scaling and productionizing AI workloads. From Generative AI and LLMs to computer vision, Ray powers the world’s most ambitious AI workloads.

Read More
Big Data, Data, Data Warehouse

Creating your first Data Warehouse in Microsoft Fabric

This video is from Guy in a Cube. It’s time to show the Data Warehouse within Microsoft Fabric some love! Patrick walks you through how you can get started with your first Data Warehouse using Data Pipelines.
What is data warehousing in Microsoft Fabric? https://learn.microsoft.com/fabric/data-warehouse/data-warehousing
Microsoft Fabric decision guide: data warehouse or lakehouse https://learn.microsoft.com/fabric/get-started/decision-guide-warehouse-lakehouse?toc=%2Ffabric%2Fdata-warehouse%2Ftoc.json&bc=%2Ffabric%2Fdata-warehouse%2Ftoc.json
Data […]

Read More
Azure, Big Data, Data, Data Warehouse, Microsoft

Your data is in the Lakehouse, but now what? | Microsoft Fabric (Public Preview)

This video is from Guy in a Cube. You’ve got your data into OneLake and a Lakehouse, but now what? What can you do with that data after you’ve landed it in Microsoft Fabric? Justyna walks us through different areas where you can leverage your data throughout Fabric, from data warehouses to Power BI!

Read More
AI, Big Data, Data, Data Science

How to Read Giant Datasets Fast – 3 Tips For Better Data Science Skills

This video is from Python Simplified. We’ve learned how to work with data. But what about massive amounts of data, as in files with millions of rows, tens of gigabytes in size, and ages of staring at your computer waiting for everything to load? Luckily, in this tutorial, I will show you how to […]

Read More
Big Data, Data

Prepare for the DP-300 exam & the Azure Database Administrator Associate cert | Data Exposed

This video provides an overview of the DP-300 exam and its significance for database professionals. It delves into the key topics covered in the exam, exploring their relevance to real-world scenarios. Additionally, it highlights the most recent changes in the DP-300 exam to keep you up to date. The presentation concludes with practical tips and […]

Read More
Big Data, Data, Livestream, Red Hat

Data Office Hours: Trino, S3 Select, & CEPH

Dive into Trino, S3 Select, and Ceph in this episode of Data Office Hours. Discover how to boost performance and cut costs by fetching ONLY what you need from S3 objects. Don’t miss these game-changing tips! 🚀 #DataOptimization

Read More
Big Data, Data, Open Source

Starburst Product Demo

This video from Great Data Minds explores Starburst Enterprise, which is based on open source Trino (formerly PrestoSQL) and is billed as the fastest SQL-based MPP query engine.

Read More
AI, Big Data, Data

The Importance of Data Pipelines in AI and Data Science: An Overview

Data is the lifeblood of Artificial Intelligence (AI) and Data Science. It drives insights, powers decisions, and propels innovations. To unlock its full potential, data must be correctly handled, and this is where data pipelines come into play. What are Data Pipelines? Data pipelines are a series of data processing steps where data is ingested […]

Read More
Big Data, Spark

Scaling and Unifying SciKit Learn and Spark Pipelines using Ray

Pipelines have become ubiquitous as stringing multiple functions together to compose applications has gained adoption and popularity. Common pipeline abstractions such as “fit” and “transform” are even shared across divergent platforms such as Python Scikit-Learn and Apache Spark. Scaling pipelines at the level of simple functions is desirable for many AI applications, however […]

Read More