
What is Data Lineage?
Your business decisions are driven by data. But are you sure you can trust your data? In this video, Scott Buckles explains the importance of understanding and tracking the lineage of data, drawing parallels to the trust placed in the food supply chain. With automated data lineage tools, you can get real-time insight into data […]
Read More
Ray Summit 2023: Day 1 Keynote
- Frank
- September 20, 2023
- Ray
Ray is the most popular open source framework for scaling and productionizing AI workloads. From Generative AI and LLMs to computer vision, Ray powers the world’s most ambitious AI workloads.
Read More
Creating your first Data Warehouse in Microsoft Fabric
- Frank
- September 5, 2023
- Data Analytics
- data analytics for beginners
- Data Factory
- data factory pipeline tutorial
- microsoft fabric
- microsoft fabric data factory
- microsoft fabric data warehouse
- microsoft fabric demo
- microsoft fabric synapse
- microsoft fabric synapse data warehouse
- microsoft fabric tutorial
- Synapse
- synapse data pipelines
- synapse data warehouse
- synapse data warehousing
This video is from Guy in a Cube. It’s time to show the Data Warehouse within Microsoft Fabric some love! Patrick walks you through how you can get started with your first Data Warehouse using Data Pipelines. What is data warehousing in Microsoft Fabric? https://learn.microsoft.com/fabric/data-warehouse/data-warehousing Microsoft Fabric decision guide: data warehouse or lakehouse https://learn.microsoft.com/fabric/get-started/decision-guide-warehouse-lakehouse?toc=%2Ffabric%2Fdata-warehouse%2Ftoc.json&bc=%2Ffabric%2Fdata-warehouse%2Ftoc.json Data […]
Read More
Your data is in the Lakehouse, but now what? | Microsoft Fabric (Public Preview)
- Frank
- August 24, 2023
- Business Intelligence
- Data Analyst
- Data Analytics
- Data Science
- fabric
- fabric lakehouse
- Lakehouse
- microsoft fabric
- microsoft fabric data warehouse
- microsoft fabric lakehouse
- microsoft fabric lakehouse shortcuts
- microsoft fabric lakehouse tutorial
- microsoft fabric notebook
- microsoft fabric onelake
- onelake
- onelake shortcuts
- Power BI
This video is from Guy in a Cube. You’ve got your data into OneLake and a Lakehouse, but now what? What can you do with that data after you’ve landed it in Microsoft Fabric? Justyna walks us through different areas where you can leverage your data throughout fabric. From data warehouses to even Power BI!
Read More
How to Read Giant Datasets Fast – 3 Tips For Better Data Science Skills
- Frank
- August 14, 2023
- Big Data
- Tips
This video is from Python Simplified. We’ve learned how to work with data. But how about massive amounts of data? as in – files with millions of rows, tens of gigabytes in size, and ages of staring at your computer waiting for everything to load? Luckily, in this tutorial, I will show you how to […]
Read More
Prepare for the DP-300 exam & the Azure Database Administrator Associate cert | Data Exposed
This video provides an overview of the DP-300 exam and its significance for database professionals. It delves into the key topics covered in the exam, exploring their relevance to real-world scenarios. Additionally, it highlights the most recent changes in the DP-300 exam to keep you up to date. The presentation concludes with practical tips and […]
Read More
Data Office Hours: Trino, S3 Select, & CEPH
Dive into Trino, S3 Select, and Ceph in this episode of Data Office Hours. Discover how to boost performance and cut costs by fetching ONLY what you need from S3 objects. Don’t miss these game-changing tips! 🚀 #DataOptimization
Read More
Starburst Product Demo
- Frank
- July 20, 2023
- Starburst
- Trino
This video is from Great Data Minds explores Starburst Enterprise, based on open source Trino (formerly PrestoSQL) is the fastest SQL-based MPP query engine.
Read More
The Importance of Data Pipelines in AI and Data Science: An Overview
Data is the lifeblood of Artificial Intelligence (AI) and Data Science. It drives insights, powers decisions, and propels innovations. To unlock its full potential, data must be correctly handled, and this is where data pipelines come into play. What are Data Pipelines? Data pipelines are a series of data processing steps where data is ingested […]
Read More
Scaling and Unifying SciKit Learn and Spark Pipelines using Ray
Pipelines have become ubiquitous, as the need for stringing multiple functions to compose applications has gained adoption and popularity. Common pipeline abstractions such as “fit” and “transform” are even shared across divergent platforms such as Python Scikit-Learn and Apache Spark. Scaling pipelines at the level of simple functions is desirable for many AI applications, however […]
Read More