15 Free YouTube subscribers for your channel
Get Free YouTube Subscribers, Views and Likes

Data Engineering Course For Beginners - #2 TRANSFORM

Follow
Karolina Sowinska

***
My FREE guide on how to become a data engineer:
https://karolinasowinska.notion.site/...
***

This is the second part of the Free Data Engineering Course for Beginners that I've decided to create for you! Over the course of the four videos, we are going to cover the entire ETL process (extract, transform, load), and at the end we are also going to talk about job scheduling.

In this course you will build your first data feed (or data pipeline) using Spotify API. This feed will run daily, and it will download the data about the songs that you listened to during a day, and save that data in a SQLite database on your local machine.

In this video we are going to cover the Transform stage of the ETL process, which means that we will be learning how to validate the data that we received from a data vendor (Spotify in this case). We'll check for empty files, null values, stale data and duplicates! Along the way I will also explain some basic data engineering concepts such as a primary key constraint, or "garbage in, garbage out" principle.

Follow this link to generate your Spotify API token:
https://developer.spotify.com/console...

Find the code with this data engineering project on GitHub:
https://github.com/karolinasowinska/...



If you'd like to learn data engineering, I recommend following the 4 simple steps below to land you the first job interview:

1. Learn Python
I recommend following the Python for Everybody specialization course on Coursera, which is one of the most popular courses there:
http://imp.i384100.net/x9gVO3

2. Learn SQL
SQL is still the lingua franca of data. I recommend going with Learn SQL Basics for Data Science course, because it contains some chapters which are very releavant to data engineering in partcular, e.g. distributed computing with Spark
http://imp.i384100.net/QOMZ09

3. Learn Bash scripting/Linux
I wouldn't take a full course on it, but at least read a good article.
if you do prefer to take a course/guided project, I think this one is short and good:
https://www.coursera.org/projects/com...

4. Learn how to develop on the cloud, e.g. on AWS
There are a few good courses around there, but I think the Coursera one is the most comprehensive
imp.i384100.net/P0MJBM

Discloure: The above contain affiliate links, meaning when you click the links and make a purchase, I receive a commission!

Discloure: The above contain affiliate links, meaning when you click the links and make a purchase, I receive a commission.



Music:
What Now Golden Age Radio

Connect with me on Instagram:
@karo_sowinska

And if you want to make my day with a cup of coffee... :)
https://kofi.com/karolina_sowinska

posted by sjemenitiah