Description
Data Engineering Essentials using SQL, Python, and PySpark is a project-based training course on data engineering with SQL, Python, and the PySpark framework, published by Udemy. By the end of the course, you will be able to build data pipelines for processing and storing data with the techniques and technologies taught, and to carry out data engineering and data analysis projects on your own. Data engineering is the process of filtering, storing, and processing data according to the needs and goals of a specific project or piece of research. It is a broad field that includes many sub-disciplines. The course begins with the basics of two languages, Python and SQL; after working through the exercises in that section, you move on to the more advanced topics.
What you will learn in the Data Engineering Essentials using SQL, Python, and PySpark course:
- Building a data pipeline with SQL
- Postgres database management system
- Installing and setting up the database and performing basic operations on data, such as inserting, updating, and deleting
- Writing simple SQL queries
- Filtering, joining, and aggregating data with SQL
- Creating tables and indexes in the database with DDL commands
- Partitioning and organizing data in the database
- Predefined functions in SQL, such as those for manipulating string values and…
- Writing complex, specialized SQL queries with PostgreSQL
- Principles of Python programming
- Performing simple database operations with the Python programming language
- Conditional statements and loops in Python
- Lists and sets in Python
- Data types in Python programming
- Map and Reduce libraries in Python
- Pandas library
- Installing and setting up the development environment for data engineering applications
- Spark DataFrame APIs such as select, filter, groupBy, orderBy, and…
- Using different file formats such as Parquet, JSON, CSV, and… to build data pipelines
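As a rough illustration of how several of the topics above fit together, here is a minimal sketch of a data pipeline in plain Python. It is not taken from the course material. The sample data, column names, and the function names extract, transform, and load are all hypothetical. It reads CSV text, keeps completed orders, totals the amount per customer with map, filter, and reduce, and writes the result as JSON.

```python
import csv
import io
import json
from functools import reduce

# Hypothetical sample data; a real pipeline would read from a file or a database.
RAW_CSV = """order_id,customer,status,amount
1,alice,COMPLETE,120.50
2,bob,PENDING,75.00
3,alice,COMPLETE,30.25
4,carol,COMPLETE,210.00
"""


def extract(text):
    """Parse CSV text into a list of dicts, one per row."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """Keep completed orders and total the amount per customer."""
    completed = filter(lambda r: r["status"] == "COMPLETE", rows)
    pairs = map(lambda r: (r["customer"], float(r["amount"])), completed)

    def add_to_totals(totals, pair):
        # Accumulate each (customer, amount) pair into a running dict.
        customer, amount = pair
        totals[customer] = totals.get(customer, 0.0) + amount
        return totals

    return reduce(add_to_totals, pairs, {})


def load(totals):
    """Serialize per-customer totals as JSON, sorted by customer name."""
    records = [
        {"customer": customer, "revenue": round(total, 2)}
        for customer, total in sorted(totals.items())
    ]
    return json.dumps(records)


if __name__ == "__main__":
    print(load(transform(extract(RAW_CSV))))
```

The same filter-then-aggregate shape corresponds to a WHERE plus GROUP BY query in SQL, or to filter and groupBy on a Spark DataFrame.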
Course details
Publisher: Udemy
Instructor: Durga Viswanatha Raju Gadiraju
Language: English
Education level: Intermediate
Number of lessons: 624
Training duration: 56 hours
Prerequisites for the Data Engineering Essentials using SQL, Python, and PySpark course
Laptop with decent configuration (Minimum 4 GB RAM and Dual Core)
Sign up for GCP with the available credit or AWS Access
Set up a self-support lab on cloud platforms (you might have to pay the applicable cloud fees unless you have credit)
CS or IT degree or prior IT experience is highly desired
Installation guide
After extracting the files, watch them with your favorite player.
Subtitles: English
Quality: 720p
Previous title:
Data Engineering Essentials Hands-on – SQL, Python and Spark
Changes:
Compared to the 2021/11 version, the 2023/7 version has 14 fewer lessons and is 1 hour and 22 minutes shorter.
Download link
Password file(s): www.downloadly.ir
Size
15.4 GB