Python Data Wrangling

In this Python Data Wrangling course, you will learn how to use Python to extract and transform data from various sources, including large databases and Excel financial tables. You will also learn why the traditional data-cleaning methods used in other languages are best avoided, and how to take advantage of the specialised functions in NumPy and Pandas instead.

Learning Objectives

  • Extract and parse data from various sources
  • Transform and clean data using NumPy and Pandas
  • Summarise and visualise data with Matplotlib
  • Read HTML, XML, and JSON data from internet resources
  • Search and filter data sets
  • Apply Python tools and techniques to process data sets efficiently
  • Continue learning and face new challenges with after-course one-on-one instructor coaching

Target Audience

This course is for data analysts and data scientists who want to use Python to extract data from various sources and prepare it for machine learning modelling.

Prerequisites

To succeed in this course, you should have a working knowledge of Python basics, including data structures, importing and using modules, creating functions, and using the Jupyter Notebook platform.

Course Content

Module 1: Introduction to Data Structure using Python

  • Python for Data Wrangling
  • Lists, Sets, Strings, Tuples, and Dictionaries

Module 2: Advanced Operations on Built-In Data Structure

  • Advanced Data Structures
  • Basic File Operations in Python

Module 3: Introduction to NumPy, Pandas, and Matplotlib

  • NumPy Arrays
  • Pandas DataFrames
  • Statistics and Visualisation with NumPy and Pandas
  • Using NumPy and Pandas to Calculate Basic Descriptive Statistics on the DataFrame

Module 4: Deep Dive into Data Wrangling with Python

  • Subsetting, Filtering, and Grouping
  • Detecting Outliers and Handling Missing Values
  • Concatenating, Merging, and Joining
  • Useful Methods of Pandas

Module 5: Getting Comfortable with Different Data Sources

  • Reading Data from Different Text-Based (and Non-Text-Based) Sources
  • Introduction to BeautifulSoup4 and Web Page Parsing

Module 6: Learning the Hidden Secrets of Data Wrangling

  • Advanced List Comprehension and the zip Function
  • Data Formatting
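
As a rough illustration of the kind of technique this module names, the sketch below uses zip and list comprehensions to turn raw rows into cleaned records. It is not taken from the course material; the column names and sample rows are invented for the example.

```python
# Hypothetical sample data: the headers and rows are invented for illustration.
headers = ["name", "age", "city"]
rows = [
    ["Ada", "36", "London"],
    ["Linus", "", "Helsinki"],
    ["Grace", "45", "New York"],
]

# zip pairs each header with the value in the same position;
# the list comprehension builds one dictionary per row.
records = [dict(zip(headers, row)) for row in rows]

# Keep only the rows with a non-empty age and convert that age to an integer.
cleaned = [
    {**record, "age": int(record["age"])}
    for record in records
    if record["age"]
]

print(cleaned)
```

The row with the missing age is dropped here for simplicity; filling it with a default value would be an equally reasonable choice.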

Module 7: Advanced Web Scraping and Data Gathering

  • Basics of Web Scraping and BeautifulSoup libraries
  • Reading Data from XML

Module 8: RDBMS and SQL

  • Refresher of RDBMS and SQL
  • Using an RDBMS (MySQL/PostgreSQL/SQLite)
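
For a sense of what working with an RDBMS from Python can look like, here is a minimal sketch using the standard library's sqlite3 module with an in-memory SQLite database. The table name, columns, and values are assumptions made up for this example, not part of the course material.

```python
import sqlite3

# An in-memory SQLite database, so nothing is written to disk.
conn = sqlite3.connect(":memory:")

# Hypothetical table and data, invented for illustration.
conn.execute("CREATE TABLE sales (region TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO sales VALUES (?, ?)",
    [("north", 120.0), ("south", 80.5), ("north", 30.0)],
)

# Aggregate in SQL, then fetch the result as a list of tuples.
totals = conn.execute(
    "SELECT region, SUM(amount) FROM sales GROUP BY region ORDER BY region"
).fetchall()
conn.close()

print(totals)
```

The `?` placeholders let the driver bind the values, which avoids building SQL strings by hand.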

Module 9: Application in Real Life and Conclusion of Course

  • Applying Your Knowledge to a Real-life Data Wrangling Task
  • An Extension to Data Wrangling

This course is delivered through our training partner: Learning Tree