The Ultimate Guide to Bivariate Analysis with Python

This article will review some of the critical techniques used in Exploratory Data Analysis, specifically for Bivariate Analysis. We will review some of the essential concepts, understand some of the math behind correlation coefficients and provide sufficient examples in Python for a well-rounded, comprehensive understanding. What is Bivariate Analysis? Exploratory Data Analysis, or EDA, is … Read more
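To make the mention of correlation coefficients concrete, here is a minimal sketch of the Pearson coefficient in plain Python. The function name `pearson_r` and the sample data (`hours`, `scores`) are hypothetical and not taken from the article itself.

```python
import math

def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    # Sum of products of deviations (covariance numerator)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    # Sums of squared deviations for each variable
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)

# Hypothetical sample data: hours studied vs. test score
hours = [1, 2, 3, 4, 5]
scores = [52, 58, 61, 70, 74]
r = pearson_r(hours, scores)
print(round(r, 3))
```

A value near 1 suggests a strong positive linear relationship, and a value near -1 a strong negative one.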

How to Filter Data from an Excel File in Python with Pandas

Problem Formulation and Solution Overview: This article will show different ways to read and filter an Excel file in Python. To make it more interesting, we have the following scenario: Sven is a Senior Coder at K-Paddles. K-Paddles manufactures Kayak Paddles made of Kevlar for the White Water Rafting Community. Sven has been asked to … Read more

How to Remove Text Within Parentheses in a Python String?

Problem Formulation and Solution Overview: This article will show you how to remove text within parentheses in Python. To make it more interesting, we have the following running scenario: Rivers Clothing has a CSV file containing all their employees. The format is currently first name (middle name) and last name (for example, Martin (Robert) Simpson). … Read more
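One possible approach to the parentheses problem, sketched with the standard `re` module. The helper name `strip_parenthesized` is hypothetical, and the pattern assumes the parentheses are not nested.

```python
import re

def strip_parenthesized(text):
    """Remove each '(...)' group and any whitespace directly before it."""
    # \s*       optional whitespace before the opening parenthesis
    # \([^)]*\) an opening parenthesis, any non-')' characters, a closing one
    return re.sub(r"\s*\([^)]*\)", "", text)

name = "Martin (Robert) Simpson"
print(strip_parenthesized(name))  # Martin Simpson
```

Nested parentheses would need a different strategy, such as applying the substitution repeatedly or using a small parser.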

Python Time Series Forecast – A Guided Example on Bitcoin Price Data

A Time Series is essentially tabular data with the special feature of having a time index. The common forecast task is ‘knowing the past (and sometimes the present), predict the future’. This task, taken as a principle, reveals itself in several ways: in how to interpret your problem, in feature engineering, and in which … Read more

Python – Finding the Most Common Element in a Column

Problem Formulation and Solution Overview: This article will show you how to find the most common element in a Pandas Column. To make it more interesting, we have the following running scenario: You have been provided with a downloadable CSV file containing crime statistics for the San Diego area, including their respective NCIC Crime Codes. … Read more
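As a rough illustration of finding the most common value in a Pandas column, the sketch below uses a small in-memory DataFrame instead of the CSV file from the article. The column name `crime_code` and its values are made up for the example.

```python
import pandas as pd

# Hypothetical stand-in for the crime statistics file
df = pd.DataFrame({"crime_code": ["A", "B", "A", "C", "A", "B"]})

# value_counts() tallies each distinct value, sorted by frequency
counts = df["crime_code"].value_counts()

# idxmax() returns the label with the highest count
most_common = counts.idxmax()

# mode() returns every value tied for most frequent, as a Series
modes = df["crime_code"].mode().tolist()

print(most_common, modes)
```

`mode()` is the safer choice when ties matter, since `idxmax()` reports only one label.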

Python Convert Parquet to CSV

Problem 💬 Challenge: How to convert a Parquet file ‘my_file.parquet’ to a CSV file ‘my_file.csv’ in Python? In case you don’t know what a Parquet file is, here’s the definition: 💡 Info: Apache Parquet is an open-source, column-oriented data file format designed for efficient data storage and retrieval using data compression and encoding schemes to … Read more