## Matplotlib Color Palette

In this article, we’ll learn how to generate the Matplotlib color palette and then we will use it to select a specific color for our plot. Problem and Solution Overview Problem: When presenting data, the color that you assign to a plot is very important; a bad color choice can make your data difficult to … Read more

## How to Calculate z-scores in Python?

The z-scores can be used to compare data with different measurements and for normalization of data for machine learning algorithms and comparisons. π‘ Note: There are different methods to calculate the z-score. The quickest and easiest one is: scipy.stats.zscore(). What is the z-score? The z-score is used for normalization or standardization to make differently scaled … Read more

## Quant Developer — Income and Opportunity

Before we learn about the money, let’s get this question out of the way: What is a Quant Developer? A quantitative developer (i.e., Quant) is a financial programmer focused on financial modeling and quantitative finance and trading. Quants use their profound knowledge of statistics and math, finance, data structures, algorithms, machine learning, scientific computing, data … Read more

## np.zeros() — A Simple Illustrated Guide

In Python, the numpy.zeros() function returns a new array of given shape and type, filled with zeros.Β  Here is the parameter table of numpy.zeros(). If it sounds great to you, please continue reading, and you will fully understand the numpy.zeros() function through Python code snippets and vivid visualization. Concretely, I will introduce its syntax and … Read more

## How to Scrape Google Search Results?

Problem Formulation π¬ Given a text query/keyword such as “History of Chess”. How to scrape the top Google results for that search query (=keyword) in Python? Disclaimer: Have a look at the important question: Is Web Scraping Legal? Method Summary You can get the top Google search results given a certain keyword string by installing … Read more

## Tableau Developer — Income and Opportunity

Before we learn about the money, let’s get this question out of the way: What is Tableau? Let’s have a look at the definition inspired by the official Tableau website: Tableau is a visual data analytics platform focused on the business analytics use case that helps you use data to solve problems. It is great … Read more

## The Ultimate Guide to Data Cleaning in Python and Pandas

What is Data Cleaning? Data cleaning describes the process of turning messy data into clean datasets that can be used for research and data science purposes. For example, tidy data will be in a wide format: every column contains a variable, and every row contains one case. Also, data cleaning means getting rid of corrupt … Read more

## 4 Best Ways to Count Duplicates in a DataFrame

Problem Formulation and Solution Overview This article will show you how to count duplicates in a Pandas DataFrame in Python. To make it more fun, we have the following running scenario: Rivers Clothing has a CSV containing all its employees. However, their CSV file has more rows than employees. This is a definite problem! They … Read more

## Chatbot Developer — Income and Opportunity

Before we learn about the money, let’s get this question out of the way: What Is a Chatbot Developer? A chatbot developer creates software to automate communication with customers and users. An example chatbot application is in customer service for an eCommerce website. Chatbot developers use machine learning and artificial intelligence techniques to communicate with … Read more

## Pandas DataFrame to_coo() Method

Preparation Before any data manipulation can occur, four (4) new libraries will require installation. The Pandas library enables access to/from a DataFrame. The NumPy library supports multi-dimensional arrays and matrices in addition to a collection of mathematical functions. The pandas_gbq allows access to Google Big Query (GBQ) The google.auth authentication. To install these libraries, navigate … Read more

## Pandas DataFrame to_gbq() Method

This article focuses on the serialization and conversion methods of a Python DataFrame: to_gbq(), to_coo(). Let’s get started! Preparation Before any data manipulation can occur, four (4) new libraries will require installation. The Pandas library enables access to/from a DataFrame. The NumPy library supports multi-dimensional arrays and matrices in addition to a collection of mathematical … Read more

## Pandas DataFrame to_string() Method

Preparation Before any data manipulation can occur, three (3) new libraries will require installation. The Pandas library enables access to/from a DataFrame. The Pyarrow library allows writing/reading access to/from a parquet file. The Openpyxl library allows styling/writing/reading to/from an Excel file. To install these libraries, navigate to an IDE terminal. At the command prompt (\$), … Read more