## 4 Best Ways to Count Duplicates in a DataFrame

Problem Formulation and Solution Overview This article will show you how to count duplicates in a Pandas DataFrame in Python. To make it more fun, we have the following running scenario: Rivers Clothing has a CSV containing all its employees. However, their CSV file has more rows than employees. This is a definite problem! They … Read more

## How to Add a Column to a CSV

Problem Formulation and Solution Overview In this article, you’ll learn how to add a new column to a CSV file in Python. To make it more fun, we have the following running scenario: The owner of the Finxter Academy has asked you to add a new column to their existing CSV file called Total_Chrgs. 💬 … Read more

## How to Suppress Scientific Notation in Python

Summary: Use the string literal syntax f”{number:.nf}” to suppress the scientific notation of a number to its floating-point representation. Problem Formulation: How will you suppress a number represented in scientific notation by default to a floating-point value? Note: Generally, Python represents huge floating-point numbers or very small floating-point numbers in their scientific form. Scientific notation … Read more

## How to Read Specific Columns from CSV File in Python

Problem: Given a CSV file, how to read only specific column(s) from the csv file? (Reading a specific column from a csv file will yield all the row values pertaining to that column.) Example: Consier the following csv file (countries.csv): Question: How will you read the above csv file and display the following columns – … Read more

## Pandas DataFrame to_coo() Method

Preparation Before any data manipulation can occur, four (4) new libraries will require installation. The Pandas library enables access to/from a DataFrame. The NumPy library supports multi-dimensional arrays and matrices in addition to a collection of mathematical functions. The pandas_gbq allows access to Google Big Query (GBQ) The google.auth authentication. To install these libraries, navigate … Read more

## Pandas DataFrame to_gbq() Method

This article focuses on the serialization and conversion methods of a Python DataFrame: to_gbq(), to_coo(). Let’s get started! Preparation Before any data manipulation can occur, four (4) new libraries will require installation. The Pandas library enables access to/from a DataFrame. The NumPy library supports multi-dimensional arrays and matrices in addition to a collection of mathematical … Read more

## Pandas DataFrame to_string() Method

Preparation Before any data manipulation can occur, three (3) new libraries will require installation. The Pandas library enables access to/from a DataFrame. The Pyarrow library allows writing/reading access to/from a parquet file. The Openpyxl library allows styling/writing/reading to/from an Excel file. To install these libraries, navigate to an IDE terminal. At the command prompt (\$), … Read more

## Pandas DataFrame to_feather() Method

Preparation Before any data manipulation can occur, three (3) new libraries will require installation. The Pandas library enables access to/from a DataFrame. The Pyarrow library allows writing/reading access to/from a parquet file. The Openpyxl library allows styling/writing/reading to/from an Excel file. To install these libraries, navigate to an IDE terminal. At the command prompt (\$), … Read more

## Pandas DataFrame to_parquet() Method

Preparation Before any data manipulation can occur, three (3) new libraries will require installation. The Pandas library enables access to/from a DataFrame. The Pyarrow library allows writing/reading access to/from a parquet file. The Openpyxl library allows styling/writing/reading to/from an Excel file. To install these libraries, navigate to an IDE terminal. At the command prompt (\$), … Read more

## Pandas DataFrame to_xml() Method

Preparation Before any data manipulation can occur, four (4) new libraries will require installation. The Pandas library enables access to/from a DataFrame. The Tabulate library enables formatted output. The Tables library allows formatted output (table format). The lxml library enables writing to an XML file. To install these libraries, navigate to an IDE terminal. At … Read more

## Pandas DataFrame to_latex() Method

Preparation Before any data manipulation can occur, four (4) new libraries will require installation. The Pandas library enables access to/from a DataFrame. The Tabulate library enables formatted output. The Tables library allows formatted output (table format). The lxml library enables writing to an XML file. To install these libraries, navigate to an IDE terminal. At … Read more

## Pandas DataFrame to_hdf() Method

Preparation Before any data manipulation can occur, four (4) new libraries will require installation. The Pandas library enables access to/from a DataFrame. The Tabulate library enables formatted output. The Tables library allows formatted output (table format). The lxml library enables writing to an XML file. To install these libraries, navigate to an IDE terminal. At … Read more