How to Install pyarrow in Python?

pip install pyarrow

The Python pyarrow library is among the top 100 Python libraries, with more than 30,549,707 downloads. This article will show you everything you need to get this installed in your Python environment.

Alternatively, you may use any of the following commands to install pyarrow, depending on your concrete environment. One is likely to work!

πŸ’‘ If you have only one version of Python installed:
pip install pyarrow

πŸ’‘ If you have Python 3 (and, possibly, other versions) installed:
pip3 install pyarrow

πŸ’‘ If you don't have PIP or it doesn't work
python -m pip install pyarrow
python3 -m pip install pyarrow

πŸ’‘ If you have Linux and you need to fix permissions (any one):
sudo pip3 install pyarrow
pip3 install pyarrow --user

πŸ’‘ If you have Linux with apt
sudo apt install pyarrow

πŸ’‘ If you have Windows and you have set up the py alias
py -m pip install pyarrow

πŸ’‘ If you have Anaconda
conda install -c anaconda pyarrow

πŸ’‘ If you have Jupyter Notebook
!pip install pyarrow
!pip3 install pyarrow

How to Install pyarrow on Windows?

  1. Type "cmd" in the search bar and hit Enter to open the command line.
  2. Type “pip install pyarrow” (without quotes) in the command line and hit Enter again. This installs pyarrow for your default Python installation.
  3. The previous command may not work if you have both Python versions 2 and 3 on your computer. In this case, try "pip3 install pyarrow" or “python -m pip install pyarrow“.
  4. Wait for the installation to terminate successfully. It is now installed on your Windows machine.

Here’s how to open the command line on a (German) Windows machine:

Open CMD in Windows

First, try the following command to install pyarrow on your system:

pip install pyarrow

Second, if this leads to an error message, try this command to install pyarrow on your system:

pip3 install pyarrow

Third, if both do not work, use the following long-form command:

python -m pip install pyarrow

The difference between pip and pip3 is that pip3 is an updated version of pip for Python version 3. Depending on what’s first in the PATH variable, pip will refer to your Python 2 or Python 3 installation—and you cannot know which without checking the environment variables. To resolve this uncertainty, you can use pip3, which will always refer to your default Python 3 installation.

How to Install pyarrow on Linux?

You can install pyarrow on Linux in four steps:

  1. Open your Linux terminal or shell
  2. Type “pip install pyarrow” (without quotes), hit Enter.
  3. If it doesn’t work, try "pip3 install pyarrow" or “python -m pip install pyarrow“.
  4. Wait for the installation to terminate successfully.

The package is now installed on your Linux operating system.

How to Install pyarrow on macOS?

Similarly, you can install pyarrow on macOS in four steps:

  1. Open your macOS terminal.
  2. Type “pip install pyarrow” without quotes and hit Enter.
  3. If it doesn’t work, try "pip3 install pyarrow" or “python -m pip install pyarrow“.
  4. Wait for the installation to terminate successfully.

The package is now installed on your macOS.

How to Install pyarrow in PyCharm?

Given a PyCharm project. How to install the pyarrow library in your project within a virtual environment or globally? Here’s a solution that always works:

  • Open File > Settings > Project from the PyCharm menu.
  • Select your current project.
  • Click the Python Interpreter tab within your project tab.
  • Click the small + symbol to add a new library to the project.
  • Now type in the library to be installed, in your example "pyarrow" without quotes, and click Install Package.
  • Wait for the installation to terminate and close all pop-ups.

Here’s the general package installation process as a short animated videoβ€”it works analogously for pyarrow if you type in “pyarrow” in the search field instead:

Make sure to select only “pyarrow” because there may be other packages that are not required but also contain the same term (false positives):

How to Install pyarrow in a Jupyter Notebook?

To install any package in a Jupyter notebook, you can prefix the !pip install my_package statement with the exclamation mark "!". This works for the pyarrow library too:

!pip install my_package

This automatically installs the pyarrow library when the cell is first executed.

How to Resolve ModuleNotFoundError: No module named ‘pyarrow’?

Say you try to import the pyarrow package into your Python script without installing it first:

import pyarrow
# ... ModuleNotFoundError: No module named 'pyarrow'

Because you haven’t installed the package, Python raises a ModuleNotFoundError: No module named 'pyarrow'.

To fix the error, install the pyarrow library using “pip install pyarrow” or “pip3 install pyarrow” in your operating system’s shell or terminal first.

See above for the different ways to install pyarrow in your environment.

Improve Your Python Skills

If you want to keep improving your Python skills and learn about new and exciting technologies such as Blockchain development, machine learning, and data science, check out the Finxter free email academy with cheat sheets, regular tutorials, and programming puzzles.

Join us, it’s fun! πŸ™‚