You want to understand Numpy’s arange function once and for all?

Start with level 1 and pass all 6 levels!

[Reading time: 3-7 minutes]

(Or watch the video.)

Level 1: numpy.arange

The numpy arange function (no it’s not numpy arrange) creates a numpy array with evenly spaced numbers within a fixed interval.

In this article, I lead you step-by-step through all the different use cases of the numpy arange function. The article ends with a short FAQ – Feel free to add your own questions as comments!

Level 2: numpy.arange(stop)

Let’s start with an example of the basic usage of numpy arange where you only specify the stop argument.

Say, you want to get seven subsequent numbers for seven weekdays (0: Monday, 1: Tuesday, 2: Wednesday, 3: Thursday, 4: Friday, 5: Saturday, 6: Sunday). Your numpy array starts with the index 0 and ends in the index 6.

Similar to many other sequence operations, numpy arange implicitly starts with the index 0. If you only specify one single argument for numpy arange, you implicitly set the start argument to 0.

Here is an example of the numpy arange function with a single argument:

np.arange(7)
# np.array([0, 1, 2, 3, 4, 5, 6])

The result is a numpy array with ten elements starting from the implicitly chosen index 0 (inclusive) and ending in the explicitly chosen index 7 (exclusive).

But what if we want to define a start index? Say, you want to take off Mondays, i.e., skip the start index 0, because you hate being anxious on Sunday evenings (get yourself a new job, man!). Ok, it’s a lame example but you know what I mean?

Level 3: numpy.arange(start, stop)

Nothing simpler than that. If you add a second argument, the Python interpreter interprets (*lol*) the first one as the start index and the second one as the end index.

Here is an example of the numpy arange function with two arguments:

np.arange(1, 7)
# np.array([1, 2, 3, 4, 5, 6])

Ok, we mastered the two argument version as well. Let’s move on to the next level.

Level 4: np.arange(start, stop, step)

As a next step, you decide that you want to start your week on Tuesdays, as previously. But as work is so exhausting, you take every other day off as well (luckily, you do have a solid salary as a programmer, don’t you?). It’s getting interesting: add a third argument–that is the step size 2. Here is an example:

np.arange(1, 7, 2)
# np.array([1, 3, 5])

Well done, you only work on Tuesdays, Thursdays, and Saturdays now! All programmers are lazy, you are lazy, so you are a programmer (find the bug).

Level 5: np.arange(start, stop, step, dtype)

As you put your working days [1, 3, 5] into your company’s tracking system, it complains about the wrong formatting of your input numpy array. It expects all numpy array values to be formatted as a float rather than an integer value. “What a design flaw for weekday data!”.

But it’s a simple fix using the fourth argument of the numpy arange function: the data type of the output numpy array, i.e., dtype. As dtype argument, you can use two kinds of data types. First, use traditional language-specific data types such as float and integer. Second, useĀ  numpy-specific data types such as np.int16 or np.float32.

For instance, the numpy-specific data types np.int16 or np.float32 allow for an integer value with 16 bits (=2 bytes) or a float value with 32 bits (=4 bytes). Keep in mind that more bits lead to higher overhead but more powerful ranges of numbers (or precision in the case of float data types).

Here is a collection of dtypes you can use (check out this excellent post if you need more information about numpy dtypes):

  • bool: The default boolean data type in Python (1 Byte).
  • int: The default Python integer data type in Python (4 or 8 Bytes).
  • float: The default float data type in Python (8 Bytes).
  • complex: The default complex data type in Python (16 Bytes).
  • np.int8: Integer (1 Byte).
  • np.int16: Integer (2 Bytes).
  • np.int32: Integer (4 Bytes).
  • np.int64: Integer (8 Bytes).
  • np.float16: Float (2 Bytes).
  • np.float32: Float (4 Bytes).
  • np.float64: Float (8 Bytes).

Here is an example for the 4-argument version of np.arange():

np.arange(1, 7, 2, dtype=np.float32)
# np.array([1., 3., 5.])

Examples

Now, you have understood all the four different uses of the numpy arange function. To wrap things up, make sure to skim over this more comprehensive list of examples:

print(np.arange(10))
# np.array([0, 1, 2, 3, 4, 5, 6, 7, 8, 9])

print(np.arange(2, 10, 3))
# np.array([2, 5, 8])

print(np.arange(1.5, 10., 1.5))
# np.array([1.5, 3.,  4.5, 6.,  7.5, 9. ])

print(np.arange(10, 1, -2))
# np.array([10,  8,  6,  4,  2])

print(np.arange(9.7, 9.2, -0.1))
# np.array([9.7, 9.6, 9.5, 9.4, 9.3])

print(np.arange(1, 1, 1))
# np.array([])

print(np.arange(1, 5, dtype=np.int16))
# np.array([1, 2, 3, 4])

print(np.arange(1, 5, dtype=np.float32))
# np.array([1., 2., 3., 4.])

Congratulations! You have just mastered one basic piece of your Python numpy education. Now test your understanding of numpy arange by solving this Python puzzle:

Level 6: Solve this Numpy Arange Puzzle

np.arange(7)
# np.array([0, 1, 2, 3, 4, 5, 6])

What is the output of this puzzle?

Hints: Read over this short summary of what you have already learned.

Numpy is a popular Python library for data science focusing on linear algebra.

This puzzle is about the numpy arange function.

You might know the Python built-in range(x,y,z) function that creates a sequence of linear progressing values. The sequence starts from x, increases the values linearly by y, and ends if the value becomes larger than z.

The arange(x,y,z) function is similar but creates a numpy array and works with float numbers as well.

Note that a common mistake in this puzzle is to not account for the first value of the array: 0.


Are you a master coder?
Test your skills now!

And the solution is…

9

Frequently Asked Questions

When should I use np.arange() and when np.linspace()?

Use np.arange() if you want to create integer sequences with evenly distributed integer values within a fixed interval.

Use np.linspace() if you have a non-integer step size. The numpy linspace function handles the endpoints better. This prevents you from introducing unnecessary bugs into your code. One such bug is for example that you assume an endpoint is not in the numpy array while it actually is because of the floating point arithmetic.

What are possible use cases of np.arange()?

Numpy is mostly about multi-dimensional matrices. It is common to create a 1D numpy array with the numpy arange function and to transform it immediately into a 2D numpy array using the np.reshape() function. An example is the following where we create a 2D numpy array with three rows and two columns from a 1D numpy array.

np.arange(6).reshape((3, 2))
# array([[0, 1],
#       [2, 3],
#       [4, 5]])

If you have any question, please comment below.

Little time to learn Python? Checkout my book “Coffee Break Python” to solve Python puzzles during your coffee break (click the cover for more information):

Coffee Break Python