💡 Problem Formulation: In programming, concurrency is the ability of an application to make progress on many things at once, which is vital for tasks that involve waiting, such as I/O operations and network requests, or for heavy computations that can be distributed. In Python, one way to achieve concurrency is with threads. This article explores methods to implement concurrency through threading in Python, aiming to improve performance through concurrent task execution.
Method 1: Using the threading Module
Python’s threading module provides a way to create and manage threads easily. By subclassing the Thread class or by passing a target callable to a Thread object directly, you can execute functions concurrently. It’s the standard method for threading in Python and, because of the Global Interpreter Lock (GIL), is best suited to I/O-bound tasks rather than CPU-bound ones.
Here’s an example:
import threading
import time

def print_numbers():
    for i in range(5):
        time.sleep(1)
        print(i)

thread = threading.Thread(target=print_numbers)
thread.start()
thread.join()
print("Done!")
Output:
0
1
2
3
4
Done!
This code snippet defines a function that prints the numbers 0 through 4 with a one-second pause between each. A thread is created with print_numbers as its target and started; join() then blocks the main thread until the worker completes, after which "Done!" is printed. Until join() is called, the main thread is free to run other work concurrently with print_numbers.
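The description above also mentions subclassing Thread. A minimal sketch of that style might look like the following (the class name NumberPrinter is illustrative, not part of the threading API):

```python
import threading
import time

class NumberPrinter(threading.Thread):
    """Sketch of the subclassing approach: put the thread's work in run()."""

    def __init__(self, count):
        super().__init__()   # must initialize the Thread base class
        self.count = count
        self.seen = []       # record what this thread printed

    def run(self):
        for i in range(self.count):
            time.sleep(0.01)
            print(i)
            self.seen.append(i)

t = NumberPrinter(3)
t.start()
t.join()  # wait for run() to finish
```

Subclassing is handy when a thread needs to carry its own state; for a single function, passing target= as above is usually simpler.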
Method 2: ThreadPoolExecutor from concurrent.futures
The concurrent.futures module provides a high-level interface for asynchronously executing callables. ThreadPoolExecutor is an Executor subclass that uses a pool of threads to execute calls asynchronously. It’s great for running many I/O-bound tasks concurrently.
Here’s an example:
from concurrent.futures import ThreadPoolExecutor

def task(message):
    return f"Task received: {message}"

with ThreadPoolExecutor(max_workers=2) as executor:
    future = executor.submit(task, "Hello from ThreadPoolExecutor")
    return_value = future.result()
    print(return_value)
Output:
Task received: Hello from ThreadPoolExecutor
Here, ThreadPoolExecutor creates a pool of two worker threads. The submit() method schedules the callable task for execution and returns a Future object that encapsulates the pending execution. Calling result() on the Future waits for the callable to finish and returns its result.
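For many similar tasks, executor.map() is often more convenient than repeated submit() calls. A small sketch (the squaring task here is illustrative):

```python
from concurrent.futures import ThreadPoolExecutor

def task(n):
    return n * n

# map() submits one call per input and yields results in input order,
# even if the underlying calls complete out of order.
with ThreadPoolExecutor(max_workers=4) as executor:
    results = list(executor.map(task, range(5)))

print(results)  # squares of 0..4, in input order
```

Because map() preserves input order, it is a good fit when you need the results lined up with the inputs without tracking Future objects yourself.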
Method 3: Using Queues to Manage Workload
Python’s queue module provides a thread-safe FIFO implementation, Queue, that can be used to distribute a workload between multiple threads. By separating workload management from the worker threads themselves, it is an excellent tool for implementing producer-consumer scenarios concurrently.
Here’s an example:
from queue import Queue
from threading import Thread

def worker(q):
    while True:
        item = q.get()
        if item is None:
            break
        print(f"Processing {item}")
        q.task_done()

q = Queue()
threads = []
for i in range(3):
    t = Thread(target=worker, args=(q,))
    t.start()
    threads.append(t)

for item in range(10):
    q.put(item)

q.join()

for i in range(3):
    q.put(None)

for t in threads:
    t.join()
Output:
Processing 0
Processing 1
Processing 2
...
Processing 9
This snippet creates a Queue and several worker threads that process items from it. When a worker retrieves the None sentinel from the queue, it breaks out of its loop, ending the thread. After the items are enqueued, q.join() blocks until every item has been marked done with task_done(). This method efficiently distributes tasks across multiple threads, though the exact interleaving of output may vary between runs.
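As a variant, the None sentinels can be avoided entirely by using daemon worker threads and relying on q.join() alone, a pattern shown in the queue module documentation. A rough sketch (the doubling "work" is illustrative):

```python
from queue import Queue
from threading import Thread

results = []

def worker(q):
    # Daemon workers loop forever; the process exits without waiting
    # for them, so no shutdown sentinel is needed.
    while True:
        item = q.get()
        results.append(item * 2)   # illustrative work
        q.task_done()

q = Queue()
for _ in range(3):
    Thread(target=worker, args=(q,), daemon=True).start()

for item in range(10):
    q.put(item)

q.join()  # blocks until every item has been marked task_done()
print(sorted(results))
```

This trades explicit shutdown for simplicity: the workers are killed abruptly at interpreter exit, which is fine for throwaway scripts but not when cleanup matters.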
Method 4: Using Global Interpreter Lock (GIL)-Aware Features
Understanding the GIL is key to writing efficient multithreaded Python programs. The GIL allows only one thread to execute Python bytecode at a time, which means CPU-bound operations typically see no performance boost from threads. However, libraries that release the GIL during heavy computation, such as NumPy, or native extensions that do the same, can help.
Here’s an example:
import threading
import numpy as np

def compute():
    # Intensive computation; NumPy releases the GIL during the heavy work
    result = np.sum(np.random.random(1000000))
    return result

threads = []
for i in range(2):
    thread = threading.Thread(target=compute)
    thread.start()
    threads.append(thread)

for thread in threads:
    thread.join()

print("Computation completed in all threads.")
Output:
Computation completed in all threads.
This example uses NumPy, which delegates work to optimized native libraries and releases the GIL while doing so. Multithreaded NumPy computations can therefore run with true parallelism during those GIL-free sections, potentially improving performance on multi-core systems.
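To see whether this actually helps on a given machine, one can time the same workload run sequentially and then split across two threads. A rough sketch (the array size is an arbitrary assumption, and any observed speedup is machine-dependent, so no particular numbers are guaranteed):

```python
import threading
import time
import numpy as np

def compute():
    # np.sum over a large array releases the GIL in the native code path
    return float(np.sum(np.random.random(2_000_000)))

# Run the workload twice, back to back, on the main thread.
start = time.perf_counter()
compute(); compute()
sequential = time.perf_counter() - start

# Run the same two workloads on two threads.
threads = [threading.Thread(target=compute) for _ in range(2)]
start = time.perf_counter()
for t in threads:
    t.start()
for t in threads:
    t.join()
threaded = time.perf_counter() - start

print(f"sequential: {sequential:.3f}s, threaded: {threaded:.3f}s")
```

On a multi-core machine the threaded timing is often lower, but thread startup overhead and the portions of NumPy code that still hold the GIL can erase the gain for small arrays.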
Bonus One-Liner Method 5: Using a list comprehension and threads
If your task involves simple operations, you can use a list comprehension as a one-liner that both instantiates and starts the threads. This method is succinct but far less flexible than the others.
Here’s an example:
from threading import Thread
[Thread(target=lambda i=i: print(i)).start() for i in range(5)]  # i=i binds each value at creation time
Output:
0
1
2
3
4
This one-liner creates and starts a list of threads in a single expression (a list comprehension used for its side effects). Each thread prints a loop index. Beware that a plain lambda: print(i) captures i by reference (late binding), so the idiomatic fix is to bind the current value as a default argument, lambda i=i: print(i). Note also that this syntax lacks thread management capabilities, such as the ability to join the threads, and the output order is not guaranteed, so use it only for very simple scenarios.
Summary/Discussion
- Method 1: Threading module. Easy to start with basic threading tasks. Subject to GIL, thus best for IO-bound tasks, not CPU-bound tasks.
- Method 2: ThreadPoolExecutor. Provides advanced features like work queue and future objects but does not help with CPU-bound tasks due to the GIL.
- Method 3: Queues for workload management. Useful for managing complex workflows and tasks between producer and consumer threads. Can be overkill for simpler tasks.
- Method 4: Using GIL-aware features. Good when using external libraries that can release the GIL. Less straightforward due to needing specific external libraries or extensions.
- Bonus Method 5: One-liner threads with a list comprehension. Quick and elegant for simple, fire-and-forget tasks. Unsuitable for tasks requiring sophisticated thread management.