Python | Split String by Number

āœØSummary: To split a string by a number, use the regex split method using the “\d” pattern.

Minimal Example

my_string = "#@1abc3$!*5xyz"

# Method 1
import re

res = re.split('\d+', my_string)
print(res)

# Method 2
import re

res = re.findall('\D+', my_string)
print(res)

# Method 3
from itertools import groupby

li = [''.join(g) for _, g in groupby(my_string, str.isdigit)]
res = [x for x in li if x.isdigit() == False]
print(res)

# Method 4
res = []
for i in my_string:
    if i.isdigit() == True:
        my_string = my_string.replace(i, ",")
print(my_string.split(","))

# Outputs:
# ['#@', 'abc', '$!*', 'xyz']

Problem Formulation

šŸ“œProblem: Given a string containing different characters. How will you split the string whenever a number appears?

Method 1: re.split()

TheĀ re.split(pattern, string)Ā method matches all occurrences of theĀ patternĀ in theĀ stringĀ and divides the string along the matches resulting in a list of stringsĀ betweenĀ the matches. For example,Ā re.split('a', 'bbabbbab')Ā results in the list of stringsĀ ['bb', 'bbb', 'b'].

Code:

import re
my_string = "#@1abc3$!*5xyz"
res = re.split('\d+', my_string)
print(res)

# ['#@', 'abc', '$!*', 'xyz']

Explanation: TheĀ \dĀ special characterĀ matches any digit between 0 and 9. By using the maximal number of digits as a delimiter, you split along the digit-word boundary.Ā 

Method 2: re.findall()

TheĀ re.findall(pattern, string)Ā method scansĀ stringĀ fromĀ left to right, searching for allĀ non-overlapping matchesĀ of theĀ pattern. It returns aĀ list of stringsĀ in the matching order when scanning the string from left to right.

Code:

import re
my_string = "#@1abc3$!*5xyz"
res = re.findall('\D+', my_string)
print(res)

# ['#@', 'abc', '$!*', 'xyz']

Explanation: TheĀ \DĀ special characterĀ matches all characters except any digit between 0 and 9. Thus, you are essentially finding all character groups that appear before the occurrence of a digit.

Do you want to master the regex superpower? Check out my new book The Smartest Way to Learn Regular Expressions in Python with the innovative 3-step approach for active learning: (1) study a book chapter, (2) solve a code puzzle, and (3) watch an educational chapter video.

Method 3: itertools.groupby()

Code:

from itertools import groupby
my_string = "#@1abc3$!*5xyz"
li = [''.join(g) for _, g in groupby(my_string, str.isdigit)]
res = [x for x in li if x.isdigit() == False]
print(res)

# ['#@', 'abc', '$!*', 'xyz']

Explanation:

  • TheĀ itertools.groupby(iterable, key=None)Ā function creates an iterator that returns tuplesĀ (key, group-iterator)Ā grouped by each value ofĀ key. We use theĀ str.isdigit()Ā function as key function.
  • TheĀ str.isdigit()Ā function returnsĀ TrueĀ if the string consists only of numeric characters. Thus, you will have a list created by using numbers as separators. Note that this list will also contain the numbers as items within it.
  • In order to eliminate the numbers, use another list comprehension that checks if an element in the list returned previously is a digit or not with the help of the isdigit method. If it is a digit, the item will be discarded. Otherwise it will be stored in the list.

Method 4: Replace Using a for Loop

Approach: Use a for loop to iterate through the characters of the given string. Check if a character is a digit or not. As soon as a digit is found, replace that character/digit with a delimiter string ( we have used a comma here) with the help of the replace() method. This basically means that you are placing a particular character in the string whenever a number appears. Once all the digits are replaced by the separator string, split the string by passing the separator string as a delimiter to the split method.

Code:

my_string = "#@1abc3$!*5xyz"
res = []
for i in my_string:
    if i.isdigit():
        my_string = my_string.replace(i, ",")
print(my_string.split(","))

# ['#@', 'abc', '$!*', 'xyz']

Conclusion

Phew! We have successfully solved the given problem and managed to do so using four different ways. I hope you found this article helpful and it answered your queries. Please subscribe and stay tuned for more solutions and tutorials.

Happy coding! šŸ™‚

šŸŒRelated Read: How to Split a String Between Numbers and Letters?