Python | Extract words from given string
Last Updated: 11 Jul, 2025
Extracting words from a given string refers to identifying and separating individual words from a block of text or a sentence. This is a common task when processing, searching, filtering or analyzing text.
Example: Here, each word is extracted from a given string.
Input: GeeksForGeeks is the best Computer Science Portal
Output: ['GeeksForGeeks', 'is', 'the', 'best', 'Computer', 'Science', 'Portal']
Python provides different methods to extract words from a string. Let’s explore them one by one.
Using split()
The split() method splits the string at spaces (or a specified delimiter) and returns a list of individual words. However, it does not remove punctuation marks, so they may stay attached to the words.
Example:
In this example, the split() method is used to extract individual words from a given string.
Python
Str = "Python is a powerful and versatile programming language"
print(Str.split())
Output
['Python', 'is', 'a', 'powerful', 'and', 'versatile', 'programming', 'language']
Explanation: split() is used to extract each word from Str; it separates the words based on spaces and returns them as a list.
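For instance, the minimal sketch below (using a made-up sample string) illustrates both points from above: punctuation stays attached when splitting on whitespace, and a custom delimiter can be passed to split().
Python
# Illustrative sketch: punctuation stays attached, and split() accepts a delimiter
Str = "Python, is simple, yet powerful"

print(Str.split())      # ['Python,', 'is', 'simple,', 'yet', 'powerful'] -- commas stay attached
print(Str.split(", "))  # ['Python', 'is simple', 'yet powerful'] -- splits only at ", "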
Using Regex
Regular expressions make it possible to extract words from text that contains punctuation or special characters. Python's re.findall() helps keep only the valid words.
Example:
This program uses the re.findall() method from Python's re module to extract words from a string.
Python
import re
Str = "Python, is widely-used @# for Data Science and AI.!!!"
T = re.findall(r'\w+', Str)
print(T)
Output
['Python', 'is', 'widely', 'used', 'for', 'Data', 'Science', 'and', 'AI']
Explanation: re.findall(r'\w+', Str) extracts all sequences of letters, digits and underscores, effectively skipping punctuation and special characters.
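A small variation worth noting (the sample string below is made up): since \w+ also matches digits and underscores, a pattern such as [A-Za-z]+ can be used instead when only purely alphabetic words are wanted.
Python
import re

# Illustrative comparison: \w+ vs. an alphabetic-only pattern
Str = "Python3 is_widely used for AI in 2024"
print(re.findall(r'\w+', Str))        # ['Python3', 'is_widely', 'used', 'for', 'AI', 'in', '2024']
print(re.findall(r'[A-Za-z]+', Str))  # ['Python', 'is', 'widely', 'used', 'for', 'AI', 'in']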
Using List Comprehension
A list comprehension, combined with functions like strip() and isalnum(), filters out punctuation and collects clean, valid words in a compact way.
Example:
Here, a list comprehension is used along with string.punctuation and the isalnum() method to extract clean words from a string.
Python
import string
Str = "Python, is simple @# yet powerful Programming Language.!!!"
T = [w.strip(string.punctuation) for w in Str.split() if w.strip(string.punctuation).isalnum()]
print(T)
Output
['Python', 'is', 'simple', 'yet', 'powerful', 'Programming', 'Language']
Explanation:
- Str.split() splits the string into words.
- w.strip(string.punctuation) strips leading and trailing punctuation from each word.
- isalnum() ensures only alphanumeric words are kept.
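A quick check (with a token chosen just for illustration) makes the last bullet concrete: a pure-punctuation token such as "@#" strips down to an empty string, and ''.isalnum() is False, so the comprehension drops it.
Python
import string

# Illustrative check of why pure-punctuation tokens disappear
token = "@#"
print(token.strip(string.punctuation))            # '' (empty string)
print(token.strip(string.punctuation).isalnum())  # False, so the comprehension drops it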
Using Regex + string.punctuation
Regular expressions can be combined with Python's string.punctuation to remove all punctuation marks from a string before extracting words. This is useful when text contains various special characters, ensuring cleaner word extraction.
Example:
This code extracts words from a string by removing all punctuation using regular expressions.
Python
import re
import string
Str = "Python, is simple @# yet powerful Programming Language.!!!"
a = "[" + re.escape(string.punctuation) + "]"
T = re.sub(a, "", Str).split()
print(T)
Output
['Python', 'is', 'simple', 'yet', 'powerful', 'Programming', 'Language']
Explanation:
- re.escape(string.punctuation) safely escapes all punctuation characters, and a = "[" + ... + "]" builds a regex character class that matches them.
- re.sub(a, "", Str) removes all punctuation from the string.
- .split() splits the cleaned string into individual words.
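One behavior worth noting (the sample string below is made up): because every punctuation character is deleted, hyphens and apostrophes inside words are removed as well, which differs from the tokenizer-based approach in the next section.
Python
import re
import string

# Illustrative sketch: punctuation inside words is also stripped out
a = "[" + re.escape(string.punctuation) + "]"
Str = "Python is easy-to-learn, isn't it?"
print(re.sub(a, "", Str).split())   # ['Python', 'is', 'easytolearn', 'isnt', 'it']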
Using NLP Libraries
Natural Language Processing (NLP) libraries like NLTK provide powerful tools for text analysis. When extracting words from a string, they offer more accuracy by properly handling punctuation, contractions and tokenization, making them ideal for complex or real-world text data.
Example:
This program demonstrates how to extract words from a string using NLTK's word_tokenize() function.
Python
import nltk

# nltk.download('punkt')  # the tokenizer models may need to be downloaded once
Str = "Python is easy-to-learn, powerful and widely used in tech!"
words = nltk.word_tokenize(Str)
print(words)
Output
['Python', 'is', 'easy-to-learn', ',', 'powerful', 'and', 'widely', 'used', 'in', 'tech', '!']
Explanation:
- nltk.word_tokenize() splits the string into tokens (words and punctuation).
- It preserves punctuation as separate tokens: ',' and '!'.
- It treats hyphenated words like "easy-to-learn" as one token.
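As a small follow-up sketch (not part of the original example), the punctuation tokens produced by word_tokenize() can be filtered out afterwards if only the words are wanted; the filter condition below is just one possible choice.
Python
import nltk

Str = "Python is easy-to-learn, powerful and widely used in tech!"
tokens = nltk.word_tokenize(Str)
words = [t for t in tokens if any(ch.isalnum() for ch in t)]  # keep tokens containing letters or digits
print(words)  # ['Python', 'is', 'easy-to-learn', 'powerful', 'and', 'widely', 'used', 'in', 'tech']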