Sorry, this page is no longer available

We may earn an affiliate commission when you visit our partners.

Course image

Web Scraping with Python

Course image

Alfredo Deza

In this 2-hour long project-based course, you will learn how to analyze complex HTML structures and identify the relevant data to be extracted using Scrapy and XPath. You will apply the concepts of web scraping, including setting up a Scrapy project, generating spiders, and using XPath queries to extract data from websites that do not provide an API. Additionally, you will evaluate the effectiveness and efficiency of your scraping code, considering factors such as changing webpage structures, scalability, and coding defensively to ensure robustness. The course includes hands-on labs where you will create a spider and parse complex HTML, allowing you to practice and reinforce the concepts learned.

Or subscribe to Coursera Plus

And get unlimited access to Coursera

Here's a deal for you

Save money when you learn with a deal that may be relevant to this course.

All coupon codes, vouchers, and discounts are applied automatically unless otherwise noted.

Valid until August 30

Google AI App Builder

Learn how to use Gemini API and API Studio with a three-course series from Google DeepMind

What's inside

Syllabus

Traffic lights

Read about what's good

what should give you pause

and possible dealbreakers

Taught by Alfredo Deza, who is recognized for their work in web scraping

Explores web scraping, which is standard in data analytics

Develops skills in creating spiders and using XPath queries, which are core for web scraping

Covers evaluating the effectiveness and efficiency of web scraping code, which is essential for data reliability

Offers hands-on labs, which reinforce learning and provide practical experience

Save this course

Create your own learning path. Save this course to your list so you can find it easily later.

Save

Reviews summary

Practical python web scraping with scrapy

According to learners, this course offers a practical and hands-on introduction to web scraping using Python, Scrapy, and XPath. Many find the project-based approach highly engaging and effective for quickly applying concepts to real-world tasks. It's lauded for providing a solid foundation in data extraction from websites without APIs. While offering clear and concise explanations, some students caution that the course assumes a baseline knowledge of Python and can be fast-paced for absolute beginners. Concerns about environment setup challenges and the limited coverage of advanced topics like dynamic content persist for a segment of learners, but overall it's seen as a highly valuable and efficient learning experience for those with some programming background.

Provides a good introduction, but doesn't deep dive into advanced techniques.

"It's a short course, so don't expect deep dives into advanced topics like handling dynamic content or anti-scraping techniques, but it's a solid foundation."

"This course is a gem for getting into web scraping quickly. It taught me exactly what I needed to know about Scrapy and XPath to get started."

"I gained a solid foundation from completing this course, allowing me to understand the core concepts of web scraping efficiently."

Instructor provides clear and concise explanations of complex concepts.

"The instructor's explanations were clear and concise."

"Very practical and to the point. The Scrapy concepts are introduced perfectly and easy to grasp."

"The instructor was engaging and clearly articulated the necessary steps to get a web scraping project off the ground."

Focuses on real-world application with effective hands-on labs.

"Fantastic course! The hands-on labs using Scrapy and XPath were incredibly practical. I already had some Python experience, and this course allowed me to quickly apply what I learned to my work."

"Excellent! Very practical and to the point. The project was realistic and I learned a lot. It's concise and respects your time."

"The hands-on coding and projects are the strongest part of the course for me, reinforcing concepts effectively."

Some learners experienced frustration with setting up the development environment.

"I struggled a bit with setting up the environment. I wish there were more comprehensive examples or a dedicated section on error handling."

"The course content is good, but the environment setup was quite frustrating. I spent more time debugging my setup than actually learning to scrape."

"Scrapy setup was a hassle, and the course didn't really help with common issues I encountered during installation."

Limited coverage of dynamic content and JavaScript rendering.

"I think it could benefit from an update to cover more modern web technologies like JavaScript rendering, as many sites now use these."

"I was hoping for more information on how to handle dynamic content or anti-scraping measures, but these topics weren't covered in depth."

"The content felt a bit outdated for tackling modern websites that rely heavily on client-side rendering."

Best for those with some Python background, challenging for absolute beginners.

"For someone with zero prior programming experience, this might be a bit fast-paced. I struggled a bit with setting up the environment. The instructor assumes a baseline knowledge of Python."

"It's a useful course, especially for beginners with some Python background. I found the HTML parsing with XPath a bit challenging initially, but the examples helped."

"I felt the instructor assumed too much prior knowledge in Python; it was hard to follow at times."

Activities

Be better prepared before your course. Deepen your understanding during and after it. Supplement your coursework and achieve mastery of the topics covered in Web Scraping with Python with these activities:

Review HTML Basics

Show steps

Review basics of HTML markup for web development to refresh

Browse courses on Web Technologies

Show steps

Review HTML elements and tags
Practice writing simple HTML code

Learn XPath Syntax

Show steps

Follow tutorials on XPath syntax for extracting data from HTML

Browse courses on Xpath

Show steps

Read documentation or watch videos on XPath
Practice writing XPath expressions
Use XPath tools to validate expressions

Connect with Web Scraping Experts

Show steps

Seek guidance and advice from experts in web scraping

Browse courses on Mentoring

Show steps

Attend industry events and meetups
Join online forums and communities
Reach out to professionals on LinkedIn

Three other activities

Expand to see all activities and additional details

Show all six activities

Attend a Web Scraping Workshop

Show steps

Participate in a workshop to gain hands-on experience with web scraping

Show steps

Research and identify relevant workshops
Register and attend the workshop
Engage with instructors and participants
Practice and apply learned techniques

Contribute to Open Source Web Scraping Projects

Show steps

Contribute to open-source projects related to web scraping

Browse courses on Open Source Projects

Show steps

Find open-source web scraping projects
Identify areas to contribute
Submit code contributions
Participate in discussions and issue tracking

Share Your Web Scraping Knowledge

Show steps

Write a blog post or article sharing tips and tricks on web scraping

Browse courses on Blogging

Show steps

Choose a topic related to web scraping
Conduct research and gather information
Write and edit your content
Publish your article or blog post

Career center

Learners who complete Web Scraping with Python will develop knowledge and skills that may be useful to these careers:

Web Developer

Web Developers are responsible for designing and developing websites. They use their knowledge of HTML, CSS, and JavaScript to create user interfaces, implement features, and ensure the website functions as intended. This course can help Web Developers build a foundation in web scraping, which is an essential skill for extracting data from websites. By learning how to use Scrapy and XPath, Web Developers can improve their ability to gather data, analyze it, and use it to make informed decisions.

See salaries and explore the career path for Web Developer

Data Analyst

Data Analysts collect, clean, and analyze data to identify trends and patterns. They use their findings to make recommendations and inform decision-making. This course can help Data Analysts develop skills in web scraping, which is a valuable tool for gathering data from the web. By learning how to use Scrapy and XPath, Data Analysts can expand their data sources and improve the accuracy and efficiency of their analyses.

See salaries and explore the career path for Data Analyst

Business Analyst

Business Analysts work with businesses to identify and solve problems. They use their analytical skills to assess business needs, develop solutions, and improve processes. This course can help Business Analysts develop skills in web scraping, which can be used to gather data from websites and analyze it to identify trends and patterns. By learning how to use Scrapy and XPath, Business Analysts can improve their ability to understand business needs and develop effective solutions.

See salaries and explore the career path for Business Analyst

Software Engineer

Software Engineers design, develop, and maintain software systems. They use their knowledge of programming languages and software engineering principles to create software that meets the needs of users. This course can help Software Engineers develop skills in web scraping, which can be used to gather data from websites and analyze it to identify bugs and improve performance. By learning how to use Scrapy and XPath, Software Engineers can improve their ability to develop robust and efficient software systems.

See salaries and explore the career path for Software Engineer

Data Scientist

Data Scientists use their knowledge of statistics, machine learning, and data analysis to extract insights from data. They use these insights to make predictions, develop models, and solve business problems. This course can help Data Scientists develop skills in web scraping, which can be used to gather data from websites and analyze it to identify trends and patterns. By learning how to use Scrapy and XPath, Data Scientists can expand their data sources and improve the accuracy and efficiency of their analyses.

See salaries and explore the career path for Data Scientist

Information Security Analyst

Information Security Analysts protect computer systems and networks from unauthorized access, use, disclosure, disruption, modification, or destruction. They use their knowledge of security principles and practices to identify and mitigate threats. This course can help Information Security Analysts develop skills in web scraping, which can be used to gather data from websites and analyze it to identify vulnerabilities and threats. By learning how to use Scrapy and XPath, Information Security Analysts can improve their ability to protect computer systems and networks from cyberattacks.

See salaries and explore the career path for Information Security Analyst

UX Designer

UX Designers design and evaluate user interfaces to ensure that they are easy to use, efficient, and enjoyable. They use their knowledge of human factors, design principles, and usability testing to create user interfaces that meet the needs of users. This course can help UX Designers develop skills in web scraping, which can be used to gather data from websites and analyze it to identify user behavior and preferences. By learning how to use Scrapy and XPath, UX Designers can improve their ability to design user interfaces that are both effective and engaging.

See salaries and explore the career path for UX Designer

Product Manager

Product Managers are responsible for planning, developing, and launching products. They work with engineering, design, and marketing teams to ensure that products meet the needs of users and are successful in the market. This course can help Product Managers develop skills in web scraping, which can be used to gather data from websites and analyze it to identify market trends and customer feedback. By learning how to use Scrapy and XPath, Product Managers can improve their ability to develop and launch successful products.

See salaries and explore the career path for Product Manager

Information Architect

Information Architects design and organize information systems to make them easy to find and use. They use their knowledge of information science, user experience, and design principles to create information systems that meet the needs of users. This course can help Information Architects develop skills in web scraping, which can be used to gather data from websites and analyze it to identify user behavior and preferences. By learning how to use Scrapy and XPath, Information Architects can improve their ability to design and organize information systems that are both effective and efficient.

See salaries and explore the career path for Information Architect

Webmaster

Webmasters are responsible for maintaining and updating websites. They use their knowledge of HTML, CSS, and JavaScript to ensure that websites are functioning properly and are up-to-date. This course can help Webmasters develop skills in web scraping, which can be used to gather data from websites and analyze it to identify performance issues and improve the user experience. By learning how to use Scrapy and XPath, Webmasters can improve their ability to maintain and update websites efficiently and effectively.

See salaries and explore the career path for Webmaster

Market Researcher

Market Researchers collect and analyze data to understand market trends and customer behavior. They use their findings to make recommendations and inform decision-making. This course can help Market Researchers develop skills in web scraping, which can be used to gather data from websites and analyze it to identify market trends and customer feedback. By learning how to use Scrapy and XPath, Market Researchers can improve their ability to gather and analyze data, and develop insights that can help businesses make informed decisions.

See salaries and explore the career path for Market Researcher

Content Strategist

Content Strategists develop and implement content strategies to achieve business goals. They use their knowledge of content marketing, SEO, and social media to create and distribute content that attracts and engages target audiences. This course can help Content Strategists develop skills in web scraping, which can be used to gather data from websites and analyze it to identify content trends and audience behavior. By learning how to use Scrapy and XPath, Content Strategists can improve their ability to develop and implement effective content strategies.

See salaries and explore the career path for Content Strategist

Technical Writer

Technical Writers create and edit technical documentation, such as user manuals, white papers, and training materials. They use their knowledge of technical writing principles and style to create documentation that is clear, concise, and accurate. This course can help Technical Writers develop skills in web scraping, which can be used to gather data from websites and analyze it to identify technical issues and improve documentation. By learning how to use Scrapy and XPath, Technical Writers can improve their ability to create and edit technical documentation that is both effective and informative.

See salaries and explore the career path for Technical Writer

Salesforce Administrator

Salesforce Administrators manage and configure Salesforce software to meet the needs of businesses. They use their knowledge of Salesforce functionality and best practices to ensure that Salesforce is implemented and used effectively. This course can help Salesforce Administrators develop skills in web scraping, which can be used to gather data from websites and analyze it to identify sales leads and opportunities. By learning how to use Scrapy and XPath, Salesforce Administrators can improve their ability to manage and configure Salesforce to help businesses achieve their sales goals.

See salaries and explore the career path for Salesforce Administrator

SEO Specialist

SEO Specialists optimize websites and content to improve their ranking in search engine results pages (SERPs). They use their knowledge of SEO techniques and best practices to increase website traffic and visibility. This course can help SEO Specialists develop skills in web scraping, which can be used to gather data from websites and analyze it to identify keyword trends and競爭者策略. By learning how to use Scrapy and XPath, SEO Specialists can improve their ability to optimize websites and content for search engines.

See salaries and explore the career path for SEO Specialist

Reading list

We've selected seven books that we think will supplement your learning. Use these to develop background knowledge, enrich your coursework, and gain a deeper understanding of the topics covered in Web Scraping with Python.

Cover image

Cover image

Web Scraping with Python

Save

Provides a comprehensive guide to web scraping with Python, covering essential techniques and best practices. It is particularly useful for understanding the fundamentals of web scraping and for gaining practical experience with tools such as Scrapy and BeautifulSoup.

Web Scraping with Python

Web Scraping with Python

Cover image

Cover image

Learning SPARQL

Save

Provides a comprehensive guide to web scraping, covering the entire process from planning and preparation to data extraction and analysis. It valuable resource for learners who want to gain a holistic understanding of web scraping and develop effective strategies for their projects.

Learning SPARQL: Querying and Updating with SPARQL...

Learning SPARQL 1st (first) Edition by Bob DuCharme...

Unknown Binding

Learning SPARQL: Querying and Updating with SPARQL...

Cover image

Cover image

RESTful Web Services

Save

The official documentation for Beautiful Soup, a popular Python library for parsing HTML and XML documents, provides detailed information on its features and usage. It is an essential reference for learners who want to master the library and use it effectively for web scraping.

RESTful Web Services

RESTful Web Services

Cover image

Cover image

Python Web Scraping

Save

Offers a practical guide to web scraping with Python, covering essential techniques and tools. It good resource for learners who want to quickly get started with web scraping and build their own scripts.

Python Web Scraping - Second Edition: Hands-on data...

Web Scraping With Python: Scrape Data from Any...

Python web scraping (Korean Edition)

Python Web Scraping: Hands-on data scraping and...

Web Scraping with Python (Community Experience...

Cover image

Cover image

Head First JavaScript Programming

Save

Provides a comprehensive and accessible introduction to HTML and CSS, covering the fundamentals of web development. It useful resource for learners who want to understand the structure and styling of web pages, which is essential for effective web scraping.

Head First JavaScript Programming

Head First JavaScript Programming

Cover image

Cover image

Learning Python

Save

Provides a comprehensive and up-to-date overview of Python programming, covering both the basics and advanced topics. It valuable reference for learners who want to deepen their understanding of Python and become proficient in the language.

Learning Python: Powerful Object-Oriented...

Learning Python

Learning Python: Powerful Object-Oriented...

Coding for Beginners Using Python: A Hands-On,...

Audible Audiobook

Learning Python

Learning Python: Powerful Object-Oriented...

Learning Python: Powerful Object-Oriented...

Learning Python

Learning Python

Cover image

Cover image

Automate the Boring Stuff with Python, 2nd Edition

Save

While not specifically focused on web scraping, this book provides a comprehensive introduction to Python programming, covering essential concepts and techniques. It useful resource for learners who are new to programming or need a refresher on the fundamentals.

Automate the Boring Stuff with Python, 2nd Edition:...

Automate the Boring Stuff with Python, 2nd Edition:...

Share

Help others find this course page by sharing it with your friends and followers:

Copy Link

Similar courses

Similar courses are unavailable at this time. Please try again later.

Level

Intermediate

Via

Coursera

Institution

Duke University

Instructor

Alfredo Deza

Language

English

Traffic lights

Read about what's good

what should give you pause

and possible dealbreakers

Taught by Alfredo Deza, who is recognized for their work in web scraping

Explores web scraping, which is standard in data analytics

Develops skills in creating spiders and using XPath queries, which are core for web scraping

Covers evaluating the effectiveness and efficiency of web scraping code, which is essential for data reliability

Offers hands-on labs, which reinforce learning and provide practical experience

Share this

Share to help others discover this course.

Link

Begin learning today

Enroll now to gain full access to Web Scraping with Python.

Enroll now Enroll in this course

Save for later

Add this course to your list. Find it anytime.

Save

Our mission

OpenCourser helps millions of learners each year. People visit us to learn workspace skills, ace their exams, and nurture their curiosity.

Our extensive catalog contains over 50,000 courses and twice as many books. Browse by search, by topic, or even by career interests. We'll match you to the right resources quickly.

Find this site helpful? Tell a friend about us.

Affiliate disclosure

We're supported by our community of learners. When you purchase or subscribe to courses and programs or purchase books, we may earn a commission from our partners.

Your purchases help us maintain our catalog and keep our servers humming without ads.

Thank you for supporting OpenCourser.

© 2016 - 2025 OpenCourser