We may earn an affiliate commission when you visit our partners.
Course image
Course image
Coursera logo

Exploratory Data Analysis with Textual Data in R / Quanteda

Nicole Baerg

In this 1-hour long project-based course, you will learn how to explore presidential concession speeches by US presidential candidates over time, looking specifically at speech length and top words and examining variation by Democrat and Republican candidates. You will learn how to import textual data stored in raw text files, turn these files into a corpus (a collection of textual documents) and tokenize the text all using the software package quanteda. You will also learn how to extract useful information from filenames and how to use this information to generate visualizations of textual data using the stringr and ggplot2 packages.

Read more

In this 1-hour long project-based course, you will learn how to explore presidential concession speeches by US presidential candidates over time, looking specifically at speech length and top words and examining variation by Democrat and Republican candidates. You will learn how to import textual data stored in raw text files, turn these files into a corpus (a collection of textual documents) and tokenize the text all using the software package quanteda. You will also learn how to extract useful information from filenames and how to use this information to generate visualizations of textual data using the stringr and ggplot2 packages.

Note: This course works best for learners who are based in the North America region. We’re currently working on providing the same experience in other regions.

Enroll now

What's inside

Syllabus

Exploratory Data Analysis with Textual Data in R using Quanteda
By the end of this project, you will learn how to import textual data stored in raw text files into R, turn these files into a corpus (a collection of textual documents) and tokenize the text all using the R software package quanteda. You will also learn how to extract useful information from filenames and how to use this information to generate visualizations of textual data using the stringr and ggplot packages in R. At the end of this project, among other things you will explore presidential concession speeches by US presidential candidates over time, looking specifically at speech length and top words and examining variation by Democrat and Republican candidates. This guided project is for beginners interested in quantitative text analysis in R. It assumes no knowledge of textual analysis and focuses on exploring textual data (US Presidential Concession Speeches). Users should have a basic understanding of the statistical programming language R. By the end of the exercise, learners will know how to load textual data into R, summarize the data using descriptive quantities of interest, turn text into tokens, and visualize changes over time as well as top words.. Familiarity with R including stringr and ggplot is useful but not essential.

Good to know

Know what's good
, what to watch for
, and possible dealbreakers
Develops skills in data analysis of text using R packages like quanteda and stringr, which are essential in fields where working with text data is important
Focuses on beginner-friendly text analysis, making it accessible to those new to the field
Provides hands-on practice with real-world data, offering practical experience in text analysis
Requires basic knowledge of R programming, which may limit accessibility for complete beginners
Assumes familiarity with stringr and ggplot packages, which may require additional learning for those unfamiliar
Focuses primarily on presidential concession speeches, which may limit the scope of its applicability to other domains

Save this course

Save Exploratory Data Analysis with Textual Data in R / Quanteda to your list so you can find it easily later:
Save

Reviews summary

Highly rated eda with text data

Learners say this course on Exploratory Data Analysis with Textual Data in R / Quanteda is a great learning experience, with nice explanations and thanks to the teacher.

Activities

Coming soon We're preparing activities for Exploratory Data Analysis with Textual Data in R / Quanteda. These are activities you can do either before, during, or after a course.

Career center

Learners who complete Exploratory Data Analysis with Textual Data in R / Quanteda will develop knowledge and skills that may be useful to these careers:
Natural Language Processing Scientist
A Natural Language Processing Scientist develops, builds, and maintains computer systems that understand and process human language. They use their knowledge of linguistics, computer science, and mathematics to create algorithms that can identify parts of speech, extract meaning from text, and generate natural language text. This course can help you build a foundation in natural language processing by teaching you how to import, clean, and analyze textual data using the R programming language and the quanteda package. You will also learn how to visualize your results and communicate your findings effectively.
Data Scientist
A Data Scientist uses their knowledge of mathematics, statistics, and computer science to extract insights from data. They use their skills to develop and build models that can predict future trends, identify patterns, and optimize processes. This course can help you build a foundation in data science by teaching you how to import, clean, and analyze textual data using the R programming language and the quanteda package. You will also learn how to visualize your results and communicate your findings effectively.
Machine Learning Engineer
A Machine Learning Engineer develops and builds machine learning models that can learn from data and make predictions. They use their knowledge of mathematics, statistics, and computer science to create algorithms that can identify patterns, classify data, and make predictions. This course can help you build a foundation in machine learning by teaching you how to import, clean, and analyze textual data using the R programming language and the quanteda package. You will also learn how to visualize your results and communicate your findings effectively.
Computational Linguist
A Computational Linguist develops and builds computer systems that understand and process human language. They use their knowledge of linguistics, computer science, and mathematics to create algorithms that can identify parts of speech, extract meaning from text, and generate natural language text. This course can help you build a foundation in computational linguistics by teaching you how to import, clean, and analyze textual data using the R programming language and the quanteda package. You will also learn how to visualize your results and communicate your findings effectively.
Text Analyst
A Text Analyst uses their knowledge of language, linguistics, and computer science to analyze and interpret text data. They use their skills to identify patterns, extract meaning, and summarize findings. This course can help you build a foundation in text analysis by teaching you how to import, clean, and analyze textual data using the R programming language and the quanteda package. You will also learn how to visualize your results and communicate your findings effectively.
Content Strategist
A Content Strategist develops and implements content strategies that help organizations achieve their business goals. They use their knowledge of marketing, communication, and technology to create and deliver content that is engaging, informative, and persuasive. This course can help you build a foundation in content strategy by teaching you how to analyze and interpret text data. You will also learn how to use your findings to develop and implement effective content strategies.
Digital Marketing Manager
A Digital Marketing Manager plans and executes digital marketing campaigns that help organizations reach their target audience and achieve their business goals. They use their knowledge of marketing, advertising, and technology to create and deliver digital content that is engaging, informative, and persuasive. This course can help you build a foundation in digital marketing by teaching you how to analyze and interpret text data. You will also learn how to use your findings to develop and implement effective digital marketing campaigns.
Public relations manager
A Public Relations Manager develops and implements public relations campaigns that help organizations build and maintain a positive reputation. They use their knowledge of media, communication, and public affairs to create and deliver messages that are informative, persuasive, and engaging. This course can help you build a foundation in public relations by teaching you how to analyze and interpret text data. You will also learn how to use your findings to develop and implement effective public relations campaigns.
Market Research Analyst
A Market Research Analyst conducts market research studies to help organizations understand their target audience and make informed decisions. They use their knowledge of research methods, statistics, and data analysis to collect, analyze, and interpret data. This course can help you build a foundation in market research by teaching you how to analyze and interpret text data. You will also learn how to use your findings to develop and implement effective market research studies.
Business Analyst
A Business Analyst analyzes business processes and systems to identify areas for improvement. They use their knowledge of business, technology, and data analysis to develop and implement solutions that help organizations achieve their goals. This course can help you build a foundation in business analysis by teaching you how to analyze and interpret text data. You will also learn how to use your findings to develop and implement effective business solutions.
Speechwriter
A Speechwriter writes speeches for public figures, such as politicians, business leaders, and celebrities. They use their knowledge of rhetoric, communication, and public speaking to create speeches that are informative, persuasive, and engaging. This course can help you build a foundation in speechwriting by teaching you how to analyze and interpret text data. You will also learn how to use your findings to develop and write effective speeches.
Technical Writer
A Technical Writer develops and writes technical documentation, such as user manuals, white papers, and training materials. They use their knowledge of technical writing, grammar, and style to create documents that are clear, concise, and informative. This course can help you build a foundation in technical writing by teaching you how to analyze and interpret text data. You will also learn how to use your findings to develop and write effective technical documents.
Editor
An Editor reviews, edits, and proofreads written content, such as books, articles, and website content. They use their knowledge of grammar, style, and punctuation to ensure that written content is clear, concise, and error-free. This course can help you build a foundation in editing by teaching you how to analyze and interpret text data. You will also learn how to use your findings to develop and implement effective editing strategies.
Journalist
A Journalist researches, writes, and reports on news and current events. They use their knowledge of journalism, writing, and communication to create news stories that are informative, accurate, and engaging. This course can help you build a foundation in journalism by teaching you how to analyze and interpret text data. You will also learn how to use your findings to develop and write effective news stories.
Librarian
A Librarian helps people find and access information. They use their knowledge of library science, information technology, and customer service to provide a variety of services, such as reference assistance, book recommendations, and technology training. This course can help you build a foundation in library science by teaching you how to analyze and interpret text data. You will also learn how to use your findings to develop and implement effective library services.

Reading list

We've selected seven books that we think will supplement your learning. Use these to develop background knowledge, enrich your coursework, and gain a deeper understanding of the topics covered in Exploratory Data Analysis with Textual Data in R / Quanteda.
Provides a practical introduction to text mining using the tidytext package in R, covering topics such as text cleaning, tokenization, and text visualization.
Provides a comprehensive introduction to the R programming language, covering topics such as data manipulation, statistical analysis, and graphics.
Provides a comprehensive overview of natural language processing (NLP) techniques using Python, covering topics such as text classification, sentiment analysis, and machine translation.
Provides a comprehensive overview of information retrieval techniques, covering topics such as text retrieval, web search, and social media analysis.
Provides a comprehensive overview of speech and language processing techniques, covering topics such as speech recognition, natural language understanding, and machine translation.
Provides a comprehensive overview of computational linguistics, covering topics such as natural language understanding, machine translation, and speech recognition.

Share

Help others find this course page by sharing it with your friends and followers:

Similar courses

Here are nine courses similar to Exploratory Data Analysis with Textual Data in R / Quanteda.
Introduction to Text Classification in R with quanteda
Most relevant
Building Features from Text Data
Most relevant
Introduction to Sentiment Analysis in R with quanteda
Most relevant
Discussing Politics and Debates Teach-Out
Most relevant
Processing and Filtering Text Data with Bash and Z Shell
Vim Masterclass
Spring AI: Beginner to Guru
Quantitative Text Analysis and Measures of Readability in...
Text Mining and Natural Language Processing in R
Our mission

OpenCourser helps millions of learners each year. People visit us to learn workspace skills, ace their exams, and nurture their curiosity.

Our extensive catalog contains over 50,000 courses and twice as many books. Browse by search, by topic, or even by career interests. We'll match you to the right resources quickly.

Find this site helpful? Tell a friend about us.

Affiliate disclosure

We're supported by our community of learners. When you purchase or subscribe to courses and programs or purchase books, we may earn a commission from our partners.

Your purchases help us maintain our catalog and keep our servers humming without ads.

Thank you for supporting OpenCourser.

© 2016 - 2024 OpenCourser