We may earn an affiliate commission when you visit our partners.
Course image
Charles Ivan Niswander II
In this 1-hour long project-based course, you will learn how to collect and label images and use them to train a Tensorflow CNN (convolutional neural network) model to recognize relevant areas of (typeface) text in any image, video frame or frame from webcam video. You will learn how to extract image segments that your detector has identified as containing text and enhance them using various image filters from the OpenCV module. Then you will learn how to pass the result image to Google's open-source OCR (Optical Character Recognition) software using the pytesseract python library and read the text to whatever form of output you...
Read more
In this 1-hour long project-based course, you will learn how to collect and label images and use them to train a Tensorflow CNN (convolutional neural network) model to recognize relevant areas of (typeface) text in any image, video frame or frame from webcam video. You will learn how to extract image segments that your detector has identified as containing text and enhance them using various image filters from the OpenCV module. Then you will learn how to pass the result image to Google's open-source OCR (Optical Character Recognition) software using the pytesseract python library and read the text to whatever form of output you like. All of this will be done on Windows, but can be accomplished with very little alteration on Linux as well. We will be using the IDLE development environment to write a single script to scan our video, webcam input, or array of images for text and read that text into our output. Tensorflow, the Tensorflow Object Detection API, Tesseract, the pytesseract library, labelImg for image annotation, OpenCV, and all other required software has already been installed for you in your Rhyme desktop. Note: This course works best for learners who are based in the North America region. We’re currently working on providing the same experience in other regions.
Enroll now

Good to know

Know what's good
, what to watch for
, and possible dealbreakers
Provides hands-on practice with image processing and OCR
Introduces learners to several libraries such as TensorFlow, OpenCV, and tesseract
Develops skills in image annotation, image enhancement, and text recognition
Provides a comprehensive introduction to text recognition in images using TensorFlow and OpenCV
Requires learners to have a background in Python and basic image processing concepts
Assumes learners have access to Windows or Linux operating systems and specific software

Save this course

Save Real-time OCR and Text Detection with Tensorflow, OpenCV and Tesseract to your list so you can find it easily later:
Save

Reviews summary

Real-time ocr with tensorflow, opencv and tesseract

This 1-hour long course focuses on using Tensorflow, OpenCV, and Tesseract for real-time OCR and text detection. As it is project-based, learners will work on a single script to scan video, webcam input, or an array of images for text. Students should ideally have some familiarity with Python and a basic knowledge of Tensorflow. The course works best for learners in the North America region.
The emphasis is on coding.
"Just comment a few lines of code."
"It includes a lot of pre-coded modules and it is not possible to run them on our own PC."
"I did not like the approach of typing all the code"
Access to the platform may be limited.
"The platform is not helpful to practice and there is no support from RHYME."
"The cloud environment has a problem."
The course lacks detailed explanations.
"Almost no intuitive explanations "
"It keeps changing automatically, you have to stop coding, pause the video and click on the button next to the Fit one."
"No explanation indetails"
Expect a challenging course.
"The course is very poorly organised."
"NEED TO HAVE A DEEP PREVIOUS KNOWLEDGE ON PYTHON AND SOME OTHER STUFF TO UNDERSTAND THE CORE OF THIS."
"Too fast, explanation is too vague and the guy seems to assume that we are already too well versed with this subject."

Activities

Be better prepared before your course. Deepen your understanding during and after it. Supplement your coursework and achieve mastery of the topics covered in Real-time OCR and Text Detection with Tensorflow, OpenCV and Tesseract with these activities:
Follow tutorials on Tensorflow and OpenCV
Following tutorials on Tensorflow and OpenCV will provide you with a solid foundation in these technologies, which will be essential for completing the course projects.
Browse courses on TensorFlow
Show steps
  • Find tutorials on Tensorflow and OpenCV
  • Follow the tutorials step-by-step
  • Complete the exercises
Review 'Deep Learning with Python'
Reviewing 'Deep Learning with Python' will provide you with a deeper understanding of the concepts behind Tensorflow and deep learning, which will be beneficial for completing the course projects.
Show steps
  • Read the book
  • Work through the exercises
  • Apply the concepts to the course projects
Implement a Tensorflow CNN to detect text
Creating a Tensorflow CNN to detect text will solidify your understanding of the course material and help you develop practical skills in computer vision.
Browse courses on TensorFlow
Show steps
  • Collect images with text
  • Annotate the images with text labels
  • Train a Tensorflow CNN model to detect text
  • Evaluate the performance of the model
Four other activities
Expand to see all activities and additional details
Show all seven activities
Practice extracting text from images
Practicing extracting text from images will help you develop proficiency in using OpenCV and other computer vision tools.
Browse courses on Computer Vision
Show steps
  • Use OpenCV to load an image
  • Convert the image to grayscale
  • Apply image processing techniques to enhance the text
  • Use OCR to extract the text
Start a project to build a text detection application
Starting a project to build a text detection application will give you hands-on experience in applying the concepts you learn in the course.
Browse courses on Computer Vision
Show steps
  • Define the scope of the project
  • Gather the necessary resources
  • Develop a plan for the project
  • Start working on the project
  • Complete the project
Create a blog post or video tutorial on text detection
Creating a blog post or video tutorial on text detection will help you solidify your understanding of the concepts and share your knowledge with others.
Browse courses on Computer Vision
Show steps
  • Choose a topic for your blog post or video tutorial
  • Research the topic
  • Write or record your blog post or video tutorial
  • Publish or share your blog post or video tutorial
Participate in a Kaggle competition on text detection
Participating in a Kaggle competition on text detection will give you the opportunity to apply your skills to a real-world problem and compete with other data scientists.
Browse courses on Computer Vision
Show steps
  • Find a Kaggle competition on text detection
  • Join the competition
  • Develop a solution to the competition
  • Submit your solution
  • Evaluate your results

Career center

Learners who complete Real-time OCR and Text Detection with Tensorflow, OpenCV and Tesseract will develop knowledge and skills that may be useful to these careers:
Data Scientist
Data Scientists use data to solve business problems. They may work with data from a variety of sources, such as surveys, experiments, and social media. This course provides a foundation in data science concepts and techniques that are essential for success in this role. It also covers topics such as data visualization and machine learning, which are important for many data science applications.
Machine Learning Engineer
Machine Learning Engineers work with large amounts of data to train machine learning models to solve business challenges. They may also build and maintain the infrastructure that supports these models. This course provides a foundation in machine learning concepts and techniques that are essential for success in this role. It also covers topics such as image processing and natural language processing, which are important for many machine learning applications.
AI Engineer
AI Engineers build and maintain AI systems. They may work with a variety of AI technologies, such as machine learning, natural language processing, and computer vision. This course provides a foundation in AI concepts and techniques that are essential for success in this role. It also covers topics such as image processing and natural language processing, which are important for many AI applications.
Software Engineer
Software Engineers design, develop, and maintain software applications. They may work on a variety of projects, such as website development, mobile app development, and data analysis. This course provides a foundation in software engineering concepts and techniques that are essential for success in this role. It also covers topics such as object-oriented programming and data structures, which are important for many software engineering applications.
Image Processing Engineer
Image Processing Engineers develop and maintain software that can process images to enhance their quality or extract information from them. They may work on a variety of projects, such as medical imaging, remote sensing, and industrial inspection. This course provides a foundation in image processing concepts and techniques that are essential for success in this role. It also covers topics such as image enhancement and image segmentation, which are important for many image processing applications.
Research Scientist
Research Scientists conduct research to advance knowledge in a particular field. They may work in a variety of settings, such as universities, government laboratories, and private companies. This course provides a foundation in research methods and techniques that are essential for success in this role. It also covers topics such as data analysis and scientific writing, which are important for many research projects.
Computer Vision Engineer
Computer Vision Engineers develop and maintain software that can interpret and understand images and videos. They may work on a variety of projects, such as facial recognition, object detection, and medical imaging. This course provides a foundation in computer vision concepts and techniques that are essential for success in this role. It also covers topics such as image processing and machine learning, which are important for many computer vision applications.
Natural Language Processing Engineer
Natural Language Processing Engineers develop and maintain software that can understand and generate human language. They may work on a variety of projects, such as machine translation, speech recognition, and text summarization. This course provides a foundation in natural language processing concepts and techniques that are essential for success in this role. It also covers topics such as text classification and named entity recognition, which are important for many natural language processing applications.
Financial Analyst
Financial Analysts use financial data to make investment recommendations. They may work for a variety of financial institutions, such as investment banks, hedge funds, and pension funds. This course provides a foundation in financial analysis concepts and techniques that are essential for success in this role. It also covers topics such as financial modeling and valuation, which are important for many financial analysis applications.
Actuary
Actuaries use mathematical and statistical methods to assess risk and uncertainty. They may work for a variety of insurance companies, pension funds, and other financial institutions. This course provides a foundation in actuarial science concepts and techniques that are essential for success in this role. It also covers topics such as risk management and financial modeling, which are important for many actuarial applications.
Quantitative Analyst
Quantitative Analysts use mathematical and statistical methods to analyze financial data. They may work for a variety of financial institutions, such as investment banks, hedge funds, and pension funds. This course provides a foundation in quantitative analysis concepts and techniques that are essential for success in this role. It also covers topics such as risk management and financial modeling, which are important for many quantitative analysis applications.
Product Manager
Product Managers are responsible for the development and launch of new products. They may work for a variety of organizations, such as technology companies, consumer goods companies, and manufacturing companies. This course provides a foundation in product management concepts and techniques that are essential for success in this role. It also covers topics such as market research and product development, which are important for many product management applications.
Marketing Analyst
Marketing Analysts use data to analyze marketing campaigns and make recommendations for improvement. They may work for a variety of organizations, such as marketing agencies, corporations, and government agencies. This course provides a foundation in marketing analysis concepts and techniques that are essential for success in this role. It also covers topics such as market research and data analytics, which are important for many marketing analysis applications.
User Experience Designer
User Experience Designers design and evaluate user interfaces. They may work for a variety of organizations, such as technology companies, design firms, and e-commerce companies. This course provides a foundation in user experience design concepts and techniques that are essential for success in this role. It also covers topics such as human-computer interaction and usability testing, which are important for many user experience design applications.
Business Analyst
Business Analysts use data to analyze business processes and make recommendations for improvement. They may work for a variety of organizations, such as consulting firms, corporations, and government agencies. This course provides a foundation in business analysis concepts and techniques that are essential for success in this role. It also covers topics such as data analysis and process improvement, which are important for many business analysis applications.

Reading list

We've selected seven books that we think will supplement your learning. Use these to develop background knowledge, enrich your coursework, and gain a deeper understanding of the topics covered in Real-time OCR and Text Detection with Tensorflow, OpenCV and Tesseract.
Provides a comprehensive overview of deep learning for computer vision. It covers the basics of deep learning as well as more advanced topics such as convolutional neural networks and generative adversarial networks.
Provides a comprehensive overview of deep learning for natural language processing. It covers the basics of deep learning as well as more advanced topics such as recurrent neural networks and attention mechanisms.
Provides a comprehensive overview of machine learning with TensorFlow. It covers the basics of machine learning as well as more advanced topics such as deep learning and natural language processing.
Provides a comprehensive overview of deep learning with R. It covers the basics of deep learning as well as more advanced topics such as convolutional neural networks and recurrent neural networks.
Provides a comprehensive overview of computer vision for visual effects. It covers the basics of computer vision as well as more advanced topics such as motion capture and image-based rendering.
Provides a practical guide to using OpenCV for computer vision and image processing. It covers a wide range of topics, including image processing, object detection, and video analysis.

Share

Help others find this course page by sharing it with your friends and followers:

Similar courses

Here are nine courses similar to Real-time OCR and Text Detection with Tensorflow, OpenCV and Tesseract.
Our mission

OpenCourser helps millions of learners each year. People visit us to learn workspace skills, ace their exams, and nurture their curiosity.

Our extensive catalog contains over 50,000 courses and twice as many books. Browse by search, by topic, or even by career interests. We'll match you to the right resources quickly.

Find this site helpful? Tell a friend about us.

Affiliate disclosure

We're supported by our community of learners. When you purchase or subscribe to courses and programs or purchase books, we may earn a commission from our partners.

Your purchases help us maintain our catalog and keep our servers humming without ads.

Thank you for supporting OpenCourser.

© 2016 - 2024 OpenCourser