Question about Optical Character Recognition Software

hermosatony · Jul 14, 2022

Hello, I am hoping someone could point me in the right direction.

I know very little about programming but I would like to have software that I could load videos into which could be analyzed. Specifically, I’d like the software to be able to output a text file or spreadsheet that contains a timestamp of any time recognizable text was encountered and an output of said text.

Would this even be possible to do? I’m assuming creating Optical Character Recognition from scratch would be prohibitively difficult and expensive but I’m hoping that there is some way to license existing technology for use in a desktop application that I’m hoping to develop.

Any thoughts or ideas are greatly appreciated.

Jenner · Jul 29, 2022

Essentially, this application would be broken down in the following way....

You load in a video file into it.
Detecting and decoding the video format type, the application then renders a series of "frames" as images every "X" seconds - Most likely, a video engine will be needed here, such as FFmpeg - "X" being an arbitrary number, perhaps user defined
These "frame" images are then run through a computer vision engine to analyze them for text - Most likely something like Google's Cloud Vision API
Any returned text is then added to an output file along with the frame number it's found in - which determines the 'time' in the video it appeared.
A final "cleanup" routine then scrubs sequential duplicates or possible over-reporting in the datafile (such as if a scene keeps switching camera 1 to camera 2, and each has some text in the shot)

Deleted member 2829 · Nov 21, 2022

technanosoft said:
If you are talking about facial recognition security technology it's going amazing to talk. This technology of the new world that makes our society's work easier. AI Facial recognition security software is developed to focus on facial recognition technology recognising human faces' sizes and places in digital photographs.
The most used machine learning algorithm for facial recognition is a deep learning Convolutional Neural Network (CNN). CNNs are a type of artificial neural network well-suited for image classification tasks.

Very interesting... but totally irrelevant to the question that was asked !

Welcome!

Question about Optical Character Recognition Software

hermosatony

New Coder

Jenner

Coder

Deleted member 2829

Guest

New Threads

Latest posts

Share this page

Buy us a coffee!

About Us

Site links

We value your privacy

Welcome!

Question about Optical Character Recognition Software

hermosatony

New Coder

Jenner

Coder

Deleted member 2829

Guest

Log in

New Threads

Latest posts

Share this page

Buy us a coffee!

About Us

Site links

Stay Connected

We value your privacy