Python Pdf 183672 | Ijtsrd41099

Partial capture of text on file.
                                            International Journal of Trend in Scientific Research and Development (IJTSRD) 
                                                                          Volume 5 Issue 3, March-April 2021 Available Online: www.ijtsrd.com e-ISSN: 2456 – 6470 
                                       
                                                                                                                          Optical Character Recognition Using Python 
                                                                                                                                                                                                                            Ponvizhi. U, Ramya. P, Ramya. R 
                                                         UG Scholar, Department of Information Technology, S.A Engineering College, Chennai, Tamil Nadu, India 
                                      ABSTRACT                                                                                                                                                                                                                                                                                                                                                                    How to  cite  this  paper:  Ponvizhi.  U  | 
                                      Optical Character Recognition is a process of classifying optical patterns with                                                                                                                                                                                                                                                                                             Ramya. P | Ramya. R "Optical Character 
                                      respect to alphanumeric or other characters. It also includes segmentation,                                                                                                                                                                                                                                                                                                 Recognition                                                        Using 
                                      feature extraction and classification.                                                                                                                                                                                                                                                                                                                                      Python" Published in 
                                      Deep learning is part of a broader family of machine learning methods based                                                                                                                                                                                                                                                                                                 International Journal 
                                      on artificial neural networks with. representation learning                                                                                                                                                                                                                                                                                                                 of Trend in Scientific 
                                                                                                                                                                                                                                                                                                                                                                                                                  Research                                                                    and                                                                                               
                                      The idea of the project is to extract text from image using Deep Learning by                                                                                                                                                                                                                                                                                                Development (ijtsrd),                                                                                                                          
                                      OCR                                                                                                                                                                                                                                                                                                                                                                         ISSN:                                    2456-6470,                                                            IJTSRD41099 
                                       
                                                                                                                                                                                                                                                                                                                                                                                                                  Volume-5  |  Issue-3, 
                                      KEYWORDS:  OCR-EASYOCR-DEEP  LEARNING-TEXT  DETECTION-TEXT                                                                                                                                                                                                                                                                                                                  April                                 2021,                                    pp.1052-1054,                                                                    URL: 
                                      RECOGNITION-IMAGE EXTRACTION                                                                                                                                                                                                                                                                                                                                                www.ijtsrd.com/papers/ijtsrd41099.pdf 
                                       
                                                                                                                                                                                                                                                                                                                                                                                                                   
                                       
                                                                                                                                                                                                                                                                                                                                                                                                                  Copyright  ©  2021  by  author  (s)  and 
                                       
                                                                                                                                                                                                                                                                                                                                                                                                                  International Journal of Trend in Scientific 
                                       
                                                                                                                                                                                                                                                                                                                                                                                                                  Research and Development Journal. This 
                                       
                                                                                                                                                                                                                                                                                                                                                                                                                  is  an  Open  Access  article  distributed 
                                       
                                                                                                                                                                                                                                                                                                                                                                                                                  under  the  terms  of 
                                       
                                                                                                                                                                                                                                                                                                                                                                                                                  the                                                      Creative 
                                       
                                                                                                                                                                                                                                                                                                                                                                                                                  Commons Attribution 
                                       
                                                                                                                                                                                                                                                                                                                                                                                                                  License                                                            (CC                                               BY                                             4.0) 
                                                                                                                                                                                                                                                                                                                                                                                                                  (http://creativecommons.org/licenses/by/4.0) 
                                      1.  INTRODUCTION                                                                                                                                                                                                                                                                            3.  Motivation And Scope 
                                      OCR, or optical character recognition, is one of the earliest                                                                                                                                                                                                                               Optical  Character  Recognition  is  needed  when  the 
                                      addressed computer vision tasks, since in some aspects it                                                                                                                                                                                                                                   information should be readable both to humans and to a 
                                      does  not  require  deep  learning.  Therefore  there  were                                                                                                                                                                                                                                 machine. 
                                      different  OCR  implementations  even  before  the  deep                                                                                                                                                                                                                                    The  scope  of  this  project  is  to  provide  an  efficient  and 
                                      learning boom in 2012.                                                                                                                                                                                                                                                                      enhanced software for the users to perform Document Image 
                                      This makes many people think the OCR challenge is “solved”,                                                                                                                                                                                                                                 Analysis, document processing by reading and recognizing 
                                      it is no longer challenging. Another belief which comes from                                                                                                                                                                                                                                the  characters  in  research,  academic,  governmental  and 
                                      similar sources is that OCR does not require deep learning,                                                                                                                                                                                                                                 business  organizations  that  are  having  large  pool  of 
                                      or in other words, using deep learning for OCR is an overkill.                                                                                                                                                                                                                              document, scanned images. 
                                      2.  Existing system                                                                                                                                                                                                                                                                         4.  SYSTEM ARCHITECTURE 
                                      In the running world there is growing demand for the users                                                                                                                                                                                                                                  components of the system consist of: Preprocessing, Feature 
                                      to convert the printed documents into electronic document                                                                                                                                                                                                                                   extraction, Preprocessing: This sub-system performs noise 
                                      for maintaining the security of their data.                                                                                                                                                                                                                                                 removal, deploring, filtering and linearization on the input 
                                      Hence the basic OCR system invented to convert the data                                                                                                                                                                                                                                     image.  Next  samples  out  characters  from  preprocessed 
                                      available on papers into computer process-able documents.                                                                                                                                                                                                                                   ancient  documents.  Feature  Extraction:  This  component 
                                                                                                                                                                                                                                                                                                                                  extracts  features  from  the  input  image  and  stores  the 
                                      So the documents can be editable and reusable. Drawback-In                                                                                                                                                                                                                                  extracted features in a feature vector. 
                                      early  OCR  systems  is  that  they  only  have  capability  to 
                                      convert  &  recognize  only  the  documents  of  English  or 
                                      specific. 
                                       
                                       
                                       
                                       
                                       
                                       
                                       
                                       
                                       
                                       
                                       
                                      @ IJTSRD     |     Unique Paper ID – IJTSRD41099      |     Volume – 5 | Issue – 3     |     March-April 2021                                                                                                                                                                                                                                                                                                                                                                                                      Page 1052 
             International Journal of Trend in Scientific Research and Development (IJTSRD) @ www.ijtsrd.com eISSN: 2456-6470 
          5.  ARCHITECTURE OF OCR 
          6.  LIST OF MODULES                                                                                           
          The recognition system has two main modules:  
          Text detection based on Connectionist Text Proposal Network  
          Text recognition based on Attention-based Encoder-Decoder.  
          Text detection based on Connectionist Text Proposal Network 
          Connectionist Text Proposal Network (CTPN) that accurately localizes text lines in natural image. The CTPN detects a text line 
          in a sequence of fine-scale text proposals directly in convolution feature maps.  
          The CTPN works reliably on multi-scale and multi-language text without further post-processing, departing from previous 
          bottom-up methods requiring multi-step post filtering 
          Text recognition based on Attention-based Encoder-Decoder 
          Accurate and rich semantic information carried by the text is important for many application scenarios such as image 
          searching, intelligent inspection, product recognition and autonomous driving. For these reasons, scene text recognition has 
          been an active research field in computer vision  
          Although optical character recognition in scanned documents has been considered as a solved problem  
          7.  ALGORITHM 
          Convolution Recurrent Neural Networks 
            Convolution Neural Networks (CNN). 
            Recurrent Neural Networks (RNN). 
            Long Short Term Memory Networks (LSTMs). 
                                                                           CNN                                                         
                                                                               
                                                                               
          @ IJTSRD     |     Unique Paper ID – IJTSRD41099      |     Volume – 5 | Issue – 3     |     March-April 2021               Page 1053 
             International Journal of Trend in Scientific Research and Development (IJTSRD) @ www.ijtsrd.com eISSN: 2456-6470 
                                                                            RNN                                                               
                                                                                
                                                                           LSTMs                                                              
          8.  RESULT 
          바보처럼 너만생각해                                                                                                              
          Like afool, 
          only think of you 
          940812.TUMBLA.COM 
           
          9.  RELATED WORK                                                                
          [1]     1982. Schantz, H. The History of OCR. Manchester                [3]    1990.     Adams,  R.  Sourcebook  of  Automatic 
                  Center,    VT:   Recognition      Technologies      Users              Identification  and  Data  Collection.  New  York:  Van 
                  Association. (The history of OCR is related from its                   Nostrand  Reinhold.  (This  book  is  a  good  general 
                  inauspicious beginnings up to its current commercial                   reference  for  OCR.  It  also  considers  a  number  of 
                  success.) Google ScholarDigital Library.                               commercially  available  OCR  systems.  Names, 
          [2]     1985. Smith, J. W., and Merali, Z. Optical Character                   addresses, and phone numbers of many OCR vendors 
                  Recognition: The Technology and its Application in                     are given.) Google ScholarDigital Library 
                  Information Units and Libraries. The British Library.           [4]    1999. Rice, S. V., Nagy, G., and Nartker, T. A. Optical 
                  (This report is intended for use by anyone who is                      Character Recognition: An Illustrated Guide to the 
                  considering OCR in an information or library context.                  Frontier.  Boston:  Kluwer.  Google  ScholarDigital 
                  Since minimal knowledge of OCR is assumed, general                     Library Show Fewer References Index Term. 
                  background material is abundant.)Google Scholar               
          @ IJTSRD     |     Unique Paper ID – IJTSRD41099      |     Volume – 5 | Issue – 3     |     March-April 2021                 Page 1054
The words contained in this file might help you see if this file matches what you are looking for:

...International journal of trend in scientific research and development ijtsrd volume issue march april available online www com e issn optical character recognition using python ponvizhi u ramya p r ug scholar department information technology s a engineering college chennai tamil nadu india abstract how to cite this paper is process classifying patterns with respect alphanumeric or other characters it also includes segmentation feature extraction classification published deep learning part broader family machine methods based on artificial neural networks representation the idea project extract text from image by ocr keywords easyocr detection pp url papers pdf copyright author an open access article distributed under terms...
Related files

Share

Help

Related files

Share

Share to social media

Help

Login Area