An OCR Post-Correction Approach Using Deep Learning for Processing Medical Reports

An OCR Post-Correction Approach Using Deep Learning for Processing Medical Reports

₹5,500.00 ₹4,000.00
Product Code: Python - Deep Learning
Availability: In Stock
Viewed 2050 times

Product Description

Aim:

          Optical character recognition (OCR) can be used for the online retrieval of the printed material such as medical documents, forms, or applications for retrieving valuable information that was available in the printed documents. Deep learning approaches have been used to solve natural language problems.


Abstract:

           According to a recent study, the COVID-19 pandemic continues to place a huge strain on the global health care sector.As a result, the amount of digitally stored patient data such as discharge letters, scan images, test results or free text entries by doctors has grown significantly.This medical data does not conform to a generic structure and is mostly in the form of unstructured digitally generated or scanned paper documents stored as part of a patient’s medical reports. This unstructured data is digitised using Optical Character Recognition (OCR) process. A key challenge here is that the accuracy of the OCR process varies due to the inability of current OCR engines to correctly transcribe scanned or handwritten documents in which text may be skewed, obscured or illegible.The proposed work uses a deep neural network based self-supervised pre-training technique,Robustly Optimized Bidirectional Encoder Representations from Transformers (RoBERTa) that can learn to predict hidden (masked) sections of text to fill in the gaps of non-transcribable parts of the documents being processed. Evaluating the proposed method on domain-specific datasets which include real medical documents, shows a significantly reduced word error rate demonstrating the effectiveness of the approach.


Proposed System:
          More recently, neural networks and deep learning approaches have been used to solve natural language problems. Post-correction methods have been particularly developed and applied such as auto-encoders on Twitter and Wikipedia corpus which had promising improvement in the accuracy with appropriate settings like word lengths and type of the lexicon used to find the nearest match to the incorrect word or neural text embeddings. Equally Long short term memory (LSTMs) have been used for character-aligned strings or the Bidirectional Long Short Term Memory Networks (biLSTMs) to produce a robust character-based language model which does not require annotated training data.


Advantage:

         Using the proposed method, the authors were able to reduce the error rate from 21.2% to 4.2% in the document. Our approach uses the pre-trained model which is not as computationally intense as the approach proposed by Bassil et al. given that our approach is not accessing a tremendous database.


When you order from finalyearprojects.in, you will receive a confirmation email. Once your order is shipped, you will be emailed the tracking information for your order's shipment. You can choose your preferred shipping method on the Order Information page during the checkout process.

The total time it takes to receive your order is shown below:

The total delivery time is calculated from the time your order is placed until the time it is delivered to you. Total delivery time is broken down into processing time and shipping time.

Processing time: The time it takes to prepare your item(s) to ship from our warehouse. This includes preparing your items, performing quality checks, and packing for shipment.

Shipping time: The time for your item(s) to tarvel from our warehouse to your destination.

Shipping from your local warehouse is significantly faster. Some charges may apply.

In addition, the transit time depends on where you're located and where your package comes from. If you want to know more information, please contact the customer service. We will settle your problem as soon as possible. Enjoy shopping!

Download Abstract

Click the below button to download the abstract.

Package Includes

Software Projects Includes

  1. Demo  Video
  2. Abstract
  3. Base paper
  4. Full Project PPT
  5. UML Diagrams
  6. SRS
  7. Source Code
  8. Screen Shots
  9. Software Links
  10. Reference Papers
  11. Full Project Documentation
  12. Online support


The Delivery time for software projects is 2 -3 working days. Some of the software projects will require Hardware interface. Please go through the hardware Requirements in the abstract carefully. The Hardware will take 7-8 Working Days

 

Hardware Projects Includes

  1. Demo  Video
  2. Abstract
  3. Base paper
  4. Full Project PPT
  5. Datasheets
  6. Circuit Diagrams
  7. Source Code
  8. Screen Shots & Photos
  9. Software Links
  10. Reference Papers
  11. Lit survey
  12. Full Project Documentation
  13. Online support


The Delivery time for Hardware projects is 7-8 working days.

   

Mini Projects: Software Includes

  1. Demo  Video
  2. Abstract
  3. Base paper
  4. Full Project PPT
  5. UML Diagrams
  6. SRS
  7. Source Code
  8. Screen Shots
  9. Software Links
  10. Reference Papers
  11. Full Project Documentation
  12. Online support

 

The Delivery time for software Miniprojects is 2 -3 working days.

 

Mini Projects - Hardware includes

  1. Demo  Video
  2. Abstract
  3. PPT
  4. Datasheets
  5. Circuit Diagrams
  6. Source Code
  7. Screen Shots & Photos
  8. Software Links
  9. Reference Papers
  10. Full Project Documentation
  11. Online support

The Delivery time for Hardware Mini projects is 7-8 working days.