2025 RoEduNet Conference: Networking in Education and Research

Name: 2025 RoEduNet Conference: Networking in Education and Research
Start: 2025-09-17T10:00:00+03:00
End: 2025-09-19T23:59:00+03:00
Location: Tehnical University of Moldova

17–19 Sept 2025

Tehnical University of Moldova

Europe/Bucharest timezone

Contact

conference@roedu.net

Exploring OCR: Combining Open-Source Engines for Improved Document Digitization

18 Sept 2025, 12:25

15m

Room 3

Technical University of Moldova

Open Source and GNU in Education and Research Open Source Education and Research

Mr Mihai-Lucian PANDELICĂ (Universitatea Politehnica Bucuresti)

Document digitization involves converting physical documents into editable digital text, a process that offers significant benefits such as preserving archives, enabling remote access, and simplifying content modification. Optical Character Recognition (OCR) technologies facilitate this transformation by extracting text from scanned or photographed document images. However, OCR accuracy can be hindered by the wide variety of document layouts and conditions, including issues like faded text and uneven lighting. In this study, we investigate the potential of combining multiple open-source OCR engines to improve digitization accuracy, focusing on the Tesseract and EasyOCR engines. We developed a testing pipeline and conducted experiments targeting challenging scenarios for character recognition. Our results demonstrate that integrating outputs from both engines can enhance performance, highlighting their complementary strengths and the promise of ensemble approaches for more reliable document digitization.

Mr Mihai-Lucian PANDELICĂ (Universitatea Politehnica Bucuresti)

Giorgiana Vlăsceanu (University Politehnica of Bucharest) Mihai Turcanu (Technical University of Moldova)

There are no materials yet.

2025 RoEduNet Conference: Networking in Education and Research

Contact

Exploring OCR: Combining Open-Source Engines for Improved Document Digitization

Room 3

Speaker

Description

Author

Co-authors

Presentation materials

Choose timezone

2025 RoEduNet Conference: Networking in Education and Research

Contact

Speaker

Description

Author

Co-authors

Presentation materials