1. Jain P., Taneja K., Taneja H. Which OCR toolset is good and why: A comparative study // Kuwait Journal of Science. - 2021. - V. 48, No 2. DOI: 10.48129/KJS.V48I2.9589 EDN: ARCPBN
2. Singh A., Bacchuwar K., Bhasin A. A survey of OCR applications // International Journal of Machine Learning and Computing. - 2012. -V. 2, No 3. - P. 314. DOI: 10.7763/ijm-lc.2012.v2.137
3. Chaudhuri A. [et al.] Optical character recognition systems. - Springer International Publishing, 2017. - P. 9-41. DOI: 10.1007/978-3-319-50252-6_2
4. Isheawy N. A. M., Hasan H. Optical character recognition (OCR) system // IOSR Journal of Computer Engineering (IOSR-JCE), e-ISSN. -2015. - P. 22-26.
5. Memon J. [et al.] Handwritten optical character recognition (OCR): A comprehensive systematic literature review (SLR) // IEEE access. - 2020. - V. 8. DOI: 10.1109/ACCESS.2020.3012542 EDN: SLVUTL
6. Smith R. W. History of the Tesseract OCR engine: what worked and what didn’t //Document Recognition and Retrieval XX. - SPIE, 2013. DOI: 10.1117/12.2010051
7. Smith R. An overview of the Tesseract OCR engine //Ninth international conference on document analysis and recognition (ICDAR 2007). - IEEE, 2007. - V. 2. - P. 629-633. DOI: 10.1109/ICDAR.2007.4376991
8. Smith R., Antonova D., Lee D. S. Adapting the Tesseract open source OCR engine for multilingual OCR //Proceedings of the international workshop on multilingual OCR. - 2009. - P. 1-8. DOI: 10.1145/1577802.1577804
9. Badla S. Improving the efficiency of Tesseract OCR Engine. - 2014. DOI: 10.31979/etd.5avd-kf2g
10. Garlapati B. M., Chalamala S. R. A system for handwritten and printed text classification // 2017 UKSim-AMSS 19th International Conference on Computer Modelling & Simulation (UKSim). - IEEE, 2017. - P. 50-54. DOI: 10.1109/UKSim.2017.37
11. Khan K. [et al.] Urdu text classification using decision trees // 2015 12th International Con ference on High-capacity Optical Networks and Enabling/Emerging Technologies (HONET). -IEEE, 2015. - P. 1-4. DOI: 10.1109/HONET.2015.7395445
12. Springmann U. [et al.] Ground Truth for training OCR engines on historical documents in German Fraktur and Early Modern Latin //arXiv preprint arXiv:1809.05501. - 2018. DOI: 10.21248/jlcl.33.2018.220
13. Reul C. [et al.] State of the art optical character recognition of 19th century fraktur scripts using open source engines //arXiv preprint arXiv:1810.03436. - 2018.
14. Chaitra Y. L. [et al] Text Detection and Recognition from the Scene Images Using RCNN and EasyOCR // International Conference on Information and Communication Technology for Intelligent Systems. - Singapore: Springer Nature Singapore, 2023. - P. 75-85. DOI: 10.1007/978-981-99-3761-5_8
15. GitHub - JaidedAI/EasyOCR: Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc. [Электронный ресурс]. URL: https://github.com/JaidedAI/EasyOCR (дата обращения: 15.11.2023).
16. Marne M. G. [et al.] Identification of optimal optical character recognition (OCR) engine for proposed system // 2018 Fourth International Conference on Computing Communication Control and Automation (ICCUBEA). - IEEE, 2018. - P. 1-4. DOI: 10.1109/IC-CUBEA.2018.8697487
17. Gordin S., Romach A. Gordin S., Romach A. Optical Character Recognition for Complex Scripts: A Case-study in Cuneiform // ADHO 2022-Tokyo. - 2022.
18. Du Y. [et al.] PP-OCR: A practical ultra lightweight OCR system. arXiv 2020 //arXiv preprint arXiv:2009.09941. - 2009.
19. Li C. [et al.] PP-OCRv3: More attempts for the improvement of ultra lightweight OCR system //arXiv preprint arXiv:2206.03001. - 2022.
20. Wang H. [et al.] Pre-trained language models and their applications //Engineering. - 2022. https://doi.org/10.1016Zj.eng.2022.04.024.
21. Devlin J. [et al.] (2018) BERT: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805.
22. Radford A. [et al.] (2018) Improving language understanding by generative pre-training.
23. Beltagy I., Peters M. E. and Cohan A. (2020) Longformer: The long-document transformer. arXiv preprint arXiv:2004.05150.
24. Bazzo G. T. [et al.] (2020) Assessing the impact of OCR errors in information retrieval. Advances in Information Retrieval: 42nd European Conference on IR Research, ECIR 2020, Lisbon, Portugal, April 14-17, 2020, Proceedings, Part II 42. Springer International Publishing. P. 102-109. DOI: 10.1007/978-3-030-454425_13
25. kazzand/ru-longformer-base-4096 - Hugging Face https://huggingface.co/kazzand/ru-longformer-base-4096.