Autoencoder image denoising to increase optical character recognition performance in text conversion

N Alamsyah, MN Fauzan, AG Putrada… - … on Advanced Creative …, 2022 - ieeexplore.ieee.org
2022 International Conference on Advanced Creative Networks and …, 2022ieeexplore.ieee.org
Document digitization has an important role in helping the company's activities be more
efficient, such as detecting text in invoice document images using optical character
recognition (OCR). However, writing in images has many problems, especially tediously
saved documents that can cause noise or interference in the picture, resulting in difficultly
recognized writing. Our research aims to build an autoencoder for denoising text images
and evaluate the OCR's performance in converting the denoised image into text. The first …
Document digitization has an important role in helping the company’s activities be more efficient, such as detecting text in invoice document images using optical character recognition (OCR). However, writing in images has many problems, especially tediously saved documents that can cause noise or interference in the picture, resulting in difficultly recognized writing. Our research aims to build an autoencoder for denoising text images and evaluate the OCR’s performance in converting the denoised image into text. The first step in the research is to test the OCR characteristics on the original text image and the text image given Gaussian noise. The next step is to build the optimal autoencoder model for denoising by studying the effect of dataset size and optimizer type. The last step is to test the OCR performance on the denoised text image produced by the optimum autoencoder model. The test results show that datasetsize affects denoising performance and OCR performance. From several autoencoder models compared, the autoencoder with dataset size =40 has the optimum performance, where the MSE values of the model for train and validation are 1277 and 1385, respectively. With images denoised from the optimum model, the OCR performance in converting images into text is 100%.
ieeexplore.ieee.org
以上显示的是最相近的搜索结果。 查看全部搜索结果