Loading...
Loading...

Go to the content (press return)

Improving accuracy and speeding up document image classification through parallel systems

Author
Ferrando, J.; Domínguez, J.; Torres, J.; García, R.; García, D.; Garrido, D.; Cortada, J.; Valero, M.
Type of activity
Presentation of work at congresses
Name of edition
20th International Conference on Computational Science
Date of publication
2020
Presentation's date
2020-06
Book of congress proceedings
Computational Science, ICCS 2020, 20th International Conference: Amsterdam, The Netherlands, June 3–5, 2020: proceedings, part II
First page
387
Last page
400
Publisher
Springer
DOI
10.1007/978-3-030-50417-5_29
Project funding
High performance computing VII
Models de programació i entorns d'execució paral·lels
Repository
http://hdl.handle.net/2117/191632 Open in new window
URL
https://link.springer.com/chapter/10.1007%2F978-3-030-50417-5_29 Open in new window
Abstract
This paper presents a study showing the benefits of the EfficientNet models compared with heavier Convolutional Neural Networks (CNNs) in the Document Classification task, essential problem in the digitalization process of institutions. We show in the RVL-CDIP dataset that we can improve previous results with a much lighter model and present its transfer learning capabilities on a smaller in-domain dataset such as Tobacco3482. Moreover, we present an ensemble pipeline which is able to boost sole...
Citation
Ferrando, J. [et al.]. Improving accuracy and speeding up document image classification through parallel systems. A: International Conference on Computational Science. "Computational Science, ICCS 2020, 20th International Conference: Amsterdam, The Netherlands, June 3–5, 2020: proceedings, part II". Berlín: Springer, 2020, p. 387-400.
Keywords
BERT, Deep learning, Document image classification, Parallel system, PyTorch, Scalability, TensorFlow, s EfficientNet
Group of research
CAP - High Performace Computing Group

Participants

  • Ferrando Monsonis, Javier  (author and speaker )
  • Domínguez, Juan Luis  (author and speaker )
  • Torres Viñals, Jordi  (author and speaker )
  • García Fuentes, Raul  (author and speaker )
  • García Doménech, David  (author and speaker )
  • Garrido Miñambres, Daniel  (author and speaker )
  • Cortada, Jordi  (author and speaker )
  • Valero Cortes, Mateo  (author and speaker )

Attachments