Skip to main content

Offline Arabic Handwriting Recognition with Multidimensional Recurrent Neural Networks

  • Chapter

Abstract

Offline handwriting recognition requires a combination of computer vision and sequence learning. In most systems the two elements are handled separately, with sophisticated pre-processing techniques used to extract the image features and sequential models such as HMMs used to provide the transcriptions. This chapter considers an alternative system, based on multidimensional recurrent neural networks, that learns directly from pixel data, and describes its winning entry to a major Arabic offline handwriting recognition competition.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Baldi, P., Pollastri, G.: The principled design of large-scale recursive neural network architectures-DAG-RNNs and the protein structure prediction problem. J. Mach. Learn. Res. 4, 575–602 (2003)

    Google Scholar 

  2. Bridle, J.S.: Probabilistic interpretation of feedforward classification network outputs, with relationships to statistical pattern recognition. In: Fogleman-Soulie, F., Herault, J. (eds.) Neurocomputing: Algorithms, Architectures and Applications, pp. 227–236. Springer, Berlin (1990)

    Google Scholar 

  3. Gers, F., Schraudolph, N., Schmidhuber, J.: Learning precise timing with LSTM recurrent networks. J. Mach. Learn. Res. 3, 115–143 (2002)

    MathSciNet  Google Scholar 

  4. Graves, A.: Supervised sequence labelling with recurrent neural networks. Ph.D. in Informatics, Fakultat für Informatik—Technische Universität München (2008)

    Google Scholar 

  5. Graves, A., Fernández, S., Gomez, F., Schmidhuber, J.: Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks. In: Proceedings of the International Conference on Machine Learning, ICML 2006, Pittsburgh, PA, USA (2006)

    Google Scholar 

  6. Graves, A., Fernández, S., Liwicki, M., Bunke, H., Schmidhuber, J.: Unconstrained online handwriting recognition with recurrent neural networks. In: Platt, J., Koller, D., Singer, Y., Roweis, S. (eds.) Advances in Neural Information Processing Systems, vol. 20. MIT Press, Cambridge (2008)

    Google Scholar 

  7. Graves, A., Fernández, S., Schmidhuber, J.: Multidimensional recurrent neural networks. In: Proceedings of the 2007 International Conference on Artificial Neural Networks, Porto, Portugal, September 2007

    Google Scholar 

  8. Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)

    Article  Google Scholar 

  9. Hu, J., Lim, S.G., Brown, M.K.: Writer independent on-line handwriting recognition using an HMM approach. Pattern Recognit. 33, 133–147 (2000)

    Article  Google Scholar 

  10. Jaeger, S., Manke, S., Reichert, J., Waibel, A.: On-line handwriting recognition: the NPen++ recognizer. Int. J. Doc. Anal. Recognit. 3, 169–180 (2001)

    Article  Google Scholar 

  11. Jiang, H.: Discriminative training of HMMs for automatic speech recognition: A survey. Comput. Speech Lang. 24(4), 589–608 (2010)

    Article  Google Scholar 

  12. LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proc. IEEE 86(11), 2278–2324 (1998)

    Article  Google Scholar 

  13. LeCun, Y., Muller, U., Ben, J., Cosatto, E., Flepp, B.: Off-road obstacle avoidance through end-to-end learning. In: Advances in Neural Information Processing Systems (NIPS 2005). MIT Press, Cambridge (2005)

    Google Scholar 

  14. Märgner, V., El Abed, H.: In: ICDAR 2009 Arabic Handwriting Recognition Competition, Jul. 2009, pp. 1383–1387 (2009)

    Google Scholar 

  15. Ng, A.Y., Jordan, M.I.: On discriminative vs. generative classifiers: A comparison of logistic regression and naive Bayes. In: NIPS, pp. 841–848 (2001)

    Google Scholar 

  16. Pechwitz, M., Maddouri, S.S., Märgner, V., Ellouze, N., Amiri, H.: IFN/ENIT-database of handwritten Arabic words. In: 7th Colloque International Francophone sur l’Ecrit et le Document (CIFED 2002), Hammamet, Tunis (2002)

    Google Scholar 

  17. Reisenhuber, M., Poggio, T.: Hierarchical models of object recognition in cortex. Nat. Neurosci. 2(11), 1019–1025 (1999)

    Article  Google Scholar 

  18. Schuster, M., Paliwal, K.K.: Bidirectional recurrent neural networks. IEEE Trans. Signal Process. 45, 2673–2681 (1997)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Alex Graves .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag London

About this chapter

Cite this chapter

Graves, A. (2012). Offline Arabic Handwriting Recognition with Multidimensional Recurrent Neural Networks. In: Märgner, V., El Abed, H. (eds) Guide to OCR for Arabic Scripts. Springer, London. https://doi.org/10.1007/978-1-4471-4072-6_12

Download citation

  • DOI: https://doi.org/10.1007/978-1-4471-4072-6_12

  • Publisher Name: Springer, London

  • Print ISBN: 978-1-4471-4071-9

  • Online ISBN: 978-1-4471-4072-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics