Some paper and dataset links collected from:
- [1] wanghaisheng/awesome-ocr
- [2] kba/awesome-ocr
- [3] chongyangtao/Awesome-Scene-Text-Recognition
- [4] whitelok/image-text-localization-recognition
- [5] Text detection and recognition resources (文字检测与识别资源)
- [6] OCR material
- [7] handong1587
- [8] hs105/Deep-Learning-for-OCR
- [9] Collected materials on text detection and recognition (文字检测与识别资料整理)
- [10] hwalsuklee/awesome-deep-text-detection-recognition
You can also visit the ICDAR website and find strong OCR models in the "Ranking Table" on each competition's results page.
- 【Synthetic data】de Campos T E, Babu B R, Varma M. Character recognition in natural images. In VISAPP, 2009
- Epshtein B, Ofek E, Wexler Y. Detecting text in natural scenes with stroke width transform[C]//Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on. IEEE, 2010: 2963-2970.
code:[code]
- Rusinol M, Aldavert D, Toledo R, et al. Browsing heterogeneous document collections by a segmentation-free word spotting method[C]//Document Analysis and Recognition (ICDAR), 2011 International Conference on. IEEE, 2011: 63-67.
- Neumann L, Matas J. Text localization in real-world images using efficiently pruned exhaustive search[C]//Document Analysis and Recognition (ICDAR), 2011 International Conference on. IEEE, 2011: 687-691.
- 【Synthetic data】Wang T, Wu D J, Coates A, et al. End-to-end text recognition with convolutional neural networks[C]//Pattern Recognition (ICPR), 2012 21st International Conference on. IEEE, 2012: 3304-3308.
code:[code]
- Elagouni K, Garcia C, Mamalet F, et al. Text recognition in videos using a recurrent connectionist approach[C]//International Conference on Artificial Neural Networks. Springer, Berlin, Heidelberg, 2012: 172-179.
- Frinken V, Fischer A, Manmatha R, et al. A novel word spotting method based on recurrent neural networks[J]. IEEE transactions on pattern analysis and machine intelligence, 2012, 34(2): 211-224.
- Neumann L, Matas J. Real-time scene text localization and recognition[C]//Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on. IEEE, 2012: 3538-3545.
code:[code]
- Mishra A, Alahari K, Jawahar C V. Top-down and bottom-up cues for scene text recognition[C]//Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on. IEEE, 2012: 2687-2694.
- Yin X C, Yin X, Huang K, et al. Robust text detection in natural scene images[J]. IEEE transactions on pattern analysis and machine intelligence, 2014, 36(5): 970-983.
- Bissacco A, Cummins M, Netzer Y, et al. Photoocr: Reading text in uncontrolled conditions[C]//Proceedings of the IEEE International Conference on Computer Vision. 2013: 785-792.
- Breuel T M, Ul-Hasan A, Al-Azawi M A, et al. High-performance OCR for printed English and Fraktur using LSTM networks[C]//Document Analysis and Recognition (ICDAR), 2013 12th International Conference on. IEEE, 2013: 683-687.
code:[code]
- Milyaev S, Barinova O, Novikova T, et al. Image binarization for end-to-end text understanding in natural images[C]//Document Analysis and Recognition (ICDAR), 2013 12th International Conference on. IEEE, 2013: 128-132.
- Neumann L, Matas J. On combining multiple segmentations in scene text recognition[C]//Document Analysis and Recognition (ICDAR), 2013 12th International Conference on. IEEE, 2013: 523-527.
- Koo H I, Kim D H. Scene text detection via connected component clustering and nontext filtering[J]. IEEE transactions on image processing, 2013, 22(6): 2296-2305.
- Shi C, Wang C, Xiao B, et al. Scene text recognition using part-based tree-structured character detection[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2013: 2961-2968.
- Halima M B, Karray H, Alimi A M. Arabic text recognition in video sequences[J]. arXiv preprint arXiv:1308.3243, 2013.
- Zaghden N, Khelifi B, Alimi A M, et al. Text Recognition in both ancient and cartographic documents[J]. arXiv preprint arXiv:1308.6309, 2013.
- Alsharif O, Pineau J. End-to-end text recognition with hybrid HMM maxout models[J]. arXiv preprint arXiv:1310.1811, 2013.
- Louradour J, Kermorvant C. Curriculum learning for handwritten text line recognition[C]//Document Analysis Systems (DAS), 2014 11th IAPR International Workshop on. IEEE, 2014: 56-60.
- Goodfellow I J, Bulatov Y, Ibarz J, et al. Multi-digit number recognition from street view imagery using deep convolutional neural networks[J]. arXiv preprint arXiv:1312.6082, 2013.
- Bušta M, Drtina T, Helekal D, et al. Efficient character skew rectification in scene text images[C]//Asian Conference on Computer Vision. Springer, Cham, 2014: 134-146.
- Almazán J, Gordo A, Fornés A, et al. Word spotting and recognition with embedded attributes[J]. IEEE transactions on pattern analysis and machine intelligence, 2014, 36(12): 2552-2566.
code:[code]
- Jaderberg M, Vedaldi A, Zisserman A. Deep features for text spotting[C]//European conference on computer vision. Springer, Cham, 2014: 512-528.
code:[code]
- Bluche T, Ney H, Kermorvant C. A comparison of sequence-trained deep neural networks and recurrent neural networks optical modeling for handwriting recognition[C]//International Conference on Statistical Language and Speech Processing. Springer, Cham, 2014: 199-210.
- Yao C, Bai X, Liu W. A unified framework for multioriented text detection and recognition[J]. IEEE Transactions on Image Processing, 2014, 23(11): 4737-4749.
- Huang W, Qiao Y, Tang X. Robust scene text detection with convolution neural network induced mser trees[C]//European Conference on Computer Vision. Springer, Cham, 2014: 497-511.
- Bhowmick S, Banerjee P. Bangla text recognition from video sequence: A new focus[J]. arXiv preprint arXiv:1401.1190, 2014.
- 【Synthetic data】Jaderberg M, Simonyan K, Vedaldi A, et al. Synthetic data and artificial neural networks for natural scene text recognition[J]. arXiv preprint arXiv:1406.2227, 2014.
code:[model;official website]
- Jaderberg M, Simonyan K, Vedaldi A, et al. Reading text in the wild with convolutional neural networks[J]. International Journal of Computer Vision, 2016, 116(1): 1-20.
official website:[official website]
- Jaderberg M, Simonyan K, Vedaldi A, et al. Deep structured output learning for unconstrained text recognition[J]. arXiv preprint arXiv:1412.5903, 2014.
- Kim B S, Koo H I, Cho N I. Document dewarping via text-line based optimization[J]. Pattern Recognition, 2015, 48(11): 3600-3614.
- Ye Q, Doermann D. Text detection and recognition in imagery: A survey[J]. IEEE transactions on pattern analysis and machine intelligence, 2015, 37(7): 1480-1500.
- Jaderberg M. Deep learning for text spotting[D]. University of Oxford, 2015.
- Ren X, Chen K, Yang X, et al. A new unsupervised convolutional neural network model for Chinese scene text detection[C]//Signal and Information Processing (ChinaSIP), 2015 IEEE China Summit and International Conference on. IEEE, 2015: 428-432.
- Wang Z, Yang J, Jin H, et al. Deepfont: Identify your font from an image[C]//Proceedings of the 23rd ACM international conference on Multimedia. ACM, 2015: 451-459.
- Gomez L, Karatzas D. Object proposals for text extraction in the wild[C]//Document Analysis and Recognition (ICDAR), 2015 13th International Conference on. IEEE, 2015: 206-210.[code]
- Shi B, Yao C, Zhang C, et al. Automatic script identification in the wild[C]//Document Analysis and Recognition (ICDAR), 2015 13th International Conference on. IEEE, 2015: 531-535.
- Busta M, Neumann L, Matas J. Fastext: Efficient unconstrained scene text detector[C]//Proceedings of the IEEE International Conference on Computer Vision. 2015: 1206-1214.[code]
- Zhang Z, Shen W, Yao C, et al. Symmetry-based text line detection in natural scenes[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2015: 2558-2567.
code:[code]
- Ray A, Rajeswar S, Chaudhury S. A hypothesize-and-verify framework for text recognition using deep recurrent neural networks[C]//Document Analysis and Recognition (ICDAR), 2015 13th International Conference on. IEEE, 2015: 936-940.
- Neumann L, Matas J. Efficient scene text localization and recognition with local character refinement[C]//Document Analysis and Recognition (ICDAR), 2015 13th International Conference on. IEEE, 2015: 746-750.
- Visin F, Kastner K, Cho K, et al. Renet: A recurrent neural network based alternative to convolutional networks[J]. arXiv preprint arXiv:1505.00393, 2015.
- Zhong Z, Jin L, Xie Z. High performance offline handwritten chinese character recognition using googlenet and directional feature maps[C]//Document Analysis and Recognition (ICDAR), 2015 13th International Conference on. IEEE, 2015: 846-850.
code:[code]
- 【CRNN】Shi B, Bai X, Yao C. An end-to-end trainable neural network for image-based sequence recognition and its application to scene text recognition[J]. IEEE transactions on pattern analysis and machine intelligence, 2017, 39(11): 2298-2304.
code:【1 - official】; 【2 - crnn.pytorch】; 【3 - unfinished】; 【4 - crnn.pytorch-chinese】; 【5 - crnn+stn-tf】; 【6 - lstm+ctc】; 【7 - ctpn+crnn-merge-cannot-train】; 【8 - crnn-mnist-keras】; 【9 - crnn-tf】; 【10 - crnn-tf-could-be-better】; 【11 - crnn.mxnet】; 【12 - crnn-tf-estimators】; 【13 - crnn-attention-tf】; 【14 - crnn.caffe】; 【15 - chinese.ocr-ctpn+crnn-tf+pytorch】; 【16 - another.crnn-attentive pooling】; 【17 - crnn-tf-music】; 【18 - crnn-tf-developing】; 【19 - crnn-torch】; 【20 - crnn-tf-developing】; 【21 - chinese-ocr-keras】; 【22 - crnn-tf-developing】; 【23 - ctpn+crnn-cannot-train-7】; 【24 - crnn-pytorch】; 【25 - cnn+lstm+ctc-tf】; 【26 - crnn-tf-resnet】; 【27 - caffe_ocr】
- He T, Huang W, Qiao Y, et al. Text-attentional convolutional neural network for scene text detection[J]. IEEE transactions on image processing, 2016, 25(6): 2529-2541.
- Sahu D K, Sukhwani M. Sequence to sequence learning for optical character recognition[J]. arXiv preprint arXiv:1511.04176, 2015.
- Hosseini-Asl E, Guha A. Similarity-based Text Recognition by Deeply Supervised Siamese Network[J]. arXiv preprint arXiv:1511.04397, 2015.
- Wang D H, Wang H, Zhang D, et al. Robust Scene Text Recognition Using Sparse Coding based Features[J]. arXiv preprint arXiv:1512.08669, 2015.
- Yin X C, Zuo Z Y, Tian S, et al. Text detection, tracking and recognition in video: a comprehensive survey[J]. IEEE Transactions on Image Processing, 2016, 25(6): 2752-2773.
- Zhu Y, Yao C, Bai X. Scene text detection and recognition: Recent advances and future trends[J]. Frontiers of Computer Science, 2016, 10(1): 19-36.
- He P, Huang W, Qiao Y, et al. Reading Scene Text in Deep Convolutional Sequences[C]//AAAI. 2016: 3501-3508.
code:[code]
- Lee C Y, Osindero S. Recursive recurrent nets with attention modeling for OCR in the wild[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2016: 2231-2239.
- 【Synthetic data】Gupta A, Vedaldi A, Zisserman A. Synthetic data for text localisation in natural images[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2016: 2315-2324.
code:[official;vgg;other]
- Sivakorn S, Polakis J, Keromytis A D. I’m not a human: Breaking the Google reCAPTCHA[J]. Black Hat,(i), 2016: 1-12.
- Sivakorn S, Polakis I, Keromytis A D. I am robot:(deep) learning to break semantic image captchas[C]//Security and Privacy (EuroS&P), 2016 IEEE European Symposium on. IEEE, 2016: 388-403.
- Neumann L, Matas J. Real-time lexicon-free scene text localization and recognition[J]. IEEE transactions on pattern analysis and machine intelligence, 2016, 38(9): 1872-1885.
- Zhang Z, Zhang C, Shen W, et al. Multi-oriented text detection with fully convolutional networks[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2016: 4159-4167.
- Fabrizio J, Robert-Seidowsky M, Dubuisson S, et al. TextCatcher: a method to detect curved and challenging text in natural scenes[J]. International Journal on Document Analysis and Recognition (IJDAR), 2016, 19(2): 99-117.
- Cho H, Sung M, Jun B. Canny text detector: Fast and robust scene text localization algorithm[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2016: 3566-3573.
- Qiang G, Dan T, Guohui L, et al. Memory Matters: Convolutional Recurrent Neural Network for Scene Text Recognition[J]. arXiv preprint arXiv:1601.01100, 2016.
- Mishra A, Alahari K, Jawahar C V. Enhancing energy minimization framework for scene text recognition with top-down cues[J]. Computer Vision and Image Understanding, 2016, 145: 30-42.
- Li H, Shen C. Reading car license plates using deep convolutional neural networks and lstms[J]. arXiv preprint arXiv:1601.05610, 2016.
- 【Dataset】Veit A, Matera T, Neumann L, et al. Coco-text: Dataset and benchmark for text detection and recognition in natural images[J]. arXiv preprint arXiv:1601.07140, 2016.
- Huang W. Context modeling for semantic text matching and scene text detection[M]. The Pennsylvania State University, 2016.
- Tian S, Pei W Y, Zuo Z Y, et al. Scene Text Detection in Video by Learning Locally and Globally[C]//IJCAI. 2016: 2647-2653.
- Shi B, Wang X, Lyu P, et al. Robust scene text recognition with automatic rectification[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2016: 4168-4176.
- Shuye Zhang, Mude Lin, Tianshui Chen, Lianwen Jin, Liang Lin. Character Proposal Network for Robust Text Extraction. arXiv preprint arXiv:1602.04348, 2016.
- Lluis Gomez, Dimosthenis Karatzas. A fine-grained approach to scene text script identification. arXiv preprint arXiv:1602.07475, 2016.
- Lluis Gomez, Anguelos Nicolaou, Dimosthenis Karatzas. Improving patch-based scene text script identification with ensembles of conjoined networks. arXiv preprint arXiv:1602.07480, 2016.
- He T, Huang W, Qiao Y, et al. Accurate text localization in natural image with cascaded convolutional text network[J]. arXiv preprint arXiv:1603.09423, 2016.
- Hafemann L G, Sabourin R, Oliveira L S. Writer-independent feature learning for offline signature verification using deep convolutional neural networks[C]//Neural Networks (IJCNN), 2016 International Joint Conference on. IEEE, 2016: 2576-2583.
- Ren X, Chen K, Sun J. A CNN Based Scene Chinese Text Recognition Algorithm With Synthetic Data Engine[J]. arXiv preprint arXiv:1604.01891, 2016.
- Xiaohang Ren, Kai Chen, Jun Sun. A Novel Scene Text Detection Algorithm Based On Convolutional Neural Network. arXiv preprint arXiv:1604.01894, 2016.
- Gómez L, Karatzas D. Textproposals: a text-specific selective search algorithm for word spotting in the wild[J]. Pattern Recognition, 2017, 70: 60-74.[code]
- Bluche T, Louradour J, Messina R. Scan, attend and read: End-to-end handwritten paragraph recognition with mdlstm attention[J]. arXiv preprint arXiv:1604.03286, 2016.
- Zheng Zhang, Chengquan Zhang, Wei Shen, Cong Yao, Wenyu Liu, Xiang Bai. Multi-Oriented Text Detection with Fully Convolutional Networks. arXiv preprint arXiv:1604.04018, 2016.
- Xie Z, Sun Z, Jin L, et al. Fully convolutional recurrent network for handwritten Chinese text recognition[C]//Pattern Recognition (ICPR), 2016 23rd International Conference on. IEEE, 2016: 4011-4016.
- Shangxuan Tian, Yifeng Pan, Chang Huang, Shijian Lu, Kai Yu, Chew Lim Tan. Text Flow: A Unified Text Detection System in Natural Scene Images. arXiv preprint arXiv:1604.06877, 2016.
- Zhong Z, Jin L, Zhang S, et al. Deeptext: A unified framework for text proposal generation and text detection in natural images[J]. arXiv preprint arXiv:1605.07314, 2016.
- Zhang X Y, Yin F, Zhang Y M, et al. Drawing and recognizing chinese characters with recurrent neural network[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017.
- Yao C, Bai X, Sang N, et al. Scene text detection via holistic, multi-channel prediction[J]. arXiv preprint arXiv:1606.09002, 2016.
- Hassanien A M A. Sequence to sequence learning for unconstrained scene text recognition[J]. arXiv preprint arXiv:1607.06125, 2016.
- Nitigya Sambyal, Pawanesh Abrol. Automatic text extraction and character segmentation using maximally stable extremal regions. arXiv preprint arXiv:1608.03374, 2016.
- 【Synthetic data】 Krishnan P, Jawahar C V. Generating Synthetic Data for Text Recognition[J]. arXiv preprint arXiv:1608.04224, 2016.
- 【CTPN】Tian Z, Huang W, He T, et al. Detecting text in natural image with connectionist text proposal network[C]//European Conference on Computer Vision. Springer International Publishing, 2016: 56-72.
code:[code;cuda8-caffe;official;ocr_detection_ctpn;keras_ocr]
dataset:[ICDAR 2011; ICDAR 2013; ICDAR 2015; SWT; Multilingual dataset]
- Xie Z, Sun Z, Jin L, et al. Learning spatial-semantic context with fully convolutional recurrent network for online handwritten chinese text recognition[J]. IEEE transactions on pattern analysis and machine intelligence, 2017.
- Hu B, Liu X, Wu X, et al. Stroke Sequence-Dependent Deep Convolutional Neural Network for Online Handwritten Chinese Character Recognition[J]. arXiv preprint arXiv:1610.04057, 2016.
- 【Dataset】Ahmed Ibrahim, A. Lynn Abbott, Mohamed E. Hussein. An Image Dataset of Text Patches in Everyday Scenes. arXiv preprint arXiv:1610.06494, 2016.
- Lou X, Kansky K, Lehrach W, et al. Generative Shape Models: Joint Text Recognition and Segmentation with Very Little Training Data[C]//Advances in Neural Information Processing Systems. 2016: 2793-2801.
- Xu Y, Shan S, Qiu Z, et al. End-to-End Subtitle Detection and Recognition for Videos in East Asian Languages via CNN Ensemble with Near-Human-Level Performance[J]. arXiv preprint arXiv:1611.06159, 2016.
- Chengzhe Yan, Jie Hu, Changshui Zhang. A DNN Framework For Text Image Rectification From Planar Transformations. arXiv preprint arXiv:1611.04298, 2016.
- Minghui Liao, Baoguang Shi, Xiang Bai, Xinggang Wang, Wenyu Liu. TextBoxes: A Fast Text Detector with a Single Deep Neural Network. arXiv preprint arXiv:1611.06779, 2016.
- Jie Mei, Aminul Islam, Yajing Wu, Abidalrahman Moh'd, Evangelos E. Milios. Statistical Learning for OCR Text Correction. arXiv preprint arXiv:1611.06950, 2016.
- Yang X, He D, Huang W, et al. Smart Library: Identifying Books in a Library using Richly Supervised Deep Scene Text Reading[J]. arXiv preprint arXiv:1611.07385, 2016.
- Junnan Yu, Xuna Ma, Ting Han. Usability Investigation on the Localization of Text CAPTCHAs: Take Chinese Characters as a Case Study. arXiv preprint arXiv:1612.01070, 2016.
- Singh Vijendra, Nisha Vasudeva, Hem Jyotsana Parashar. Recognition of Text Image Using Multilayer Perceptron. arXiv preprint arXiv:1612.00625, 2016.
- Zichuan Liu, Yixing Li, Fengbo Ren, Hao Yu. A Binary Convolutional Encoder-decoder Network for Real-time Natural Scene Text Processing. arXiv preprint arXiv:1612.03630, 2016.
- Kil T, Seo W, Koo H I, et al. Robust Document Image Dewarping Method Using Text-Lines and Line Segments[C]//2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR). IEEE, 2017, 1: 865-870.
[code:xellows1305/Document-Image-Dewarping]
- Raj D, Sahu S, Anand A. Learning local and global contexts using a convolutional recurrent network model for relation classification in biomedical text[C]//Proceedings of the 21st Conference on Computational Natural Language Learning (CoNLL 2017). 2017: 311-321.
code:[code]
- Florian Fink, Klaus-U. Schulz, Uwe Springmann. Profiling of OCR'ed Historical Texts Revisited. arXiv preprint arXiv:1701.05377, 2017.
- Cheang T K, Chong Y S, Tay Y H. Segmentation-free Vehicle License Plate Recognition using ConvNet-RNN[J]. arXiv preprint arXiv:1701.06439, 2017.
- Shahin A A. Printed Arabic Text Recognition using Linear and Nonlinear Regression[J]. arXiv preprint arXiv:1702.01444, 2017.
- 【Dataset】Smith R, Gu C, Lee D S, et al. End-to-end interpretation of the french street name signs dataset[C]//European Conference on Computer Vision. Springer International Publishing, 2016: 411-426.
code:[code]
- Bazazian D, Gomez R, Nicolaou A, et al. Improving Text Proposals for Scene Images with Fully Convolutional Networks[J]. arXiv preprint arXiv:1702.05089, 2017.
- 【synthetic Captcha】Le T A, Baydin A G, Zinkov R, et al. Using Synthetic Data to Train Neural Networks is Model-Based Reasoning[J]. arXiv preprint arXiv:1703.00868, 2017.
- Jianqi Ma, Weiyuan Shao, Hao Ye, Li Wang, Hong Wang, Yingbin Zheng, Xiangyang Xue. Arbitrary-Oriented Scene Text Detection via Rotation Proposals. arXiv preprint arXiv:1703.01086, 2017.
- Liu Y, Jin L. Deep matching prior network: Toward tighter multi-oriented text detection[J]. arXiv preprint arXiv:1703.01425, 2017.
- Shi B, Bai X, Belongie S. Detecting Oriented Text in Natural Images by Linking Segments[J]. arXiv preprint arXiv:1703.06520, 2017.
code:[code]
- Masood S Z, Shu G, Dehghan A, et al. License Plate Detection and Recognition Using Deeply Learned Convolutional Neural Networks[J]. arXiv preprint arXiv:1703.07330, 2017.
- Liao M, Shi B, Bai X, et al. TextBoxes: A Fast Text Detector with a Single Deep Neural Network[C]//AAAI. 2017: 4161-4167.
code:[code;code]
- He W, Zhang X Y, Yin F, et al. Deep Direct Regression for Multi-Oriented Scene Text Detection[J]. arXiv preprint arXiv:1703.08289, 2017.
- Qin S, Manduchi R. Cascaded Segmentation-Detection Networks for Word-Level Text Spotting[J]. arXiv preprint arXiv:1704.00834, 2017.
- Zhou X, Yao C, Wen H, et al. EAST: An Efficient and Accurate Scene Text Detector[J]. arXiv preprint arXiv:1704.03155, 2017.
code:[code]
- Wojna Z, Gorban A, Lee D S, et al. Attention-based Extraction of Structured Information from Street View Imagery[J]. arXiv preprint arXiv:1704.03549, 2017.
code:[official;similar]
- Moysset B, Kermorvant C, Wolf C. Full-Page Text Recognition: Learning Where to Start and When to Stop[J]. arXiv preprint arXiv:1704.08628, 2017.
- Nakamura T, Zhu A, Yanai K, et al. Scene Text Eraser[J]. arXiv preprint arXiv:1705.02772, 2017.
- Xiao X, Yang Y, Ahmad T, et al. Design of a Very Compact CNN Classifier for Online Handwritten Chinese Character Recognition Using DropWeight and Global Pooling[J]. arXiv preprint arXiv:1705.05207, 2017.
- Polzounov A, Ablavatski A, Escalera S, et al. WordFence: Text Detection in Natural Images with Border Awareness[J]. arXiv preprint arXiv:1705.05483, 2017.
- Ghosh S K, Valveny E, Bagdanov A D. Visual attention models for scene text recognition[J]. arXiv preprint arXiv:1706.01487, 2017.
- Lyu P, Bai X, Yao C, et al. Auto-Encoder Guided GAN for Chinese Calligraphy Synthesis[J]. arXiv preprint arXiv:1706.04041, 2017.
- Shervin Minaee, Yao Wang. Text Extraction From Texture Images Using Masked Signal Decomposition. arXiv preprint arXiv:1706.08789, 2017.
- Jiang Y, Zhu X, Wang X, et al. R2CNN: Rotational Region CNN for Orientation Robust Scene Text Detection[J]. arXiv preprint arXiv:1706.09579, 2017.
- Ghosh S, Valveny E. R-PHOC: Segmentation-Free Word Spotting using CNN[J]. arXiv preprint arXiv:1707.01294, 2017.
- Wang X, You M, Shen C. Adversarial generation of training examples for vehicle license plate recognition[J]. arXiv preprint arXiv:1707.03124, 2017.
- Li H, Wang P, Shen C. Towards End-to-end Text Spotting with Convolutional Recurrent Neural Networks[J]. arXiv preprint arXiv:1707.03985, 2017.
- Aneeshan Sain, Ayan Kumar Bhunia, Partha Pratim Roy, Umapada Pal. Multi-Oriented Text Detection and Verification in Video Frames and Scene Images. arXiv preprint arXiv:1707.07150, 2017.
- Bhunia A K, Kumar G, Roy P P, et al. Text recognition in scene image and video frame using Color Channel selection[J]. Multimedia Tools and Applications, 2017: 1-28.
- Partha Pratim Roy, Ayan Kumar Bhunia, Umapada Pal. Date-Field Retrieval in Scene Image and Video Frames using Text Enhancement and Shape Coding. arXiv preprint arXiv:1707.06833, 2017.
- Bartz C, Yang H, Meinel C. STN-OCR: A single Neural Network for Text Detection and Text Recognition[J]. arXiv preprint arXiv:1707.08831, 2017.
code:[code]
- Jiang F, Hao Z, Liu X. Deep Scene Text Detection with Connected Component Proposals[J]. arXiv preprint arXiv:1708.05133, 2017.
- Amarnath R, P. Nagabhushan. Spotting Separator Points at Line Terminals in Compressed Document Images for Text-line Segmentation. arXiv preprint arXiv:1708.05545, 2017.
- P. Shivakumara, D. S. Guru, H.T. Basavaraju. Color and Gradient Features for Text Segmentation from Video Frames. arXiv preprint arXiv:1708.06561, 2017.
- Hu H, Zhang C, Luo Y, et al. Wordsup: Exploiting word annotations for character based text detection[C]//Proceedings of the IEEE International Conference on Computer Vision. 2017.
- He P, Huang W, He T, et al. Single shot text detector with regional attention[C]//The IEEE International Conference on Computer Vision (ICCV). 2017.
code:[code;code]
- Yin F, Wu Y C, Zhang X Y, et al. Scene Text Recognition with Sliding Convolutional Character Models[J]. arXiv preprint arXiv:1709.01727, 2017.
- Ekta Vats, Anders Hast. On-the-fly Historical Handwritten Text Annotation. arXiv preprint arXiv:1709.01775, 2017.
- Cheng Z, Bai F, Xu Y, et al. Focusing Attention: Towards Accurate Text Recognition in Natural Images[C]//2017 IEEE International Conference on Computer Vision (ICCV). IEEE, 2017: 5086-5094.
- Dai Y, Huang Z, Gao Y, et al. Fused Text Segmentation Networks for Multi-oriented Scene Text Detection[J]. arXiv preprint arXiv:1709.03272, 2017.
- Teresa Nicole Brooks. Exploring Geometric Property Thresholds For Filtering Non-Text Regions In A Connected Component Based Text Detection Application. arXiv preprint arXiv:1709.03548, 2017.
- Yunze Gao, Yingying Chen, Jinqiao Wang, Hanqing Lu. Reading Scene Text with Attention Convolutional Sequence Modeling. arXiv preprint arXiv:1709.04303, 2017.
- Li H, Wang P, Shen C. Towards End-to-End Car License Plates Detection and Recognition with Deep Neural Networks[J]. arXiv preprint arXiv:1709.08828, 2017.
- Kazem Qazanfari, Saeed Shiri. Real time text localization for Indoor Mobile Robot Navigation. arXiv preprint arXiv:1709.09634, 2017.
- Zhan H, Wang Q, Lu Y. Handwritten digit string recognition by combination of residual network and RNN-CTC[C]//International Conference on Neural Information Processing. Springer, Cham, 2017: 583-591.
- Yang C, Yin X C, Li Z, et al. AdaDNNs: Adaptive Ensemble of Deep Neural Networks for Scene Text Recognition[J]. arXiv preprint arXiv:1710.03425, 2017.
- Tian S, Lu S, Li C. WeText: Scene Text Detection under Weak Supervision[J]. arXiv preprint arXiv:1710.04826, 2017.
- 【Dataset】Kheng Chng C, Chan C S. Total-Text: A Comprehensive Dataset for Scene Text Detection and Recognition[J]. arXiv preprint arXiv:1710.10400, 2017.
- Jain M, Mathew M, Jawahar C V. Unconstrained scene text and video text recognition for Arabic script[C]//Arabic Script Analysis and Recognition (ASAR), 2017 1st International Workshop on. IEEE, 2017: 26-30.
- Ren H, Wang W. A New Hybrid-parameter Recurrent Neural Networks for Online Handwritten Chinese Character Recognition[J]. arXiv preprint arXiv:1711.02809, 2017.
- Zhu X, Jiang Y, Yang S, et al. Deep Residual Text Detection Network for Scene Text[J]. arXiv preprint arXiv:1711.04147, 2017.
- Cheng Z, Liu X, Bai F, et al. Arbitrarily-Oriented Text Recognition[J]. arXiv preprint arXiv:1711.04226, 2017.
- Zhang S, Liu Y, Jin L, et al. Feature Enhancement Network: A Refined Scene Text Detector[J]. arXiv preprint arXiv:1711.04249, 2017.
- Xing D, Li Z, Chen X, et al. ArbiText: Arbitrary-Oriented Text Detection in Unconstrained Scene[J]. arXiv preprint arXiv:1711.11249, 2017.
- 【Dataset】Yuliang L, Lianwen J, Shuaitao Z, et al. Detecting Curve Text in the Wild: New Dataset and New Solution[J]. arXiv preprint arXiv:1712.02170, 2017.
code:[code]
- Jason Poulos, Rafael Valle. Attention networks for image-to-text. arXiv preprint arXiv:1712.04046, 2017.
- Aarushi Agrawal, Prerana Mukherjee, Siddharth Srivastava, Brejesh Lall. Enhanced Characterness for Text Detection in the Wild. arXiv preprint arXiv:1712.04927, 2017.
- Bartz C, Yang H, Meinel C. SEE: Towards Semi-Supervised End-to-End Scene Text Recognition[J]. arXiv preprint arXiv:1712.05404, 2017.
- Kang C, Kim G, Yoo S I. Detection and Recognition of Text Embedded in Online Images via Neural Context Models[C]//AAAI. 2017: 4103-4110.
code:[code]
- Busta M, Neumann L, Matas J. Deep TextSpotter: An End-to-End Trainable Scene Text Localization and Recognition Framework[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2017: 2204-2212.[code]
- Wu Y, Natarajan P. Self-organized Text Detection with Minimal Post-processing via Border Learning[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2017: 5000-5009.
- Rong X, Yi C, Tian Y. Unambiguous text localization and retrieval for cluttered scenes[C]//2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, 2017: 3279-3287.
- Deng D, Liu H, Li X, et al. PixelLink: Detecting Scene Text via Instance Segmentation[J]. arXiv preprint arXiv:1801.01315, 2018.
- Agnese Chiatti, Mu Jung Cho, Anupriya Gagneja, Xiao Yang, Miriam Brinberg, Katie Roehrick, Sagnik Ray Choudhury, Nilam Ram, Byron Reeves, C. Lee Giles. Text Extraction and Retrieval from Smartphone Screenshots: Building a Repository for Life in Media. arXiv preprint arXiv:1801.01316, 2018.
- Liu X, Liang D, Yan S, et al. FOTS: Fast Oriented Text Spotting with a Unified Network[J]. arXiv preprint arXiv:1801.01671, 2018.
- Liao M, Shi B, Bai X. TextBoxes++: A Single-Shot Oriented Scene Text Detector[J]. arXiv preprint arXiv:1801.02765, 2018.
- Anders Hast, Per Cullhed, Ekta Vats. TexT - Text Extractor Tool for Handwritten Document Transcription and Annotation. arXiv preprint arXiv:1801.05367, 2018.
- Yash Patel, Michal Bušta, Jiri Matas. E2E-MLT - an Unconstrained End-to-End Method for Multi-Language Scene Text. arXiv preprint arXiv:1801.09919, 2018.
- Yixing Zhu, Jun Du. Sliding Line Point Regression for Shape Robust Scene Text Detection. arXiv preprint arXiv:1801.09969, 2018.
- Tobias Grüning, Gundram Leifert, Tobias Strauß, Roger Labahn. A Two-Stage Method for Text Line Detection in Historical Documents. arXiv preprint arXiv:1802.03345, 2018.
- Congzheng Song, Vitaly Shmatikov. Fooling OCR Systems with Adversarial Text Images. arXiv preprint arXiv:1802.05385, 2018.
- Pengyuan Lyu, Cong Yao, Wenhao Wu, Shuicheng Yan, Xiang Bai. Multi-Oriented Scene Text Detection via Corner Localization and Region Segmentation. arXiv preprint arXiv:1802.08948, 2018.
- Tai-Ling Yuan, Zhe Zhu, Kun Xu, Cheng-Jun Li, Shi-Min Hu. Chinese Text in the Wild. arXiv preprint arXiv:1803.00085, 2018.
- Liao M, Zhu Z, Shi B, et al. Rotation-Sensitive Regression for Oriented Scene Text Detection[J]. arXiv preprint arXiv:1803.05265, 2018.
- Carbonell M, Villegas M, Fornés A, et al. Joint Recognition of Handwritten Text and Named Entities with a Neural End-to-end Model[J]. arXiv preprint arXiv:1803.06252, 2018.
- Goswami T, Barad Z, Desai P, et al. Text Detection and Recognition in images: A survey[J]. arXiv preprint arXiv:1803.07278, 2018.
- José Carlos Aradillas, Juan José Murillo-Fuentes, Pablo M. Olmos. Boosting Handwriting Text Recognition in Small Databases with Transfer Learning[J]. arXiv preprint arXiv:1803.01527, 2018.
- Linjie Deng, Yanxiang Gong, Yi Lin, Jingwen Shuai, Xiaoguang Tu, Yufei Zhang, Zheng Ma, Mei Xie. Detecting Multi-Oriented Text with Corner-based Region Proposals[J]. arXiv preprint arXiv:1804.02690, 2018.
- Partha Pratim Roy, Akash Mohta, Bidyut B. Chaudhuri. Synthetic data generation for Indic handwritten text recognition[J]. arXiv preprint arXiv:1804.06254, 2018.
- Dafang He, Yeqing Li, Alexander Gorban, Derrall Heath, Julian Ibarz, Qian Yu, Daniel Kifer, C. Lee Giles. Guided Attention for Large Scale Scene Text Verification[J]. arXiv preprint arXiv:1804.08588, 2018.
- Zhuoyao Zhong, Lei Sun, Qiang Huo. An Anchor-Free Region Proposal Network for Faster R-CNN based Text Detection Approaches[J]. arXiv preprint arXiv:1804.09003, 2018.
- 【alibaba】Qiangpeng Yang, Mengli Cheng, Wenmeng Zhou, Yan Chen, Minghui Qiu, Wei Lin, Wei Chu. IncepText: A New Inception-Text Module with Deformable PSROI Pooling for Multi-Oriented Scene Text Detection[J]. arXiv preprint arXiv:1805.01167, 2018.
- Francisco Cruz, Oriol Ramos Terrades. A probabilistic framework for handwritten text line segmentation[J]. arXiv preprint arXiv:1805.02536, 2018.
- Fan Bai, Zhanzhan Cheng, Yi Niu, Shiliang Pu, Shuigeng Zhou. Edit Probability for Scene Text Recognition[J]. arXiv preprint arXiv:1805.03384, 2018.
- Xiaoyu Yue, Zhanghui Kuang, Zhaoyang Zhang, Zhenfang Chen, Pan He, Yu Qiao, Wei Zhang. Boosting up Scene Text Detectors with Guided CNN[J]. arXiv preprint arXiv:1805.04132, 2018.
- Zichuan Liu, Guosheng Lin, Sheng Yang, Jiashi Feng, Weisi Lin, Wang Ling Goh. Learning Markov Clustering Networks for Scene Text Detection[J]. arXiv preprint arXiv:1805.08365, 2018.
- Yi-Chao Wu, Fei Yin, Xu-Yao Zhang, Li Liu, Cheng-Lin Liu. SCAN: Sliding Convolutional Attention Network for Scene Text Recognition[J]. arXiv preprint arXiv:1806.00578, 2018.
- Fenfen Sheng, Zhineng Chen, Bo Xu. NRTR: A No-Recurrence Sequence-to-Sequence Model For Scene Text Recognition[J]. arXiv preprint arXiv:1806.00926, 2018.
- 【PSENet】Xiang Li, Wenhai Wang, Wenbo Hou, Ruo-Ze Liu, Tong Lu, Jian Yang. Shape Robust Text Detection with Progressive Scale Expansion Network[J]. arXiv preprint arXiv:1806.02559, 2018.
- Sauradip Nag, Pallab Kumar Ganguly, Sumit Roy, Sourab Jha, Krishna Bose, Abhishek Jha, Kousik Dasgupta. Offline Extraction of Indic Regional Language from Natural Scene Image using Text Segmentation and Deep Convolutional Sequence[J]. arXiv preprint arXiv:1806.06208, 2018.
- Arka Ujjal Dey, Suman K. Ghosh, Ernest Valveny. Don't only Feel Read: Using Scene text to understand advertisements[J]. arXiv preprint arXiv:1806.08279, 2018.
- Shangbang Long, Jiaqiang Ruan, Wenjie Zhang, Xin He, Wenhao Wu, Cong Yao. TextSnake: A Flexible Representation for Detecting Text of Arbitrary Shapes[J]. arXiv preprint arXiv:1807.01544, 2018.
- Qi Yuan, Bingwang Zhang, Haojie Li, Zhihui Wang, Zhongxuan Luo. A Single Shot Text Detector with Scale-adaptive Anchors[J]. arXiv preprint arXiv:1807.01884, 2018.
- Pengyuan Lyu, Minghui Liao, Cong Yao, Wenhao Wu, Xiang Bai. Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary Shapes[J]. arXiv preprint arXiv:1807.02242, 2018.
- Fangneng Zhan, Shijian Lu, Chuhui Xue. Verisimilar Image Synthesis for Accurate Detection and Recognition of Texts in Scenes[J]. arXiv preprint arXiv:1807.03021, 2018.
- Xiaoyong Yuan, Pan He, Xiaolin Andy Li. Adaptive Adversarial Attack on Scene Text Recognition[J]. arXiv preprint arXiv:1807.03326, 2018.
- Chuhui Xue, Shijian Lu, Fangneng Zhan. Accurate Scene Text Detection through Border Semantics Awareness and Bootstrapping[J]. arXiv preprint arXiv:1807.03547, 2018.
- Arindam Chowdhury, Lovekesh Vig. An Efficient End-to-End Neural Model for Handwritten Text Recognition[J]. arXiv preprint arXiv:1807.07965, 2018.
- Yuting Gao, Zheng Huang, Yuchen Dai. Double Supervised Network with Attention Mechanism for Scene Text Recognition[J]. arXiv preprint arXiv:1808.00677, 2018.
- Wenchao Wang, Jun Du, Zi-Rui Wang. Parsimonious HMMs for Offline Handwritten Chinese Text Recognition[J]. arXiv preprint arXiv:1808.04138, 2018.
- Lluís Gómez, Andrés Mafla, Marçal Rusiñol, Dimosthenis Karatzas. Single Shot Scene Text Retrieval[J]. arXiv preprint arXiv:1808.09044, 2018.
- Dafang He, Xiao Yang, Daniel Kifer, C. Lee Giles .TextContourNet: a Flexible and Effective Framework for Improving Scene Text Detection Architecture with a Multi-task Cascade .[J] arXiv preprint arXiv:1809.03050.
- Minghui Liao, Jian Zhang, Zhaoyi Wan, Fengming Xie, Jiajun Liang, Pengyuan Lyu, Cong Yao, Xiang Bai .Scene Text Recognition from Two-Dimensional Perspective .[J] arXiv preprint arXiv:1809.06508.
- Mayank Gupta, Abhinav Kumar, Sriganesh Madhvanath .Parametric Synthesis of Text on Stylized Backgrounds using PGGANs .[J] arXiv preprint arXiv:1809.08488.
- Saad Bin Ahmed, Saeeda Naz, Muhammad Imran Razzak, Rubiyah Yusof .Cursive Scene Text Analysis by Deep Convolutional Linear Pyramids .[J] arXiv preprint arXiv:1809.10792.
- Zichuan Liu, Guosheng Lin, Wang Ling Goh, Fayao Liu, Chunhua Shen, Xiaokang Yang .Correlation Propagation Networks for Scene Text Detection .[J] arXiv preprint arXiv:1810.00304.
- Ahmed Sabir, Francesc Moreno-Noguer, Lluís Padró .Visual Semantic Re-ranker for Text Spotting .[J] arXiv preprint arXiv:1810.09776.
- Ahmed Sabir, Francesc Moreno-Noguer, Lluís Padró .Visual Re-ranking with Natural Language Understanding for Text Spotting .[J] arXiv preprint arXiv:1810.12738.
- Hui Li, Peng Wang, Chunhua Shen, Guyu Zhang .Show, Attend and Read: A Simple and Strong Baseline for Irregular Text Recognition .[J] arXiv preprint arXiv:1811.00751.
- Shangbang Long, Xin He, Cong Yao .Scene Text Detection and Recognition: The Deep Learning Era .[J] arXiv preprint arXiv:1811.04256.
- Jing Huang, Viswanath Sivakumar, Mher Mnatsakanyan, Guan Pang .Improving Rotated Text Detection with Rotation Region Proposal Networks .[J] arXiv preprint arXiv:1811.07031.
- Yuan Li, Yuanjie Yu, Zefeng Li, Yangkun Lin, Meifang Xu, Jiwei Li, Xi Zhou .Pixel-Anchor: A Fast Oriented Scene Text Detector with Combined Networks .[J] arXiv preprint arXiv:1811.07432.
- Wanchen Sui, Qing Zhang, Jun Yang, Wei Chu .A Novel Integrated Framework for Learning both Text Detection and Recognition .[J] arXiv preprint arXiv:1811.08611.
- Zhida Huang, Zhuoyao Zhong, Lei Sun, Qiang Huo .Mask R-CNN with Pyramid Attention Network for Scene Text Detection .[J] arXiv preprint arXiv:1811.09058.
- Dinh NguyenVan, Shijian Lu, Shangxuan Tian, Nizar Ouarti, Mounir Mokhtari .A pooling based scene text proposal technique for scene text reading in the wild .[J] arXiv preprint arXiv:1811.10003.
- Hanh T. M. Tran, Tien Ho-Phuoc .Deep Laplacian Pyramid Network for Text Images Super-Resolution .[J] arXiv preprint arXiv:1811.10449.
- Yixing Zhu, Jun Du .TextMountain: Accurate Scene Text Detection via Instance Segmentation .[J] arXiv preprint arXiv:1811.12786.
- Shuaitao Zhang, Yuliang Liu, Lianwen Jin, Yaoxiong Huang, Songxuan Lai .EnsNet: Ensconce Text in the Wild .[J] arXiv preprint arXiv:1812.00723.
- Yongchao Xu, Yukang Wang, Wei Zhou, Yongpan Wang, Zhibo Yang, Xiang Bai .TextField: Learning A Deep Direction Field for Irregular Scene Text Detection .[J] arXiv preprint arXiv:1812.01393.
- Najoua Rahal, Maroua Tounsi, Adel M. Alimi .Auto-Encoder-BoF/HMM System for Arabic Text Recognition .[J] arXiv preprint arXiv:1812.03680.
- 【Dataset】Masakazu Iwamura .Advances of Scene Text Datasets .[J] arXiv preprint arXiv:1812.05219.
- Fangneng Zhan, Shijian Lu .ESIR: End-to-end Scene Text Recognition via Iterative Image Rectification .[J] arXiv preprint arXiv:1812.05824.
- Shuai Yang, Jiaying Liu, Wenjing Wang, Zongming Guo .TET-GAN: Text Effects Transfer via Stylization and Destylization .[J] arXiv preprint arXiv:1812.06384.
- Chankyu Choi, Youngmin Yoon, Junsu Lee, Junseok Kim .Simultaneous Recognition of Horizontal and Vertical Text in Natural Images .[J] arXiv preprint arXiv:1812.07059.
- Yunze Gao, Yingying Chen, Jinqiao Wang, Zhen Lei, Xiao-Yu Zhang, Hanqing Lu .Recurrent Calibration Network for Irregular Text Recognition .[J] arXiv preprint arXiv:1812.07145.
- Zi-Rui Wang, Jun Du, Jia-Ming Wang .Writer-Aware CNN for Parsimonious HMM-Based Offline Handwritten Chinese Text Recognition .[J] arXiv preprint arXiv:1812.09809.
- Yipeng Sun, Chengquan Zhang, Zuming Huang, Jiaming Liu, Junyu Han, Errui Ding .TextNet: Irregular Text Reading from Images with an End-to-End Trainable Network .[J] arXiv preprint arXiv:1812.09900.
- Mohamed Yousef, Khaled F. Hussain, Usama S. Mohammed .Accurate, Data-Efficient, Unconstrained Text Recognition with Convolutional Neural Networks .[J] arXiv preprint arXiv:1812.11894.
- Jiaming Liu, Chengquan Zhang, Yipeng Sun, Junyu Han, Errui Ding .Detecting Text in the Wild with Deep Character Embedding Network .[J] arXiv preprint arXiv:1901.00363.
- Chuhui Xue, Shijian Lu, Wei Zhang .MSR: Multi-Scale Shape Regression for Scene Text Detection .[J] arXiv preprint arXiv:1901.02596.
- 【MORAN】Canjie Luo, Lianwen Jin, Zenghui Sun .A Multi-Object Rectified Attention Network for Scene Text Recognition .[J] arXiv preprint arXiv:1901.03003.
[code: Canjie-Luo/MORAN_v2]
- Wei Liu, Chaofeng Chen, Kwan-Yee K. Wong .SAFE: Scale Aware Feature Encoder for Scene Text Recognition .[J] arXiv preprint arXiv:1901.05770.
- Yanxiang Gong, Linjie Deng, Zheng Ma, Mei Xie .Generating Text Sequence Images for Recognition .[J] arXiv preprint arXiv:1901.06782.
- Fangneng Zhan, Hongyuan Zhu, Shijian Lu .Scene Text Synthesis for Efficient and Effective Deep Network Training .[J] arXiv preprint arXiv:1901.09193.
- Amarnath R, P Nagabhushan .Text line Segmentation in Compressed Representation of Handwritten Document using Tunneling Algorithm .[J] arXiv preprint arXiv:1901.11477.
- Eloi Alonso, Bastien Moysset, Ronaldo Messina .Adversarial Generation of Handwritten Text Images Conditioned on Sequences .[J] arXiv preprint arXiv:1903.00277.
- Prasun Roy, Saumik Bhattacharya, Subhankar Ghosh, Umapada Pal .STEFANN: Scene Text Editor using Font Adaptive Neural Network .[J] arXiv preprint arXiv:1903.01192.
- Zhanzhan Cheng, Jing Lu, Jianwen Xie, Yi Niu, Shiliang Pu, Fei Wu .Efficient Video Scene Text Spotting: Unifying Detection, Tracking, and Recognition .[J] arXiv preprint arXiv:1903.03299.
- Bastien Moysset, Ronaldo Messina .Manifold Mixup improves text recognition with CTC loss .[J] arXiv preprint arXiv:1903.04246.
- Johannes Michael, Roger Labahn, Tobias Grüning, Jochen Zöllner .Evaluating Sequence-to-Sequence Models for Handwritten Text Recognition .[J] arXiv preprint arXiv:1903.07377.
- Zichuan Liu, Guosheng Lin, Sheng Yang, Fayao Liu, Weisi Lin, Wang Ling Goh .Towards Robust Curve Text Detection with Conditional Spatial Expansion .[J] arXiv preprint arXiv:1903.08836.
- Zhao Zhou, Shufan Wu, Shuchen Kong, Yingbin Zheng, Hao Ye, Luhui Chen, Jian Pu .Curve Text Detection with Local Segmentation Network and Curve Connection .[J] arXiv preprint arXiv:1903.09837.
- 【Dataset】Chongsheng Zhang, Guowen Peng, Yuefeng Tao, Feifei Fu, Wei Jiang, George Almpanidis, Ke Chen .ShopSign: a Diverse Scene Text Dataset of Chinese Shop Signs in Street Views .[J] arXiv preprint arXiv:1903.10412.
- Jingchao Liu, Xuebo Liu, Jie Sheng, Ding Liang, Xin Li, Qingjie Liu .Pyramid Mask Text Detector .[J] arXiv preprint arXiv:1903.11800.
- Xiaohui Zhao, Zhuo Wu, Xiaoguang Wang .CUTIE: Learning to Understand Documents with Convolutional Universal Text Information Extractor .[J] arXiv preprint arXiv:1903.12363.
- Wenhai Wang, Enze Xie, Xiang Li, Wenbo Hou, Tong Lu, Gang Yu, Shuai Shao .Shape Robust Text Detection with Progressive Scale Expansion Network .[J] arXiv preprint arXiv:1903.12473.
- Yuliang Liu, Lianwen Jin, Zecheng Xie, Canjie Luo, Shuaitao Zhang, Lele Xie .Tightness-aware Evaluation Protocol for Scene Text Detection .[J] arXiv preprint arXiv:1904.00813.
- 【Dataset】Simone Bonechi, Paolo Andreini, Monica Bianchini, Franco Scarselli .COCO_TS Dataset: Pixel-level Annotations Based on Weak Supervision for Scene Text Segmentation .[J] arXiv preprint arXiv:1904.00818.
- Peng Wang, Lu Yang, Hui Li, Yuyan Deng, Chunhua Shen, Yanning Zhang .A Simple and Robust Convolutional-Attention Network for Irregular Text Recognition .[J] arXiv preprint arXiv:1904.01375.
- Jeonghun Baek, Geewook Kim, Junyeop Lee, Sungrae Park, Dongyoon Han, Sangdoo Yun, Seong Joon Oh, Hwalsuk Lee .What is wrong with scene text recognition model comparisons? dataset and model analysis .[J] arXiv preprint arXiv:1904.01906.
- Youngmin Baek, Bado Lee, Dongyoon Han, Sangdoo Yun, Hwalsuk Lee .Character Region Awareness for Text Detection .[J] arXiv preprint arXiv:1904.01941.
- Chengquan Zhang, Borong Liang, Zuming Huang, Mengyi En, Junyu Han, Errui Ding, Xinghao Ding .Look More Than Once: An Accurate Detector for Text of Arbitrary Shapes .[J] arXiv preprint arXiv:1904.06535.
- 【Dataset】Vinoj Jayasundara, Sandaru Jayasekara, Hirunima Jayasekara, Jathushan Rajasegaran, Suranga Seneviratne, Ranga Rodrigo .TextCaps: Handwritten Character Recognition with Very Small Datasets .[J] arXiv preprint arXiv:1904.08095.
- R. Reeve Ingle, Yasuhisa Fujii, Thomas Deselaers, Jonathan Baccash, Ashok C. Popat .A Scalable Handwritten Text Recognition System .[J] arXiv preprint arXiv:1904.09150.
- Qingqing Wang, Wenjing Jia, Xiangjian He, Yue Lu, Michael Blumenstein, Ye Huang .FACLSTM: ConvLSTM with Focused Attention for Scene Text Recognition .[J] arXiv preprint arXiv:1904.09405.
- Fady Medhat, Mahnaz Mohammadi, Sardar Jaf, Chris G. Willcocks, Toby P. Breckon, Peter Matthews, Andrew Stephen McGough, Georgios Theodoropoulos, Boguslaw Obara .TMIXT: A process flow for Transcribing MIXed handwritten and machine-printed Text .[J] arXiv preprint arXiv:1904.12387.
- Weijia Wu, Jici Xing, Hong Zhou .TextCohesion: Detecting Text for Arbitrary Shapes .[J] arXiv preprint arXiv:1904.12640.
- Shuai Yang, Zhangyang Wang, Zhaowen Wang, Ning Xu, Jiaying Liu, Zongming Guo .Controllable Artistic Text Style Transfer via Shape-Matching GAN [J]. arXiv preprint arXiv:1905.01354.
- Shuai Yang, Wenjing Wang, Jiaying Liu .TE141K: Artistic Text Benchmark for Text Effects Transfer [J]. arXiv preprint arXiv:1905.03646.
- Danlu Chen, Xu-Yao Zhang, Wei Zhang, Yao Lu, Xiuli Li, Tao Mei .Predictive Ensemble Learning with Application to Scene Text Detection [J]. arXiv preprint arXiv:1905.04641.
- Xiaobing Wang, Yingying Jiang, Zhenbo Luo, Cheng-Lin Liu, Hyunsoo Choi, Sungjin Kim .Arbitrary Shape Scene Text Detection with Adaptive Text Region Representation [J]. arXiv preprint arXiv:1905.05980.
- Arka Ujjal Dey, Suman Kumar Ghosh, Ernest Valveny .Beyond Visual Semantics: Exploring the Role of Scene Text in Image Understanding [J]. arXiv preprint arXiv:1905.10622.
- Ali Furkan Biten, Ruben Tito, Andres Mafla, Lluis Gomez, Marçal Rusiñol, Ernest Valveny, C.V. Jawahar, Dimosthenis Karatzas .Scene Text Visual Question Answering [J]. arXiv preprint arXiv:1905.13648.
- Raul Gomez, Ali Furkan Biten, Lluis Gomez, Jaume Gibert, Marçal Rusiñol, Dimosthenis Karatzas .Selective Style Transfer for Text [J]. arXiv preprint arXiv:1906.01466.
- 【Dataset】Hongyu Li, Fan Zhu, Junhua Qiu .Towards Document Image Quality Assessment: A Text Line Based Framework and A Synthetic Text Line Image Dataset [J]. arXiv preprint arXiv:1906.01907.
- Yuliang Liu, Sheng Zhang, Lianwen Jin, Lele Xie, Yaqiang Wu, Zhepeng Wang .Omnidirectional Scene Text Detection with Sequential-free Box Discretization [J]. arXiv preprint arXiv:1906.02371.
- Junho Jo, Hyung Il Koo, Jae Woong Soh, Nam Ik Cho .Handwritten Text Segmentation via End-to-End Learning of Convolutional Neural Network [J]. arXiv preprint arXiv:1906.05229.
- Pengyuan Lyu, Zhicheng Yang, Xinhang Leng, Xiaojun Wu, Ruiyu Li, Xiaoyong Shen .2D Attentional Irregular Scene Text Recognizer [J]. arXiv preprint arXiv:1906.05708.
- Hui Li, Peng Wang, Chunhua Shen .Towards End-to-End Text Spotting in Natural Scenes [J]. arXiv preprint arXiv:1906.06013.
- Michele Alberti, Lars Vögtlin, Vinaychandran Pondenkandath, Mathias Seuret, Rolf Ingold, Marcus Liwicki .Labeling, Cutting, Grouping: an Efficient Text Line Segmentation Method for Medieval Manuscripts [J]. arXiv preprint arXiv:1906.11894.
- Ali Furkan Biten, Rubèn Tito, Andres Mafla, Lluis Gomez, Marçal Rusiñol, Minesh Mathew, C.V. Jawahar, Ernest Valveny, Dimosthenis Karatzas .ICDAR 2019 Competition on Scene Text Visual Question Answering [J]. arXiv preprint arXiv:1907.00490.
- Toshiki Nakamura, Anna Zhu, Seiichi Uchida .Scene Text Magnifier [J]. arXiv preprint arXiv:1907.00693.
- Nibal Nayef, Yash Patel, Michal Busta, Pinaki Nath Chowdhury, Dimosthenis Karatzas, Wafa Khlif, Jiri Matas, Umapada Pal, Jean-Christophe Burie, Cheng-lin Liu, Jean-Marc Ogier .ICDAR2019 Robust Reading Challenge on Multi-lingual Scene Text Detection and Recognition -- RRC-MLT-2019 [J]. arXiv preprint arXiv:1907.00945.
- Chae Young Lee, Youngmin Baek, Hwalsuk Lee .TedEval: A Fair Evaluation Metric for Scene Text Detectors [J]. arXiv preprint arXiv:1907.01227.
- Pranay Dugar, Anirban Chatterjee, Rajesh Shreedhar Bhat, Saswata Sahoo .Semi-Bagging Based Deep Neural Architecture to Extract Text from High Entropy Images [J]. arXiv preprint arXiv:1907.01284.
- Christen M, AB Saravanan .RFBTD: RFB Text Detector [J]. arXiv preprint arXiv:1907.02228.
- Minghui Liao, Boyu Song, Minghang He, Shangbang Long, Cong Yao, Xiang Bai .SynthText3D: Synthesizing Scene Text Images from 3D Virtual Worlds [J]. arXiv preprint arXiv:1907.06007.
- Fangneng Zhan, Chuhui Xue, Shijian Lu .GA-DAN: Geometry-Aware Domain Adaptation Network for Scene Text Detection and Recognition [J]. arXiv preprint arXiv:1907.09653.
- Zhaoyi Wan, Fengming Xie, Yibo Liu, Xiang Bai, Cong Yao .2D-CTC for Scene Text Recognition [J]. arXiv preprint arXiv:1907.09705.
- Bo Ji, Tianyi Chen .Generative Adversarial Network for Handwritten Text [J]. arXiv preprint arXiv:1907.11845.
- Elad Richardson, Yaniv Azar, Or Avioz, Niv Geron, Tomer Ronen, Zach Avraham, Stav Shapiro .It's All About The Scale -- Efficient Text Detection Using Adaptive Scaling [J]. arXiv preprint arXiv:1907.12122.
- Bulla Rajesh, Mohammed Javed, P Nagabhushan .Automatic Text Line Segmentation Directly in JPEG Compressed Document Images [J]. arXiv preprint arXiv:1907.12219.
- Zhenlong Xu, Shuigeng Zhou, Zhanzhan Cheng, Fan Bai, Yi Niu, Shiliang Pu .Towards Pure End-to-End Learning for Recognizing Multiple Text Sequences from an Image [J]. arXiv preprint arXiv:1907.12791.
- Yi Zheng, Qitong Wang, Margrit Betke .Deep Neural Network for Semantic-based Text Recognition in Images [J]. arXiv preprint arXiv:1908.01403.
- MingKun Yang, Yushuo Guan, Minghui Liao, Xin He, Kaigui Bian, Song Bai, Cong Yao, Xiang Bai .Symmetry-constrained Rectification Network for Scene Text Recognition [J]. arXiv preprint arXiv:1908.01957.
- Liang Wu, Chengquan Zhang, Jiaming Liu, Junyu Han, Jingtuo Liu, Errui Ding, Xiang Bai .Editing Text in the Wild [J]. arXiv preprint arXiv:1908.03047.
- Wenhai Wang, Enze Xie, Xiaoge Song, Yuhang Zang, Wenjia Wang, Tong Lu, Gang Yu, Chunhua Shen .Efficient and Accurate Arbitrary-Shaped Text Detection with Pixel Aggregation Network [J]. arXiv preprint arXiv:1908.05900.
- Hongyuan Yu, Chengquan Zhang, Xuan Li, Junyu Han, Errui Ding, Liang Wang .An End-to-end Video Text Detector with Online Tracking [J]. arXiv preprint arXiv:1908.07135.
- Minghui Liao, Pengyuan Lyu, Minghang He, Cong Yao, Wenhao Wu, Xiang Bai .Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary Shapes [J]. arXiv preprint arXiv:1908.08207.
- Alexander Filonenko, Konstantin Gudkov, Aleksei Lebedev, Nikita Orlov, Ivan Zagaynov .FaSTExt: Fast and Small Text Extractor [J]. arXiv preprint arXiv:1908.08994.
- Siyang Qin, Alessandro Bissacco, Michalis Raptis, Yasuhisa Fujii, Ying Xiao .Towards Unconstrained End-to-End Text Spotting [J]. arXiv preprint arXiv:1908.09231.
- Xiaoxue Chen, Tianwei Wang, Yuanzhi Zhu, Lianwen Jin, Canjie Luo .Adaptive Embedding Gate for Attention-Based Scene Text Recognition [J]. arXiv preprint arXiv:1908.09475.
- Gundram Leifert, Roger Labahn, Tobias Grüning, Svenja Leifert .End-To-End Measure for Text Recognition [J]. arXiv preprint arXiv:1908.09584.
- Xugong Qin, Yu Zhou, Dongbao Yang, Weiping Wang .Curved Text Detection in Natural Scene Images with Semi- and Weakly-Supervised Learning [J]. arXiv preprint arXiv:1908.09990.
- Yanxiang Gong, Linjie Deng, Xinchen Lu, Xin Yi, Zheng Ma, Mei Xie .Focus-Enhanced Scene Text Recognition with Deformable Convolutions [J]. arXiv preprint arXiv:1908.10998.
- Shangbang Long, Yushuo Guan, Bingxuan Wang, Kaigui Bian, Cong Yao .Alchemy: Techniques for Rectification Based Irregular Scene Text Recognition [J]. arXiv preprint arXiv:1908.11834.
- Youjiang Xu, Jiaqi Duan, Zhanghui Kuang, Xiaoyu Yue, Hongbin Sun, Yue Guan, Wayne Zhang .Geometry Normalization Networks for Accurate Scene Text Detection [J]. arXiv preprint arXiv:1909.00794.
- Wenjia Wang, Enze Xie, Peize Sun, Wenhai Wang, Lixun Tian, Chunhua Shen, Ping Luo .TextSR: Content-Aware Text Super-Resolution Guided by Recognition [J]. arXiv preprint arXiv:1909.07113.
- Chee-Kheng Chng, Yuliang Liu, Yipeng Sun, Chun Chet Ng, Canjie Luo, Zihan Ni, ChuanMing Fang, Shuaitao Zhang, Junyu Han, Errui Ding, Jingtuo Liu, Dimosthenis Karatzas, Chee Seng Chan, Lianwen Jin .ICDAR2019 Robust Reading Challenge on Arbitrary-Shaped Text (RRC-ArT) [J]. arXiv preprint arXiv:1909.07145.
- Linjie Deng, Yanxiang Gong, Xinchen Lu, Yi Lin, Zheng Ma, Mei Xie .STELA: A Real-Time Scene Text Detector with Learned Anchor [J]. arXiv preprint arXiv:1909.07549.
- Yipeng Sun, Zihan Ni, Chee-Kheng Chng, Yuliang Liu, Canjie Luo, Chun Chet Ng, Junyu Han, Errui Ding, Jingtuo Liu, Dimosthenis Karatzas, Chee Seng Chan, Lianwen Jin .ICDAR 2019 Competition on Large-scale Street View Text with Partial Labeling -- RRC-LSVT [J]. arXiv preprint arXiv:1909.07741.
- Yipeng Sun, Jiaming Liu, Wei Liu, Junyu Han, Errui Ding, Jingtuo Liu .Chinese Street View Text: Large-scale Chinese Text Reading with Partially Supervised Learning [J]. arXiv preprint arXiv:1909.07808.
- Han Xu, Yao Ma, Haochen Liu, Debayan Deb, Hui Liu, Jiliang Tang, Anil K. Jain .Adversarial Attacks and Defenses in Images, Graphs and Text: A Review [J]. arXiv preprint arXiv:1909.08072.
- He Guo, Xiameng Qin, Jiaming Liu, Junyu Han, Jingtuo Liu, Errui Ding .EATEN: Entity-aware Attention for Single Shot Visual Text Extraction [J]. arXiv preprint arXiv:1909.09380.
- Ning Lu, Wenwen Yu, Xianbiao Qi, Yihao Chen, Ping Gong, Rong Xiao .MASTER: Multi-Aspect Non-local Network for Scene Text Recognition [J]. arXiv preprint arXiv:1910.02562.
- Konstantin Bulatov, Boris Savelyev, Vladimir V. Arlazarov .Next integrated result modelling for stopping the text field recognition process in a video using a result model with per-character alternatives [J]. arXiv preprint arXiv:1910.04107.
- Junyeop Lee, Sungrae Park, Jeonghun Baek, Seong Joon Oh, Seonghyeon Kim, Hwalsuk Lee .On Recognizing Texts of Arbitrary Shapes with 2D Self-Attention [J]. arXiv preprint arXiv:1910.04396.
- Fedor Borisyuk, Albert Gordo, Viswanath Sivakumar .Rosetta: Large scale system for text detection and recognition in images [J]. arXiv preprint arXiv:1910.05085.
- Mostafa Karimi, Gopalkrishna Veni, Yen-Yun Yu .Illegible Text to Readable Text: An Image-to-Image Transformation using Conditional Sliced Wasserstein Adversarial Networks [J]. arXiv preprint arXiv:1910.05425.
- Hannes Fassold, Ridouane Ghermi .OmniTrack: Real-time detection and tracking of objects, text and logos in video [J]. arXiv preprint arXiv:1910.06017.
- W. Ronny Huang, Yike Qi, Qianqian Li, Jonathan Degange .DeepErase: Weakly Supervised Ink Artifact Removal in Document Text Images [J]. arXiv preprint arXiv:1910.07070.
- Xiangcheng Du, Tianlong Ma, Yingbin Zheng, Hao Ye, Xingjiao Wu, Liang He .Scene Text Recognition with Temporal Convolutional Encoder [J]. arXiv preprint arXiv:1911.01051.
- Duc Nguyen, Nhan Tran, Hung Le .Improving Long Handwritten Text Line Recognition with Convolutional Multi-way Associative Memory [J]. arXiv preprint arXiv:1911.01577.
- Qitong Wang, Yi Zheng, Margrit Betke .SA-Text: Simple but Accurate Detector for Text of Arbitrary Shapes [J]. arXiv preprint arXiv:1911.07046.
- XiaoQian Li, Jie Liu, ShuWu Zhang, GuiXuan Zhang .Learning to Predict More Accurate Text Instances for Scene Text Detection [J]. arXiv preprint arXiv:1911.07423.
- Christian Bartz, Joseph Bethge, Haojin Yang, Christoph Meinel .KISS: Keeping It Simple for Scene Text Recognition [J]. arXiv preprint arXiv:1911.08400.
- Minghui Liao, Zhaoyi Wan, Cong Yao, Kai Chen, Xiang Bai .Real-time Scene Text Detection with Differentiable Binarization [J]. arXiv preprint arXiv:1911.08947.
- Simone Bonechi, Paolo Andreini, Monica Bianchini, Franco Scarselli .Weak Supervision for Generating Pixel-Level Annotations in Scene Text Segmentation [J]. arXiv preprint arXiv:1911.09026.
- Hao Wang, Pu Lu, Hui Zhang, Mingkun Yang, Xiang Bai, Yongchao Xu, Mengchao He, Yongpan Wang, Wenyu Liu .All You Need Is Boundary: Toward Arbitrary-Shaped Text Spotting [J]. arXiv preprint arXiv:1911.09550.
- Olga Petrova, Konstantin Bulatov, Vladimir L. Arlazarov .Methods of Weighted Combination for Text Field Recognition in a Video Stream [J]. arXiv preprint arXiv:1911.12028.
- Maurits Bleeker, Maarten de Rijke .Bidirectional Scene Text Recognition with a Single Decoder [J]. arXiv preprint arXiv:1912.03656.
- Changxu Cheng, Qiuhui Huang, Xiang Bai, Bin Feng, Wenyu Liu .Patch Aggregator for Scene Text Script Identification [J]. arXiv preprint arXiv:1912.03818.
- Jinjin Zhang, Wei Wang, Di Huang, Qingjie Liu, Yunhong Wang .A Feasible Framework for Arbitrary-Shaped Scene Text Recognition [J]. arXiv preprint arXiv:1912.04561.
- Boying Li, Danping Zou, Daniele Sartori, Ling Pei, Wenxian Yu .TextSLAM: Visual SLAM with Planar Text Features [J]. arXiv preprint arXiv:1912.05002.
- Lambert Schomaker .Lifelong learning for text retrieval and recognition in historical handwritten document collections [J]. arXiv preprint arXiv:1912.05156.
- Zhao Zhang, Zemin Tang, Zheng Zhang, Yang Wang, Jie Qin, Meng Wang .Fully-Convolutional Intensive Feature Flow Neural Network for Text Recognition [J]. arXiv preprint arXiv:1912.06446.
- Zhao Zhang, Zemin Tang, Yang Wang, Zheng Zhang, Shuicheng Yan, Meng Wang .Fast DenseNet: Towards Efficient and Accurate Text Recognition with Fast Dense Networks [J]. arXiv preprint arXiv:1912.07016.
- Osman Tursun, Simon Denman, Rui Zeng, Sabesan Sivapalan, Sridha Sridharan, Clinton Fookes .MTRNet++: One-stage Mask-based Scene Text Eraser [J]. arXiv preprint arXiv:1912.07183.
- Zi-Rui Wang, Jun Du .Joint Architecture and Knowledge Distillation in Convolutional Neural Network for Offline Handwritten Chinese Text Recognition [J]. arXiv preprint arXiv:1912.07806.
- Joël Seytre, Jon Wu, Alessandro Achille .TextTubes for Detecting Curved Text in the Wild [J]. arXiv preprint arXiv:1912.08990.
- Yuliang Liu, Tong He, Hao Chen, Xinyu Wang, Canjie Luo, Shuaitao Zhang, Chunhua Shen, Lianwen Jin .Exploring the Capacity of Sequential-free Box Discretization Network for Omnidirectional Scene Text Detection [J]. arXiv preprint arXiv:1912.09629.
- Xi Liu, Rui Zhang, Yongsheng Zhou, Qianyi Jiang, Qi Song, Nan Li, Kai Zhou, Lei Wang, Dong Wang, Minghui Liao, Mingkun Yang, Xiang Bai, Baoguang Shi, Dimosthenis Karatzas, Shijian Lu, C. V. Jawahar .ICDAR 2019 Robust Reading Challenge on Reading Chinese Text on Signboard [J]. arXiv preprint arXiv:1912.09641.
- Manuel Carbonell, Alicia Fornés, Mauricio Villegas, Josep Lladós .TreyNet: A Neural Model for Text Localization, Transcription and Named Entity Recognition in Full Pages [J]. arXiv preprint arXiv:1912.10016.
- Tianwei Wang, Yuanzhi Zhu, Lianwen Jin, Canjie Luo, Xiaoxue Chen, Yaqiang Wu, Qianying Wang, Mingxiang Cai .Decoupled Attention Network for Text Recognition [J]. arXiv preprint arXiv:1912.10205.
- Zhaoyi Wan, Minghang He, Haoran Chen, Xiang Bai, Cong Yao .TextScanner: Reading Characters in Order for Robust Scene Text Recognition [J]. arXiv preprint arXiv:1912.12422.
- Pei Xu, Shan Huang, Hongzhen Wang, Hao Song, Shen Huang, Qi Ju .A Multi-oriented Chinese Keyword Spotter Guided by Text Line Detection [J]. arXiv preprint arXiv:2001.00722.
- Canjie Luo, Qingxiang Lin, Yuliang Liu, Lianwen Jin, Chunhua Shen .Separating Content from Style Using Adversarial Learning for Recognizing Text in the Wild [J]. arXiv preprint arXiv:2001.04189.
- Mayank Wadhwani, Debapriya Kundu, Deepayan Chakraborty, Bhabatosh Chanda .Text Extraction and Restoration of Old Handwritten Documents [J]. arXiv preprint arXiv:2001.08742.
- Zhao Zhang, Zemin Tang, Yang Wang, Jie Qin, Haijun Zhang, Shuicheng Yan .Fast Dense Residual Network: Enhancing Global Dense Feature Flow for Text Recognition [J]. arXiv preprint arXiv:2001.09021.
- Gang Wang .Scene Text Recognition With Finer Grid Rectification [J]. arXiv preprint arXiv:2001.09389.
- Wenyang Hu, Xiaocong Cai, Jun Hou, Shuai Yi, Zhiping Lin .GTC: Guided Training of CTC Towards Efficient and Accurate Scene Text Recognition [J]. arXiv preprint arXiv:2002.01276.
- Shangbang Long, Yushuo Guan, Kaigui Bian, Cong Yao .A New Perspective for Flexible Feature Gathering in Scene Text Recognition Via Character Anchor Pooling [J]. arXiv preprint arXiv:2002.03509.
- Kinjal Dasgupta, Sudip Das, Ujjwal Bhattacharya .Scale-Invariant Multi-Oriented Text Detection in Wild Scene Images [J]. arXiv preprint arXiv:2002.06423.
- Liang Qiao, Sanli Tang, Zhanzhan Cheng, Yunlu Xu, Yi Niu, Shiliang Pu, Fei Wu .Text Perceptron: Towards End-to-End Arbitrary-Shaped Text Spotting [J]. arXiv preprint arXiv:2002.06820.
- Yuliang Liu, Hao Chen, Chunhua Shen, Tong He, Lianwen Jin, Liangwei Wang .ABCNet: Real-time Scene Text Spotting with Adaptive Bezier-Curve Network [J]. arXiv preprint arXiv:2002.10200.
- Jinyuan Zhao, Yanna Wang, Baihua Xiao, Cunzhao Shi, Fuxi Jia, Chunheng Wang .DGST: Discriminator Guided Scene Text detector [J]. arXiv preprint arXiv:2002.12509.
- Hui Zhang, Quanming Yao, Mingkun Yang, Yongchao Xu, Xiang Bai .Efficient Backbone Search for Scene Text Recognition [J]. arXiv preprint arXiv:2003.06567.
- Canjie Luo, Yuanzhi Zhu, Lianwen Jin, Yongpan Wang .Learn to Augment: Joint Data Augmentation and Network Optimization for Text Recognition [J]. arXiv preprint arXiv:2003.06606.
- Chixiang Ma, Lei Sun, Zhuoyao Zhong, Qiang Huo .ReLaText: Exploiting Visual Relationships for Arbitrary-Shaped Scene Text Detection with Graph Convolutional Networks [J]. arXiv preprint arXiv:2003.06999.
- Shi-Xue Zhang, Xiaobin Zhu, Jie-Bo Hou, Chang Liu, Chun Yang, Hongfa Wang, Xu-Cheng Yin .Deep Relational Reasoning Graph Network for Arbitrary Shape Text Detection [J]. arXiv preprint arXiv:2003.07493.
- Xinjie Feng, Hongxun Yao, Yuankai Qi, Jun Zhang, Shengping Zhang .Scene Text Recognition via Transformer [J]. arXiv preprint arXiv:2003.08077.
- Qiangpeng Yang, Hongsheng Jin, Jun Huang, Wei Lin .SwapText: Image Based Texts Transfer in Scenes [J]. arXiv preprint arXiv:2003.08152.
- Berat Kurar Barakat, Ahmad Droby, Rym Alasam, Boraq Madi, Irina Rabaev, Raed Shammes, Jihad El-Sana .Unsupervised text line segmentation [J]. arXiv preprint arXiv:2003.08632.
- Sharon Fogel, Hadar Averbuch-Elor, Sarel Cohen, Shai Mazor, Roee Litman .ScrabbleGAN: Semi-Supervised Varying Length Handwritten Text Generation [J]. arXiv preprint arXiv:2003.10557.
- Shangbang Long, Cong Yao .UnrealText: Synthesizing Realistic Scene Text Images from the Unreal World [J]. arXiv preprint arXiv:2003.10608.
- Deli Yu, Xuan Li, Chengquan Zhang, Junyu Han, Jingtuo Liu, Errui Ding .Towards Accurate Scene Text Recognition with Semantic Reasoning Networks [J]. arXiv preprint arXiv:2003.12294.
- Qi Song, Qianyi Jiang, Nan Li, Rui Zhang, Xiaolin Wei .ReADS: A Rectified Attentional Double Supervised Network for Scene Text Recognition [J]. arXiv preprint arXiv:2004.02070.
- Yuxin Wang, Hongtao Xie, Zhengjun Zha, Mengting Xing, Zilong Fu, Yongdong Zhang .ContourNet: Taking a Further Step toward Accurate Arbitrary-shaped Scene Text Detection [J]. arXiv preprint arXiv:2004.04940.
- Ebin Zacharias, Martin Teuchler, Bénédicte Bernier .Image Processing Based Scene-Text Detection and Recognition with Tesseract [J]. arXiv preprint arXiv:2004.08079.
- Zengyuan Guo, Zilin Wang, Zhihui Wang, Wanli Ouyang, Haojie Li, Wen Gao .Location-Aware Feature Selection for Scene Text Detection [J]. arXiv preprint arXiv:2004.10999.
- Meng Cao, Yuexian Zou .All you need is a second look: Towards Tighter Arbitrary shape text detection [J]. arXiv preprint arXiv:2004.12436.
- Wenjia Wang, Enze Xie, Xuebo Liu, Wenhai Wang, Ding Liang, Chunhua Shen, Xiang Bai .Scene Text Image Super-Resolution in the Wild [J]. arXiv preprint arXiv:2005.03341.
- Xiaoxue Chen, Lianwen Jin, Yuanzhi Zhu, Canjie Luo, Tianwei Wang .Text Recognition in the Wild: A Survey [J]. arXiv preprint arXiv:2005.03492.
- Zhaoyi Wan, Jielei Zhang, Liang Zhang, Jiebo Luo, Cong Yao .On Vocabulary Reliance in Scene Text Recognition [J]. arXiv preprint arXiv:2005.03959.
- Atique Ur Rehman, Sibt Ul Hussain .Large Scale Font Independent Urdu Text Recognition System [J]. arXiv preprint arXiv:2005.06752.
- 【Dataset】Sangeeth Reddy, Minesh Mathew, Lluis Gomez, Marcal Rusinol, Dimosthenis Karatzas, C.V. Jawahar .RoadText-1K: Text Detection & Recognition Dataset for Driving Videos [J]. arXiv preprint arXiv:2005.09496.
- Zhi Qiao, Yu Zhou, Dongbao Yang, Yucan Zhou, Weiping Wang .SEED: Semantics Enhanced Encoder-Decoder Framework for Scene Text Recognition [J]. arXiv preprint arXiv:2005.10977.
- Yudi Chen, Wei Wang, Yu Zhou, Fei Yang, Dongbao Yang, Weiping Wang .Self-Training for Domain Adaptive Scene Text Detection [J]. arXiv preprint arXiv:2005.11487.
- Mayank Kumar Singh, Sayan Banerjee, Shubhasis Chaudhuri .NENET: An Edge Learnable Network for Link Prediction in Scene Text [J]. arXiv preprint arXiv:2005.12147.
- Sihwan Kim, Taejang Park .Learning Robust Feature Representations for Scene Text Detection [J]. arXiv preprint arXiv:2005.12466.
- Sauradip Nag, Palaiahnakote Shivakumara, Umapada Pal, Tong Lu, Michael Blumenstein .A New Unified Method for Detecting Text from Marathon Runners and Sports Players in Video [J]. arXiv preprint arXiv:2005.12524.
- Lei Kang, Pau Riba, Marçal Rusiñol, Alicia Fornés, Mauricio Villegas .Pay Attention to What You Read: Non-recurrent Handwritten Text-Line Recognition [J]. arXiv preprint arXiv:2005.13044.
- Chengwei Zhang, Yunlu Xu, Zhanzhan Cheng, Shiliang Pu, Yi Niu, Fei Wu, Futai Zou .SPIN: Structure-Preserving Inner Offset Network for Scene Text Recognition [J]. arXiv preprint arXiv:2005.13117.
- Peng Zhang, Yunlu Xu, Zhanzhan Cheng, Shiliang Pu, Jing Lu, Liang Qiao, Yi Niu, Fei Wu .TRIE: End-to-End Text Reading and Information Extraction for Document Understanding [J]. arXiv preprint arXiv:2005.13118.
- Arseny Nerinovsky, Igor Buzhinsky, Andey Filchencov .Realistic text replacement with non-uniform style conditioning [J]. arXiv preprint arXiv:2006.04170.
- Zobeir Raisi, Mohamed A. Naiel, Paul Fieguth, Steven Wardell, John Zelek .Text Detection and Recognition in the Wild: A Review [J]. arXiv preprint arXiv:2006.04305.
- Youngmin Baek, Daehyun Nam, Sungrae Park, Junyeop Lee, Seung Shin, Jeonghun Baek, Chae Young Lee, Hwalsuk Lee .CLEval: Character-Level Evaluation for Text Detection and Recognition Tasks [J]. arXiv preprint arXiv:2006.06244.
- Mohamed Yousef, Tom E. Bishop .OrigamiNet: Weakly-Supervised, Segmentation-Free, One-Step, Full Page Text Recognition by learning to unfold [J]. arXiv preprint arXiv:2006.07491.
- Shota Sakaguchi, Jun Kato, Masataka Goto, Seiichi Uchida .Lyric Video Analysis Using Text Detection and Tracking [J]. arXiv preprint arXiv:2006.11933.
- Jinghuang Lin, Zhanzhan Cheng, Fan Bai, Yi Niu, Shiliang Pu, Shuigeng Zhou .Text Recognition in Real Scenarios with a Few Labeled Samples [J]. arXiv preprint arXiv:2006.12209.
- Riku Anegawa, Masayoshi Aritsugi .Text Detection on Roughly Placed Books by Leveraging a Learning-based Model Trained with Another Domain Data [J]. arXiv preprint arXiv:2006.14808.
- Sahar Siddiqui, Elena Sizikova, Gemma Roig, Najib J. Majaj, Denis G. Pelli .Using Human Psychophysics to Evaluate Generalization in Scene Text Recognition Models [J]. arXiv preprint arXiv:2007.00083.
- Siddhant Bansal, Praveen Krishnan, C.V. Jawahar .Fused Text Recogniser and Deep Embeddings Improve Word Recognition and Retrieval [J]. arXiv preprint arXiv:2007.00166.
- Thiago M. Paixão, Rodrigo F. Berriel, Maria C. S. Boeres, Alessandro L. Koerich, Claudine Badue, Alberto F. de Souza, Thiago Oliveira-Santos .Self-supervised Deep Reconstruction of Mixed Strip-shredded Text Documents [J]. arXiv preprint arXiv:2007.00779.
- Klára Janoušková, Jiri Matas, Lluis Gomez, Dimosthenis Karatzas .Text Recognition -- Real World Data and Where to Find Them [J]. arXiv preprint arXiv:2007.03098.
- Changxu Cheng, Wuheng Xu, Xiang Bai, Bin Feng, Wenyu Liu .Maximum Entropy Regularization and Chinese Text Recognition [J]. arXiv preprint arXiv:2007.04651.
- Xugong Qin, Yu Zhou, Dayan Wu, Yinliang Yue, Weiping Wang .FC2RN: A Fully Convolutional Corner Refinement Network for Accurate Multi-Oriented Scene Text Detection [J]. arXiv preprint arXiv:2007.05113.
- Hanchi Ren, Jingjing Deng, Xianghua Xie .Privacy Preserving Text Recognition with Gradient-Boosting for Federated Learning [J]. arXiv preprint arXiv:2007.07296.
- Xiaoyu Yue, Zhanghui Kuang, Chenhao Lin, Hongbin Sun, Wayne Zhang .RobustScanner: Dynamically Enhancing Positional Clues for Robust Text Recognition [J]. arXiv preprint arXiv:2007.07542.
- Minghui Liao, Guan Pang, Jing Huang, Tal Hassner, Xiang Bai .Mask TextSpotter v3: Segmentation Proposal Network for Robust Scene Text Spotting [J]. arXiv preprint arXiv:2007.09482.
- Youngmin Baek, Seung Shin, Jeonghun Baek, Sungrae Park, Junyeop Lee, Daehyun Nam, Hwalsuk Lee .Character Region Attention For Text Spotting [J]. arXiv preprint arXiv:2007.09629.
- Wenqing Zhang, Yang Qiu, Song Bai, Rui Zhang, Xiaolin Wei, Xiang Bai .FedOCR: Communication-Efficient Federated Learning for Scene Text Recognition [J]. arXiv preprint arXiv:2007.11462.
- Wenhai Wang, Xuebo Liu, Xiaozhong Ji, Enze Xie, Ding Liang, Zhibo Yang, Tong Lu, Chunhua Shen, Ping Luo .AE TextSpotter: Learning Visual and Linguistic Representation for Ambiguous Text Spotting [J]. arXiv preprint arXiv:2008.00714.
- Konstantin Bulatov, Nadezhda Fedotova, Vladimir V. Arlazarov .Fast Approximate Modelling of the Next Combination Result for Stopping the Text Recognition in a Video [J]. arXiv preprint arXiv:2008.02566.
- Fangfang Wang, Yifeng Chen, Fei Wu, Xi Li .TextRay: Contour-based Geometric Modeling for Arbitrary-shaped Scene Text Detection [J]. arXiv preprint arXiv:2008.04851.
- Abdelrahman Abdallah, Mohamed Hamada, Daniyar Nurseitov .Attention-based Fully Gated CNN-BGRU for Russian Handwritten Text [J]. arXiv preprint arXiv:2008.05373.
- Kartik Chaudhary, Raghav Bali .EASTER: Efficient and Scalable Text Recognizer [J]. arXiv preprint arXiv:2008.07839.
- Anna Zhu, Hang Du, Shengwu Xiong .Scene Text Detection with Selected Anchor [J]. arXiv preprint arXiv:2008.08523.
- Shengjun Liu, Ningkang Jiang, Yuanbin Wu .Visual Attack and Defense on Text [J]. arXiv preprint arXiv:2008.10356.
- Chenhan Zhang .Complicating the Social Networks for Better Storytelling: An Empirical Study of Chinese Historical Text and Novel [J]. arXiv preprint arXiv:2008.10835.
- Chunhui Li, Xingshu Chen, Haizhou Wang, Yu Zhang, Peiming Wang .An End-to-End Attack on Text-based CAPTCHAs Based on Cycle-Consistent Generative Adversarial Network [J]. arXiv preprint arXiv:2008.11603.
- Brian Davis, Chris Tensmeyer, Brian Price, Curtis Wigington, Bryan Morse, Rajiv Jain .Text and Style Conditioned GAN for Generation of Offline Handwriting Lines [J]. arXiv preprint arXiv:2009.00678.
- Weijia Wu, Ning Lu, Enze Xie .Synthetic-to-Real Unsupervised Domain Adaptation for Scene Text Detection in the Wild [J]. arXiv preprint arXiv:2009.01766.
- Mohammad Fasha, Bassam Hammo, Nadim Obeid, Jabir Widian .A Hybrid Deep Learning Model for Arabic Text Recognition [J]. arXiv preprint arXiv:2009.01987.
- 【Dataset】Julián Del Gobbo, Rosana Matuk Herrera .Unconstrained Text Detection in Manga: a New Dataset and Baseline [J]. arXiv preprint arXiv:2009.04042.
- Hung Tuan Nguyen, Cuong Tuan Nguyen, Takeya Ino, Bipin Indurkhya, Masaki Nakagawa .Text-independent writer identification using convolutional neural network [J]. arXiv preprint arXiv:2009.04877.
- Chuhan Zhang, Ankush Gupta, Andrew Zisserman .Adaptive Text Recognition through Visual Matching [J]. arXiv preprint arXiv:2009.06610.
- Pawan Kumar Singh, Iman Chatterjee, Ram Sarkar, Mita Nasipuri .Handwritten Script Identification from Text Lines [J]. arXiv preprint arXiv:2009.07433.
- Yizhi Wang, Zhouhui Lian .Exploring Font-independent Features for Scene Text Recognition [J]. arXiv preprint arXiv:2009.07447.
- Andres Mafla, Sounak Dey, Ali Furkan Biten, Lluis Gomez, Dimosthenis Karatzas .Multi-Modal Reasoning Graph for Scene-Text Based Fine-Grained Image Classification and Retrieval [J]. arXiv preprint arXiv:2009.09809.
- Bingcong Li, Xin Tang, Xianbiao Qi, Yihao Chen, Rong Xiao .Hamming OCR: A Locality Sensitive Hashing Neural Network for Scene Text Recognition [J]. arXiv preprint arXiv:2009.10874.
- Jianqi Ma .RRPN++: Guidance Towards More Accurate Scene Text Detection [J]. arXiv preprint arXiv:2009.13118.
- Julián Del Gobbo, Rosana Matuk Herrera .Unconstrained Text Detection in Manga [J]. arXiv preprint arXiv:2010.03997.
- Shao Wei Wang, Guan Jie Huang, Xiang Yu Luo .A Human Eye-based Text Color Scheme Generation Method for Image Synthesis [J]. arXiv preprint arXiv:2010.07510.
- Zhi Qiao, Xugong Qin, Yu Zhou, Fei Yang, Weiping Wang .Gaussian Constrained Attention Network for Scene Text Recognition [J]. arXiv preprint arXiv:2010.09169.
- Dongyoung Kim, Myungsung Kwak, Eunji Won, Sejung Shin, Jeongyeon Nam .TLGAN: document Text Localization using Generative Adversarial Nets [J]. arXiv preprint arXiv:2010.11547.
- Shuonan Pei, Mingzhi Zhu .Real-Time Text Detection and Recognition [J]. arXiv preprint arXiv:2011.00380.
- Shubham Vatsal, Nikhil Arora, Gopi Ramena, Sukumar Moharana, Dhruval Jain, Naresh Purre, Rachit S Munjal .On-Device Language Identification of Text in Images using Diacritic Characters [J]. arXiv preprint arXiv:2011.05108.
- Shruti Rijhwani, Antonios Anastasopoulos, Graham Neubig .OCR Post Correction for Endangered Language Texts [J]. arXiv preprint arXiv:2011.05402.
- Kunhong Yu, Yuze Zhang .Digging Deeper into CRNN Model in Chinese Text Images Recognition [J]. arXiv preprint arXiv:2011.08505.
- Xuewei Bian, Chaoqun Wang, Weize Quan, Juntao Ye, Xiaopeng Zhang, Dong-Ming Yan .Scene text removal via cascaded text stroke detection and erasing [J]. arXiv preprint arXiv:2011.09768.
- Yuanqiang Cai, Chang Liu, Weiqiang Wang, Qixiang Ye .Towards Spatio-Temporal Video Scene Text Detection via Temporal Clustering [J]. arXiv preprint arXiv:2011.09781.
- Dhruval Jain, Arun D Prabhu, Gopi Ramena, Manoj Goyal, Debi Prasanna Mohanty, Sukumar Moharana, Naresh Purre .On-Device Text Image Super Resolution [J]. arXiv preprint arXiv:2011.10251.
- Weijia Wu, Enze Xie, Ruimao Zhang, Wenhai Wang, Guan Pang, Zhen Li, Hong Zhou, Ping Luo .SelfText Beyond Polygon: Unconstrained Text Detection with Box Supervision and Dynamic Self-Training [J]. arXiv preprint arXiv:2011.13307.
- Chuang Yang, Zhitong Xiong, Mulin Chen, Qi Wang, Xuelong Li .BOTD: Bold Outline Text Detector [J]. arXiv preprint arXiv:2011.14714.
- Mengbiao Zhao, Wei Feng, Fei Yin, Xu-Yao Zhang, Cheng-Lin Liu .Weakly-Supervised Arbitrary-Shaped Text Detection with Expectation-Maximization Algorithm [J]. arXiv preprint arXiv:2012.00424.
- José Carlos Aradillas, Juan José Murillo-Fuentes, Pablo M. Olmos .Boosting offline handwritten text recognition in historical documents with few labeled lines [J]. arXiv preprint arXiv:2012.02544.
- Denis Coquenet, Clément Chatelain, Thierry Paquet .End-to-end Handwritten Paragraph Text Recognition Using a Vertical Attention Network [J]. arXiv preprint arXiv:2012.03868.
- Andrés Mafla, Rafael Sampaio de Rezende, Lluís Gómez, Diane Larlus, Dimosthenis Karatzas .StacMR: Scene-Text Aware Cross-Modal Retrieval [J]. arXiv preprint arXiv:2012.04329.
- Liang Qiao, Ying Chen, Zhanzhan Cheng, Yunlu Xu, Yi Niu, Shiliang Pu, Fei Wu .MANGO: A Mask Attention Guided One-Stage Scene Text Spotter [J]. arXiv preprint arXiv:2012.04350.
- Denis Coquenet, Yann Soullard, Clément Chatelain, Thierry Paquet .Have convolutions already made recurrence obsolete for unconstrained handwritten text recognition? [J]. arXiv preprint arXiv:2012.04954.
- Denis Coquenet, Clément Chatelain, Thierry Paquet .Recurrence-free unconstrained handwritten text recognition using gated fully convolutional network [J]. arXiv preprint arXiv:2012.04961.
- Wenqing Zhang, Yang Qiu, Minghui Liao, Rui Zhang, Xiaolin Wei, Xiang Bai .Scene Text Detection with Scribble Lines [J]. arXiv preprint arXiv:2012.05030.
- Fukang Tian, Haiyu Wu, Bo Xu .Research on All-content Text Recognition Method for Financial Ticket Image [J]. arXiv preprint arXiv:2012.08168.
- Xuan Qin, Meizhu Liu, Yifan Hu, Christina Moo, Christian M. Riblet, Changwei Hu, Kevin Yen, Haibin Ling .Political Posters Identification with Appearance-Text Fusion [J]. arXiv preprint arXiv:2012.10728.
- Ron Slossberg, Oron Anschel, Amir Markovitz, Ron Litman, Aviad Aberdam, Shahar Tsiper, Shai Mazor, Jon Wu, R. Manmatha .On Calibration of Scene-Text Recognition Models [J]. arXiv preprint arXiv:2012.12643.
- Mélodie Boillet, Christopher Kermorvant, Thierry Paquet .Multiple Document Datasets Pre-training Improves Text Line Detection With Deep Neural Networks [J]. arXiv preprint arXiv:2012.14163.
- Vasiliki Tassopoulou, George Retsinas, Petros Maragos .Enhancing Handwritten Text Recognition with N-gram sequence decomposition and Multitask Learning [J]. arXiv preprint arXiv:2012.14459.
- Sagar Gubbi, Bharadwaj Amrutur .Scene Text Detection for Augmented Reality -- Character Bigram Approach to reduce False Positive Rate [J]. arXiv preprint arXiv:2101.01054.
- Fukang Tian, Haiyu Wu, Bo Xu .Research on Fast Text Recognition Method for Financial Ticket Image [J]. arXiv preprint arXiv:2101.01310.
- Rulin Shao, Zhouxing Shi, Jinfeng Yi, Pin-Yu Chen, Cho-Jui Hsieh .Robust Text CAPTCHAs Using Adversarial Examples [J]. arXiv preprint arXiv:2101.02483.
- Berat Kurar Barakat, Ahmad Droby, Reem Alaasam, Boraq Madi, Irina Rabaev, Jihad El-Sana .Text line extraction using fully convolutional network and energy minimization [J]. arXiv preprint arXiv:2101.07370.
- Berat Kurar Barakat, Rafi Cohen, Irina Rabaev, Jihad El-Sana .VML-MOC: Segmenting a multiply oriented and curved handwritten text lines dataset [J]. arXiv preprint arXiv:2101.07542.
- Berat Barakat, Ahmad Droby, Majeed Kassis, Jihad El-Sana .Text Line Segmentation for Challenging Handwritten Document Images Using Fully Convolutional Network [J]. arXiv preprint arXiv:2101.08299.
- Christian M. Dahl, Torben Johansen, Emil N. Sørensen, Simon Wittrock .HANA: A HAndwritten NAme Database for Offline Handwritten Text Recognition [J]. arXiv preprint arXiv:2101.10862.
Three websites maintain dataset lists covering different data types:
1 - www.iapr-tc11.org
2 - tc11.cvc.uab.es
3 - rrc.cvc.uab.es
-
2017 COCO-Text
2017 DeTEXT
2017 DOST
2017 FSNS
2017 MLT
2017 IEHHR
2011-2015 Born-Digital Images
2013-2015 Focused Scene Text
2013-2015 Text in Videos
2015 Incidental Scene Text
-
ICDAR Chinese
2017
- More than 12,000 images, most of them collected in the wild with phone cameras.
- Task: Chinese Text in the Wild.
-
- 32,285 high resolution images, 1,018,402 character instances, 3,850 character categories, 6 kinds of attributes
-
Total-Text
2017
- 1555 images, 11459 text instances, includes curved text
-
SCUT_FORU_DB_Release
2016
- FORU contains two parts: the Chinese2k and English2k datasets.
-
SynthText in the Wild Dataset
2016
- 800 thousand images, 8 million synthetic word instances.
- Each text instance is annotated with its text-string, word-level and character-level bounding-boxes.
-
COCO-Text (Computer Vision Group, Cornell)
2016
- 63,686 images, 173,589 text instances, 3 fine-grained text attributes.
- Task: text location and recognition
COCO-Text API
-
USTB-SV1k
2014
- 1000 (500 for training and 500 for testing) street view (patch) images from 6 US cities
-
Synthetic Word Dataset (Oxford, VGG)
2014
- 9 million images covering 90k English words
- Task: text recognition, segmentation
download
-
IIIT 5K-Words
2012
- 5000 images from scene text and born-digital sources (2k training and 3k testing images)
- Each image is a cropped word image of scene text with case-insensitive labels
- Task: text recognition
download
-
StanfordSynth (Stanford, AI Group)
2012
- Small single-character images of 62 characters (0-9, a-z, A-Z)
- Task: text recognition
download
-
MSRA Text Detection 500 Database (MSRA-TD500)
2012
- 500 natural images (resolutions vary from 1296x864 to 1920x1280)
- Chinese, English or mixture of both
- Task: text detection
-
OSTD
2011
- Download link could not be found
-
Traffic Guide Panel Text Dataset, TGPT
2016
- 3841 high-resolution individual images: 2315 contain traffic guide panel level annotations (1911 for training and 404 for testing; all testing images are manually labeled with ground-truth tight text region bounding boxes), and 1526 contain no traffic signs.
-
- 350 high resolution images (average size 1260 × 860) (100 images for training and 250 images for testing)
- Only word level bounding boxes are provided with case-insensitive labels
- Task: text location
-
KAIST Scene_Text Database
2010
- 3000 images of indoor and outdoor scenes containing text
- Korean, English (Number), and Mixed (Korean + English + Number)
- Task: text location, segmentation and recognition
-
Chars74k
2009
- Over 74K images from natural images, as well as a set of synthetically generated characters
- Small single-character images of 62 characters (0-9, a-z, A-Z)
- Task: text recognition
-
ICDAR Benchmark Datasets
Dataset | Description | Competition Paper |
---|---|---|
ICDAR 2015 | 1000 training images and 500 testing images | paper |
ICDAR 2013 | 229 training images and 233 testing images | paper |
ICDAR 2011 | 229 training images and 255 testing images | paper |
ICDAR 2005 | 1001 training images and 489 testing images | paper |
ICDAR 2003 | 181 training images and 251 testing images (word level and character level) | paper |