Shotor (meaning "camel" in Persian) is a free synthetic dataset for word-level OCR.
The current version contains 120,000 grayscale 50×100 images and their corresponding words. The words contain only alphabetic characters.
Note: To train a robust model, apply augmentations such as scaling, translation, additive noise, and so on to the images.
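As a rough illustration of the augmentations mentioned above, here is a minimal NumPy-only sketch that randomly scales, translates, and adds Gaussian noise to one image. The function name `augment`, its parameters and default values, the white (255) fill for uncovered pixels, and the assumption that each image is a 2-D `uint8` array of 50 rows by 100 columns are all illustrative choices, not part of the dataset itself.

```python
import numpy as np

def augment(image, rng, max_shift=4, scale_range=(0.9, 1.1), noise_std=8.0, fill=255):
    """Randomly scale, translate, and add Gaussian noise to a 2-D grayscale uint8 image.

    All parameter defaults are hypothetical; tune them for your own model.
    """
    h, w = image.shape
    scale = rng.uniform(*scale_range)              # random zoom factor
    dx = rng.integers(-max_shift, max_shift + 1)   # horizontal shift in pixels
    dy = rng.integers(-max_shift, max_shift + 1)   # vertical shift in pixels

    # Inverse mapping: for every output pixel, find the nearest source pixel
    # after scaling about the image centre and shifting by (dx, dy).
    ys, xs = np.mgrid[0:h, 0:w]
    cy, cx = (h - 1) / 2.0, (w - 1) / 2.0
    src_y = np.rint((ys - cy - dy) / scale + cy).astype(int)
    src_x = np.rint((xs - cx - dx) / scale + cx).astype(int)

    # Pixels that map outside the source keep the fill value (assumed background).
    inside = (src_y >= 0) & (src_y < h) & (src_x >= 0) & (src_x < w)
    out = np.full((h, w), fill, dtype=np.float32)
    out[inside] = image[src_y[inside], src_x[inside]]

    # Additive Gaussian noise, then clip back to the valid 8-bit range.
    out += rng.normal(0.0, noise_std, size=out.shape)
    return np.clip(out, 0, 255).astype(np.uint8)

# Usage with a stand-in image (a dark bar on a white background).
rng = np.random.default_rng(0)
image = np.full((50, 100), 255, dtype=np.uint8)
image[20:30, 30:70] = 0
augmented = augment(image, rng)
```

Nearest-neighbour sampling keeps the sketch dependency-free; in practice, an image library with interpolated affine warps would probably give smoother results.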
For an example of using the Shotor dataset, see this notebook:
A simple word level OCR for Persian Language using Pytorch and OpenCV
I used these resources to create the word lists:
The images have been generated using multiple fonts:
Created by: Amirabbas Asadi (amir137825@gmail.com)