Today, cameras are installed everywhere: in streets, in vehicles, and in public areas. However, the images they capture still need to be analyzed and have information extracted from them, particularly in autonomous vehicles and in smart applications developed to guide tourists. A large dataset of scene text images is therefore an important yet challenging factor in extracting textual information from natural images, as it is the input to any computer vision system.