Architecture Discussion

Equations

The solution I developed first begins with determining the level of preprocessing the image requires in order to be robust to lighting conditions. It does this by evaluating the mean lightness of the image, then determining if the lightness needs increased via CLAHE and by how much.

From there, the various regions of the image are segmented in an attempt to isolate all the text regions, with some additional filtering to reduce guaranteed false positives based on things like aspect ratio and non-maximal suppression to eliminate redundant samples.

The last major step is the classification step. First each region is determined to either contain text or not, which is done via a CNN for text discrimination as well as a Stroke-Width Transform filter. The remaining regions are then classified into digits, and their position within the image is retained so they can be sorted later.

The regions are then sorted and the digits present in the image are returned in order,

Skyler Horn

Architecture Discussion

Equations

Results

Code

Efficacy

Architecture Discussion

Equations

Results

Code

Efficacy

This website uses cookies.