segments text from scene images. Three text-specific features are designed over image
edges and used to first detect a set of candidate text boundaries. For each detected
candidate text boundary, one or more candidate characters are then extracted using a
local threshold estimated from the surrounding image pixels. The real characters
and words are finally identified by a support vector regression model that is trained using …