pycognaize.common.utils.clean_ocr_data

clean_ocr_data(ocr_data, thresh=4.0)[source]

Cleans the ocr data

Parameters:
  • ocr_data (dict) – OCR data dictionary, where values under the key words have the following form: {‘top’}

  • thresh (float) – Words that

Return type:

dict

Returns: