pycognaize.document.tag.html_tag.HTMLTableTag
- class HTMLTableTag(tag_id, value, ocr_value, xpath, title, html_id, cell_data, html, source_ids, is_table=True)
Bases: HTMLTagABC
Represents a table’s coordinate data in an XBRL document.
- Parameters:
tag_id (str)
value (str)
ocr_value (str)
xpath (str)
title (str)
html_id (Union[str, List[str]])
cell_data (dict)
html (HTML)
is_table (bool)
Methods
construct_from_raw(raw, html): Builds an HTMLTableTag object from pycognaize raw data.
Replaces NaN values in a DataFrame with empty HTML tags.
set_class_confidence
Converts HTMLTableTag to dict.
Attributes
cell_data
cells
df
html
html_id
is_table
ocr_value
raw_df
source_ids
tag_id
title
value
xpath
- classmethod construct_from_raw(raw, html)
Builds an HTMLTableTag object from pycognaize raw data.
- Return type:
- Returns:
- Parameters:
raw (dict): pycognaize field’s tag info
html (HTML): HTML
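The classmethod pattern described above can be illustrated with a small stand-in. This is a minimal sketch, not pycognaize's implementation: the class name `TableTagSketch`, every key name read from `raw`, and the types given to `html` and `source_ids` are assumptions made for illustration. Only the argument names and their order are taken from the documented signature.

```python
from dataclasses import dataclass
from typing import Any, Dict, List, Union


@dataclass
class TableTagSketch:
    """Hypothetical stand-in mirroring the documented HTMLTableTag arguments."""

    tag_id: str
    value: str
    ocr_value: str
    xpath: str
    title: str
    html_id: Union[str, List[str]]
    cell_data: dict
    html: Any          # the real class expects an HTML object; Any is an assumption
    source_ids: Any    # type is not documented; Any is an assumption
    is_table: bool = True

    @classmethod
    def construct_from_raw(cls, raw: Dict[str, Any], html: Any) -> "TableTagSketch":
        """Build a tag from a raw dict plus a separately supplied HTML object."""
        # All key names below are assumed; the real raw schema is not documented here.
        return cls(
            tag_id=raw["tag_id"],
            value=raw.get("value", ""),
            ocr_value=raw.get("ocr_value", ""),
            xpath=raw.get("xpath", ""),
            title=raw.get("title", ""),
            html_id=raw.get("html_id", ""),
            cell_data=raw.get("cell_data", {}),
            html=html,
            source_ids=raw.get("source_ids", []),
        )


# Hypothetical raw payload, for illustration only.
raw_example = {
    "tag_id": "t1",
    "value": "Revenue",
    "xpath": "/html/body/table[1]",
    "cell_data": {"1:1": {"value": "Revenue"}},
}
tag = TableTagSketch.construct_from_raw(raw_example, html="<table></table>")
```

Keeping `html` as a separate argument rather than a key inside `raw` follows the documented signature `construct_from_raw(raw, html)`. The defaults used for missing keys are a choice of this sketch.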