A Flatfile plugin for extracting table data from HTML files
npm i @flatfile/plugin-extract-html-table
This plugin provides HTML table extraction capabilities for Flatfile. It parses HTML files and extracts structured data from tables, handling complex layouts and nested tables.
Event Type: listener.on('file:created')
Supported File Types: .html
options
- object
- (optional)handleColspan
- boolean
- (optional): Determines how to handle colspan. Default is true.handleRowspan
- boolean
- (optional): Determines how to handle rowspan. Default is true.maxDepth
- number
- (optional): Maximum depth for nested tables. Default is 3.debug
- boolean
- (optional): Enables debug logging. Default is false.api.files.download
api.files.update
install
npm install @flatfile/plugin-extract-html-table
import
import { HTMLTableExtractor } from '@flatfile/plugin-extract-html-table';
listener.js
const listener = new FlatfileListener();
listener.use(
HTMLTableExtractor({
handleColspan: true,
handleRowspan: true,
maxDepth: 3,
debug: false
})
);