invoice2data is a command line tool and Python library to support accounting processes. Regular expressions can be used to customize data extraction. See https://github.com/invoice-x/invoice2data.
invoice2data is a command line tool and Python library to support accounting processes. It can extract text from PDF files using different techniques, search for regex in the result using a YAML-based template system and save results as CSV, JSON or XML or renames PDF files to match the content. See https://github.com/invoice-x/invoice2data.