For a project, I convert Excel documents to JSON with in the Java application using Apache Poi. In the future, this task shall be done using AWS Lambda, because it currently it can take very long (up to 20sec) and has a high memory consumption.
Requirements:
- Support XLS and XLSX format
- Evaluate formulas (so streaming is not possible?)
- up to 100.000 Rows but smaller than 1MB
With AWS Lamda I now can use, Java, Python or NodeJs. My question is: Is my Apache POI approach the way to go or are there more suitable frameworks? E.g. sheetjs seems to be a good candidate. I was not able to find a up to date performance comparison of such frameworks.