I'm trying to use Firehose to ingest some data. Here are the parameters:
- Json data with schema in the Glue Schema Registry
- Want to convert json -> parquet
From this post, it seems like Firehose cannot read a tables schema if said table is created from an existing schema. Can anyone confirm this? Like in that post I also get the error message:
The schema is invalid. The specified table has no columns.
My other options are to use a crawler or create the table manually. I'd like to name the table myself so I want to go with the latter.
Is there a way to have Firehose update the schema of a manually created table in Glue or is the Crawler my only option?
I could also just do the parquet conversion myself in a glue job but I'd rather Firehose do it if possible.