Hi all, I’ve been using Observable Framework for a few weeks, and I ran into a strange behaviour with SQL.
I have a parquet file that I download from Google BigQuery created from a Python data loader. From what I can tell, the Parquet file is fine, and has all the fields, the right schema, the right data, the works.
With my code, I pop this SQL in… This works, and confirms that the whole thing is ok.
select
resource
from
metric_detail
where
owner = ${owner}
I’d like to expand the columns. So I run the actual query I want…
select
resource,
title
from
metric_detail
where
owner = ${owner}
This is where the problem comes in – I get 0 rows. Upon further testing, I tried this…
select
*
from
metric_detail
where
owner = ${owner}
The headings update, and they show all the columns, yet again, no data returned.
I did observe something interesting. The query
select distinct title
from metric_detail
did give me a weird error in the browser console.
Error: Invalid Error: TProtocolException: Invalid data
at U.onMessage (_esm.js:7:11151)
Some additional info – changing the data format from parquet to json has worked. Json is not desirable as it is way too big.
Any suggestions on what might be the issue with the parquet file?