apache-arrow Questions

Debian – PostgreSQL authentication fails with Apache Arrow Flight SQL Driver

July 17, 2024
usdn
2 Answers

Original post I want to try out the Apache Arrow Flight SQL Driver for a large OLAP query on a PostgreSQL database. When I run the following example: import adbc_driver_flightsql.dbapi import adbc_driver_manager conn = adbc_driver_flightsql.dbapi.connect('flightsql://username:[email protected]:5432/database') with conn.cursor() as cur: cur.execute("SELECT…

VIEW QUESTION

Pyarrow slice pushdown for Azure data lake

March 19, 2023
Luca
2 Answers

I want to access Parquet files on an Azure data lake, and only retrieve some rows. Here is a reproducible example, using a public dataset: import pyarrow.dataset as ds from adlfs import AzureBlobFileSystem abfs_public = AzureBlobFileSystem( account_name="azureopendatastorage") dataset_public = ds.dataset('az://nyctlc/yellow/puYear=2010/puMonth=1/part-00000-tid-8898858832658823408-a1de80bd-eed3-4d11-b9d4-fa74bfbd47bc-426339-18.c000.snappy.parquet',…

VIEW QUESTION

R + Arrow 10 : convert blank to numeric NA – Debian

November 7, 2022
larry77
2 Answers

Please have a look at the reprex at the end of the post. I need to read a column as a string, perform several manipulations and then save convert it to a numerical column. The blanks ("") in the string…

VIEW QUESTION