All technological notes.
Amazon Athena
S3SQL language to query the files (built on Presto)CSV, JSON, ORC, Avro, and ParquetCommonly used with Amazon Quicksight for reporting/dashboards
Use cases:
S3 using serverless SQL, use Athena
Use columnar data for cost-savings (less scan)
Apache Parquet or ORC is recommendedGlue to convert your data to Parquet or ORCs3://yourBucket/pathToTable
/<PARTITION_COLUMN_NAME>=<VALUE>
/<PARTITION_COLUMN_NAME>=<VALUE>
/<PARTITION_COLUMN_NAME>=<VALUE>
/etc…
s3://athena-examples/flight/parquet/year=1991/month=1/day=1/Federated Query
Data Source Connectors that run on AWS Lambda to run Federated Queries (e.g., CloudWatch Logs, DynamoDB, RDS, …)S3




