Overview

You can configure Connections, Sources, and Endpoints through a YAML configuration file. For example:

app_name: dozer-taxi-apis

# Define list of connections to use
connections:
  - config: !LocalStorage
      details:
        path: data
      tables:
        - !Table
          name: trips
          prefix: /trips
          file_type: parquet
          extension: .parquet
    name: ny_taxi

# You can define multiple transformations to perform on several connections
sql: |
  SELECT
    PULocationID as pickup_location,
    DOLocationID as dropoff_location,
    COUNT(PULocationID, DOLocationID) as total_trips,
    MIN(trip_time) as min_trip_time,
    MAX(trip_time) as max_trip_time
  INTO trips_cache
  FROM trips
  GROUP BY PULocationID, DOLocationID
  HAVING COUNT(PULocationID, DOLocationID) > 1000;


# Sources to be used. Here columns can be filtered
sources:
  - name: trips
    table_name: trips
    connection: !Ref ny_taxi
    columns:

# APIs to automatically publish
endpoints:
  # Endpoint entity name. Used in generating a gRPC service
  - name: trips_cache
    # Rest Endpoint
    path: /trips
    table_name: trips_cache
    # configure primary and secondary indexes
    index:
      primary_key:
        - pickup_location
        - dropoff_location

Refer to the Full Configuration here.
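
To make the sql transformation in the example configuration more concrete, the following is a plain Python sketch of the same grouping logic. It is not how Dozer executes the query. The sample rows and the `aggregate_trips` helper are hypothetical, and the sketch assumes that COUNT counts the rows in each (pickup, dropoff) group.

```python
from collections import defaultdict

# Hypothetical sample rows: (PULocationID, DOLocationID, trip_time).
# These values are made up for illustration only.
trips = [
    (1, 2, 300),
    (1, 2, 450),
    (1, 2, 120),
    (3, 4, 900),
]


def aggregate_trips(rows, min_trips):
    """Group rows by (pickup, dropoff) and keep groups with more than min_trips rows."""
    groups = defaultdict(list)
    for pickup, dropoff, trip_time in rows:
        groups[(pickup, dropoff)].append(trip_time)

    result = []
    for (pickup, dropoff), times in groups.items():
        # Mirrors the HAVING clause, which uses a threshold of 1000 in the config.
        if len(times) > min_trips:
            result.append({
                "pickup_location": pickup,
                "dropoff_location": dropoff,
                "total_trips": len(times),
                "min_trip_time": min(times),
                "max_trip_time": max(times),
            })
    return result


# A threshold of 2 is used here so that the tiny sample produces a row.
trips_cache = aggregate_trips(trips, min_trips=2)
```

Each dictionary in `trips_cache` corresponds to one record of the trips_cache table, keyed by the same pickup_location and dropoff_location pair that the endpoint declares as its primary key.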

Env Variables

To use an environment variable in the configuration file, surround its name with {{ and }}.

app_name: {{APP_NAME}}
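
The substitution described above can be pictured with a small Python sketch. This is only an illustration of the idea, not Dozer's implementation: the `render_config` helper is hypothetical, and both the tolerance for spaces inside the braces and the error raised for an unset variable are assumptions.

```python
import os
import re

# Matches {{NAME}}, allowing optional spaces inside the braces (an assumption).
_PLACEHOLDER = re.compile(r"\{\{\s*([A-Za-z_][A-Za-z0-9_]*)\s*\}\}")


def render_config(text, env=None):
    """Replace each {{NAME}} in text with env[NAME]; raise KeyError if it is not set."""
    env = os.environ if env is None else env

    def substitute(match):
        name = match.group(1)
        if name not in env:
            raise KeyError(f"environment variable {name} is not set")
        return env[name]

    return _PLACEHOLDER.sub(substitute, text)


# An explicit mapping stands in for the real environment in this example.
rendered = render_config("app_name: {{APP_NAME}}", {"APP_NAME": "dozer-taxi-apis"})
```

Here `rendered` holds the line with the placeholder replaced by the value of APP_NAME. Calling `render_config` without the second argument reads from the process environment instead.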