AWS

JustAI can ingest data from a shared S3 bucket. Use this connection to send any custom metrics that you want to track.

  1. Create an IAM role for the export:
    1. Recommended Permissions: s3:PutObject, s3:ListBucket, s3:GetBucketLocation
    2. Share the role’s ARN with JustAI. The JustAI team will then grant that role read/write permissions on a shared S3 bucket.
  2. Create a daily data export that writes data to this S3 bucket: s3://justwords-metrics-ingest/<org>
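As a rough illustration of step 1, the recommended permissions could be expressed as an IAM policy document like the one built below. This is only a sketch: the helper name `build_export_policy` and the idea of scoping writes to the `<org>` prefix are assumptions, not something JustAI prescribes, and the actual grant on the shared bucket is made by the JustAI team.

```python
import json

BUCKET = "justwords-metrics-ingest"  # bucket name taken from the export path above


def build_export_policy(org: str) -> dict:
    """Return an IAM policy document for the export role (hypothetical helper)."""
    bucket_arn = f"arn:aws:s3:::{BUCKET}"
    return {
        "Version": "2012-10-17",
        "Statement": [
            {
                # Bucket-level actions are granted on the bucket ARN itself.
                "Effect": "Allow",
                "Action": ["s3:ListBucket", "s3:GetBucketLocation"],
                "Resource": bucket_arn,
            },
            {
                # Object writes are limited to the org's prefix (an assumption).
                "Effect": "Allow",
                "Action": ["s3:PutObject"],
                "Resource": f"{bucket_arn}/{org}/*",
            },
        ],
    }


def policy_json(org: str) -> str:
    """Render the policy as JSON, ready to paste into the IAM console."""
    return json.dumps(build_export_policy(org), indent=2)
```

Calling `policy_json("example-org")` returns the JSON text for a placeholder org name; substitute the `<org>` value you agreed on with JustAI.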

Proposed Data Schema

In general, we’d like the data to be partitioned by time (hourly) and stored in a reasonable serialization or compression format (gzip, Parquet, etc.).

{
  "session_id": string;       // e.g. a session ID or something that can be used to join multiple events
  "event_timestamp": long;
  "event_name": string;       // e.g. "click", "impression"
  "copy_id": string;          // uuid from the JustAI response
  "user_id": string;
}

This is just a suggestion, and we can adapt to how your data is structured. The key requirement is that we know when a copy_id was shown to a user and when the user engaged with it positively or negatively.
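To make the proposed schema and the hourly partitioning concrete, here is a minimal sketch of one possible export format: gzip-compressed JSON Lines under an hourly key. Several details are assumptions for illustration only: `event_timestamp` is treated as Unix epoch milliseconds in UTC, the `dt=.../hour=...` key layout and the `events.jsonl.gz` file name are hypothetical, and `MetricEvent`, `partition_key`, and `encode_events` are made-up names.

```python
import gzip
import json
from dataclasses import asdict, dataclass
from datetime import datetime, timezone


@dataclass
class MetricEvent:
    session_id: str
    event_timestamp: int  # assumed: Unix epoch milliseconds (UTC)
    event_name: str       # e.g. "click", "impression"
    copy_id: str          # uuid from the JustAI response
    user_id: str


def partition_key(org: str, event: MetricEvent) -> str:
    """Build a hypothetical hourly-partitioned object key for an event."""
    ts = datetime.fromtimestamp(event.event_timestamp / 1000, tz=timezone.utc)
    return f"{org}/dt={ts:%Y-%m-%d}/hour={ts:%H}/events.jsonl.gz"


def encode_events(events: list[MetricEvent]) -> bytes:
    """Serialize events as gzip-compressed JSON Lines (one object per line)."""
    lines = "\n".join(json.dumps(asdict(e)) for e in events)
    return gzip.compress(lines.encode("utf-8"))
```

The resulting bytes could then be uploaded under `s3://justwords-metrics-ingest/<org>/...`, for example with boto3's `put_object(Bucket=..., Key=..., Body=...)`. If your pipeline already writes Parquet or another layout, that works too.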

The purpose of the session ID is to ensure that we don’t double-count events when a user is shown a variant and engages with it multiple times. If the data pipeline already guarantees this, the session ID isn’t necessary.
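The deduplication described above can be sketched as follows. This is not necessarily how JustAI implements it; it assumes one plausible rule, namely keeping only the earliest event for each combination of `session_id`, `copy_id`, and `event_name`. The function name `dedupe_events` is hypothetical.

```python
def dedupe_events(events: list[dict]) -> list[dict]:
    """Keep the earliest event per (session_id, copy_id, event_name)."""
    earliest: dict[tuple, dict] = {}
    for event in events:
        key = (event["session_id"], event["copy_id"], event["event_name"])
        kept = earliest.get(key)
        # Replace the stored event only if this one happened strictly earlier.
        if kept is None or event["event_timestamp"] < kept["event_timestamp"]:
            earliest[key] = event
    # Return the surviving events in chronological order.
    return sorted(earliest.values(), key=lambda e: e["event_timestamp"])
```

With this rule, a user who clicks the same variant three times in one session contributes a single click, while the impression that preceded it is kept as a separate event.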

The purpose of the user ID is to join against other custom events and to measure, for example, conversions per user instead of conversions per impression. The user ID is optional; whether to include it depends on how you want to measure success.
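The difference between the two metrics can be shown with a small sketch. It assumes events shaped like the proposed schema, treats `"impression"` as the exposure event and `"click"` as the positive engagement, and uses a hypothetical function name, `conversion_rates`. Which events count as conversions is up to you.

```python
def conversion_rates(events: list[dict], positive_event: str = "click") -> dict:
    """Compare conversions per impression with conversions per user."""
    impressions = [e for e in events if e["event_name"] == "impression"]
    conversions = [e for e in events if e["event_name"] == positive_event]

    # Per user: a user counts once, no matter how many times they were shown
    # a variant or engaged with it.
    users_shown = {e["user_id"] for e in impressions}
    users_converted = {e["user_id"] for e in conversions} & users_shown

    per_impression = len(conversions) / len(impressions) if impressions else 0.0
    per_user = len(users_converted) / len(users_shown) if users_shown else 0.0
    return {"per_impression": per_impression, "per_user": per_user}
```

For example, if one user sees four impressions and clicks once, and a second user sees one impression and clicks once, the rate per impression is 2/5 while the rate per user is 2/2. To compare variants, filter the events by `copy_id` before calling the function.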