The Scala Stream Collector allows near-real-time processing (Enrichment, Storage, Analytics) of a Snowplow raw event stream. Snowplow raw events can be sunk to either Amazon Kinesis, Google PubSub, Apache Kafka, NSQ or to
stdout for a custom stream collection process. When setting up the Collector in AWS, you will want to configure it to output data to Kinesis.
For more information on the architecture of the Scala Stream Collector, please see Scala Stream Collector.
Setting up the Scala Stream Collector is a 3 step process: