Getting started on Snowplow Open Source

  1. Home
  2. Docs
  3. Getting started on Snowplow Open Source
  4. Setup Snowplow Open Source on AWS
  5. Setup the Snowplow collector
  6. Overview of the Scala Stream Collector

Overview of the Scala Stream Collector

The Scala Stream Collector allows near-real-time processing (Enrichment, Storage, Analytics) of a Snowplow raw event stream. Snowplow raw events can be sunk to either Amazon Kinesis, Google PubSub, Apache Kafka, NSQ or to stdout for a custom stream collection process. When setting up the Collector in AWS, you will want to configure it to output data to Kinesis.

For more information on the architecture of the Scala Stream Collector, please see Scala Stream Collector.

Setting up the Scala Stream Collector is a 3 step process:

  1. Install the Scala Stream Collector
  2. Configure the Scala Stream Collector
  3. Run the Scala Stream Collector