Getting started on Snowplow Open Source

  1. Home
  2. Docs
  3. Getting started on Snowplow Open Source
  4. Setup Snowplow Open Source on AWS
  5. Setup the Snowplow collector
  6. Run the Scala Stream Collector

Run the Scala Stream Collector

Running jar

The Scala Stream Collector is a jarfile. Simply provide the configuration file as a parameter:

$ java -jar snowplow-stream-collector-[targeted platform]-[version].jar \ --config my.conf

Where targeted platform can be one of:

  • kinesis
  • google-pubsub
  • kafka
  • nsq
  • stdout

This will start the collector as a HTTP service and write serialized Thrift records to either stdout, Kinesis, Google PubSub, Kafka or NSQ, depending on the JAR.

Running docker container

To run the docker container, instead of the jarfile directly, you simply need to run it with your configuration folder mounted:

docker run \ -v $PWD/scala-stream-collector-config:/snowplow/config \ -p 8080:8080 \ snowplow/scala-stream-collector-kinesis:1.0.1 --config /snowplow/config/config.hocon

Healthcheck

Pinging the collector on the /health path should return a 200 OK response:

$ curl http://localhost:8080/health