Questions tagged [snowplow]

Snowplow is a powerful, flexible, scalable web analytics platform.

Features

  • Direct access to all your customer-level and event-level data
  • Data is structured in an easy-to-query format
  • Data stored on your own Amazon Web Services account (in S3 and optionally Redshift), so you can query it using any tool you want
  • Enterprise-strength. Scales to 100Ms of events per day

Open source

  • No vendor lock-in
  • Developed by a growing community passionate about data
47 questions
0
votes
1 answer

Snowplow pipeline Could not commit request due to validation error

I get an error “Could not commit request due to validation error: INVALID_ARGUMENT: Pubsub publish requests are limited to 10MB, rejecting message over to avoid exceeding limit with byte64 request encoding” on my enrich step. I use tutorial by Simo…
0
votes
1 answer

Issues connecting Snowplow with Kafka on Confluent Cloud

I have managed to get an end to end local implementation working within Docker using Web Tracker, Scala Stream Kafka Collector, Stream Enrich, Druid and Metabase. Before building a working staging environment within Kubernetes using Helm, I wanted…
0
votes
1 answer

Getting Configuration file config.hocon does not exist when running Docker image (Snowplow, Scala)

Using an AWS EC2 with Docker installed, I am trying to run a docker image using the following code: docker run \ -v $PWD/scala-stream-collector-config:/snowplow/config \ -p 8080:8080 \ snowplow/scala-stream-collector-kinesis:1.0.1 \ --config…
0
votes
1 answer

schema: does not match the regex pattern

We use an unstruct_event but it always results in this failure. I do not understand what goes wrong here. What schema field is going wrong here, the schema field of the unstruct_event or the schema field of the data object within the unstruct_event?…
Thijs
  • 1,423
  • 15
  • 38
0
votes
1 answer

How to send json into snowplow using iglu webook in python

I have a json full of event data that I need to send into snowplow in python using an iglu webhook but having trouble finding any solid guidance on this. Most of the documentation I've been able to find relates to tracking specific events and…
thefinland
  • 65
  • 2
  • 6
0
votes
1 answer

ValidationException - Member must have length less than or equal to 500

My Snowplow enricher loads data to a Kinesis Data Stream but Kinesis responds with error 400 and the message Member must have length less than or equal to 500 (Service: AmazonKinesis; Status Code: 400; Error Code: ValidationException and I cannot…
Thijs
  • 1,423
  • 15
  • 38
0
votes
1 answer

Snowplow Enrich Setup Issue

collector.conf collector { interface = "0.0.0.0" interface = ${?COLLECTOR_INTERFACE} port = 8181 port = ${?COLLECTOR_PORT} # optional SSL/TLS configuration ssl { enable = false enable = ${?COLLECTOR_SSL} # whether to…
Lal Kumar Rai
  • 327
  • 7
  • 21
0
votes
1 answer

NameError (uninitialized constant SnowplowTracker::SelfDescribingJson): when making a snowplow tracker in ruby

I get the error: NameError (uninitialized constant SnowplowTracker::SelfDescribingJson): When I try to initialize a Self describing JSON in ruby. Heres some example code that fails: require 'snowplow-tracker' require 'contracts' require 'webmock' e…
0
votes
1 answer

Validate entity fields using Snowplow Micro

According to this Snowplow Micro blog post, you can validate: The value of specific fields sent with specific events is as expected The correct contexts / entities are sent with the appropriate events However, it doesn’t look like it is possible…
infojolt
  • 5,244
  • 3
  • 40
  • 82
0
votes
0 answers

Cannot update pod due to dependencies in React Native module

I am trying to update SnowplowTracker of our app. I changed the version of SnowplowTracker on my PodFile: pod 'SnowplowTracker', '~> 1.3' However I am getting this error when I run my pod install: [!] CocoaPods could not find compatible versions…
tiw
  • 535
  • 1
  • 6
  • 22
0
votes
2 answers

How do I apply a docker run with --config option to docker compose

I have the following docker command: sudo docker run --env-file env.list \ -p 80:8080 \ -v $PWD/snowplow/config:/snowplow/config \ snowplow/scala-stream-collector-kinesis:1.0.0 \ --config /snowplow/config/config.hocon I'm trying to move…
Hans
  • 2,800
  • 3
  • 28
  • 40
0
votes
1 answer

snowplow java script tracker setup : collector URI

I'm trying to setup snowplow tracker. I'm facing the following issues 1.The healthcheck test is not working(In scala stream collector -running step) error- connection refused, 404 error 2.Setting up tracker Below is the code given in GitHub for…
kuzhal
  • 1
0
votes
1 answer

How do you run snowplow-bigquery-loader?

Where do you find/make/use the command: ./snowplow-bigquery-mutator \ listen # Can be "init" to create empty table --config $CONFIG \ --resolver $RESOLVER \ Given by the snowplow-bigquery-loader documentation.
jStaff
  • 650
  • 1
  • 9
  • 25
0
votes
1 answer

How do you associate a schema with a payload?

For context: I am setting up a PubSub Emitter for snowplow. (For other readers PubSub is a simple queue on Google Cloud Platforms that takes in messages which are an array as input). ['data' => 'Name', 'attributes' => 'key pair values of whatever…
jStaff
  • 650
  • 1
  • 9
  • 25
0
votes
1 answer

Spark UDF with Maxmind Geo Data

I'm trying to use the Maxmind snowplow library to pull out geo data on each IP that I have in a dataframe. We are using Spark SQL (spark version 2.1.0) and I created an UDF in the following class: class UdfDefinitions @Inject() extends Serializable…
Nir Ben Yaacov
  • 1,182
  • 2
  • 17
  • 33