snowplow/snowplow

Enterprise-strength web, mobile and event analytics, powered by Hadoop, Kafka, Kinesis, Redshift and Elasticsearch.


On January 8, 2024, Snowplow introduced the Snowplow Limited Use License Agreement, and new versions of our core pipeline technology are now released under this license. No security patches will be made to versions of our software that pre-date January 2024.

If you are currently using the pipeline in production, or in a way that is competitive with Snowplow, these changes affect you. If you wish to use the current version of Snowplow software, please contact us to discuss a plan that works for you.

We value all of our users and remain dedicated to helping our community use Snowplow in the way that best fits their business goals and needs.

Read more about this change here.


Introduction

Welcome to Snowplow, the Customer Context Layer transforming raw behavioral data into real-time customer context for AI agents and advanced analytics. AI runs on models, but wins on context.

Digital-first companies like Strava, Burberry, Supercell, Michaels, 1Password, HelloFresh, Experian, and Secret Escapes rely on Snowplow to collect and process event-level data in real time, delivering it securely to their warehouse, lake, or stream — without the engineering complexity of building data infrastructure.


Why a Customer Context Layer?

Snowplow is the data foundation for an organization's most demanding use cases: advanced analytics, context-aware AI agents, and real-time personalization. From predicting customer behaviors to powering agentic AI systems and detecting fraud in real time, Snowplow provides the high-fidelity behavioral context that makes it all possible.

Key benefits of Snowplow’s Customer Context Layer:

  • Data depth and quality
  • Centralized data governance
  • Real-time operationalization and personalization
  • Privacy and compliance
  • AI- and BI-ready behavioral data

Why Snowplow?

  • “Glass-box” technical architecture capable of processing billions of events per day
  • Over 25 SDKs and webhooks to collect data from web, mobile, server-side, and other sources
  • A unique approach based on schemas and validation ensures your data is as clean as possible
  • Over 15 enrichments to get the most out of your data
  • Stream data to your data warehouse/lakehouse or SaaS destinations of choice
  • Build and manage behavioral tracking plans with Event Studio
  • Power context-aware AI agents and proactive decisioning with Signals

Our documentation is a great place to learn more.

This repository includes each major Snowplow component as a separate Git submodule.


Build with Snowplow

The Snowplow Developer Hub has solution accelerators, architecture guides, demo videos, and other helpful resources to get you started.


Community

Check out our Community for support and updates.

If you spot a bug, please raise an issue in the GitHub repository of the component in question.


Copyright and license

Copyright 2012-2026 Snowplow Analytics Ltd.

Snowplow components are licensed differently depending on their purpose. Read about our different licenses here.
