Beam Pipelines - Streaming Analytics Techniques

This project demonstrates several Apache Beam techniques for streaming analytics.

Running the demo

  1. Create a GCP project
  2. Create a file named terraform.tfvars in the terraform directory with the following content:
    project_id = "<GCP Project Id>"
    
    Additional Terraform variables can be overridden; see variables.tf for details.
  3. Run the following commands:
    export PROJECT_ID=<project-id>
    export GCP_REGION=us-central1
    export BIGQUERY_REGION=us-central1
  4. Create BigQuery tables, Pub/Sub topics and subscriptions, and GCS buckets by running this script:
    source ./setup-env.sh
  5. Start the event generation process:
    ./start-event-generation.sh
  6. Start the event processing pipeline:
    (cd pipeline; ./run-streaming-pipeline.sh)
  7. Optionally, start the pipeline that ingests the findings published as Pub/Sub messages into BigQuery (a sketch for verifying the running jobs follows these steps):
    ./start-findings-to-bigquery-pipeline.sh
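
Once the pipelines are submitted, you can confirm they are running without opening the console. This is a minimal sketch, assuming the gcloud CLI is authenticated against the same project and using the PROJECT_ID and GCP_REGION variables exported above; the exact job names depend on the run scripts and are not assumed here.

    # List active Dataflow jobs in the region used by the demo
    gcloud dataflow jobs list \
      --project="${PROJECT_ID}" \
      --region="${GCP_REGION}" \
      --status=active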

Cleaning up

  1. Shut down the pipelines via the GCP console (TODO: add scripts); a gcloud-based sketch is shown after this list.
  2. Run this command:
    cd terraform; terraform destroy
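
Until dedicated shutdown scripts are added, the pipelines can also be drained from the command line. This is a minimal sketch, assuming the gcloud CLI is configured for the project and region exported earlier; look up the job IDs first, then drain (or cancel) each one.

    # Find the IDs of the running pipelines
    gcloud dataflow jobs list --project="${PROJECT_ID}" --region="${GCP_REGION}" --status=active

    # Drain each job so in-flight elements finish before the job stops
    gcloud dataflow jobs drain <job-id> --project="${PROJECT_ID}" --region="${GCP_REGION}"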

Alternatively, delete the project you created.
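
If you created a dedicated project for the demo, deleting it removes all of its resources at once:

    gcloud projects delete "${PROJECT_ID}"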

Disclaimer

The techniques and code contained here are not supported by Google and are provided as-is (under the Apache License). This repo provides some options you can investigate, evaluate, and employ if you choose to.
