- Node: >= 11.11.0
- NPM: >= 6.7.0
- Docker
docker run -d -p 2181:2181 -p 9092:9092 --env ADVERTISED_HOST=127.0.0.1 --env ADVERTISED_PORT=9092 spotify/kafka
docker ps -a
//find the container id of Kafka and substitute it in below
docker exec -i -t <ID> /bin/bash
--create new topic
/opt/kafka_2.11-0.10.1.0/bin/kafka-topics.sh --create --zookeeper localhost:2181 --replication-factor 1 --partitions 1 --topic test3
docker run -d -p 6379:6379 redis
npm install
node server.js
Run Spark Streaming Datagenerator and Spark Streaming Processor jobs to generate data with classifications These are separate projects
Alternatively, just run the included script to simulate this process:
3a. Install redis-cli
Mac (via brew):
brew install redis
Windows: https://github.com/rgl/redis/downloads
Unix:
wget http://download.redis.io/redis-stable.tar.gz
tar xvzf redis-stable.tar.gz
cd redis-stable
make
3b. Run data generation script (from root of project)
redis-cli < scripts/coords.txt