Setting up Heron Cluster with Apache Aurora Locally

June 18, 2016

In this post we will be looking at how we can setup Heron steam processing engine in Apache Aurora in our local machine. Oh Boy this is going to be a long post :D. I am doing this on Ubuntu 14.04 and these steps should be similar to any Linux machine. Heron supports deployment in Apache Aurora out of the box. Apache Aurora will act as the Scheduler for Heron after the setup is complete. In order to do this first you will have to setup Apache Zookeeper and allow Heron to communicate with it. Here Apache Zookeeper will act as the State Manager of the Heron deployment. if you just want to setup a local cluster without the hassle of installing aurora take a look at my previous blog post - Getting started with Heron stream processing engine in Ubuntu 14.04

Setting Up Apache Aurora Cluster locally

First thing we need to do is to setup Apache Aurora locally. I will try to explain as much of the configurations as i can as we go on. First lets get a Apache Aurora cluster running on our local machine. The steps are extracted from here.

Step 1: Install VirtualBox and Vagrant

Download and install VirtualBox and Vagrant in your machine. If vagrant is successfully installed in your machine the following command should list several common commands for this tool

$ vagrant

Step 2: Clone the Aurora repository

You can get the source repository for Aurora with the following command

$ dgit clone git://git.apache.org/aurora.git

Once the clone is complete cd into the aurora folder

$ cd aurora

Step 3: Starting Local Aurora Cluster

To start the local cluster all you have to do is execute the following command. It will install all the needed dependencies like Apache Mesos and Zookeeper in the VM.

$ vagrant up

Additionally to get rid of some of the warning messages that you get during up command execute the following command

$ vagrant plugin install vagrant-vbguest

You can verify that the Aurora cluster is properly running by opening the following links in your web-browser

If you go into http://192.168.33.7:8081/scheduler you can notice that the name of the default cluster that is setup in aurora is named "devcluster" this will be important to note when submitting typologies from heron.

Installing Heron within VM

Now that we have the Aurora cluster setup we need to install heron within the cluster VM in order to be able to create get our Heron deployment working. Since this is a fresh VM instance you will have to install the basic software such as "unzip" and set the JAVA_HOME path as an environmental variable ( Just need to add this to .bashrc file). After you have the basic stuff working follow the following steps to install Heron in the VM. You can ssh into the VM with the following command

$ vagrant ssh

Step 1.a : Download installation script files

You can download the script files that match to Ubuntu from https://github.com/twitter/heron/releases/tag/0.14.0

For the 0.14.0 release the files you need to download will be the following.

heron-client-install-0.14.0-ubuntu.sh
heron-tools-install-0.14.0-ubuntu.sh

Optionally - You want need the following for the steps in the blog post

heron-api-install-0.14.0-ubuntu.sh
heron-core-0.14.0-ubuntu.tar.gz

Step 1.b: Execute the client and tools shell scripts

$ chmod +x heron-client-install-VERSION-PLATFORM.sh
$ ./heron-client-install-VERSION-PLATFORM.sh --user
Heron client installer
----------------------

Uncompressing......
Heron is now installed!

Make sure you have "/home/vagrant/bin" in your path.

After this you need to add the path "/home/vagrant/bin". You can just execute the following command or add it to the end of .bashrc file ( which is more convenient ).

$ export PATH=$PATH:/home/vagrant/bin

Install the following packages to make sure that you have all the needed dependencies in the VM. You might have to do sudo apt-get update before you execute the following.

$ sudo apt-get install git build-essential automake cmake libtool zip libunwind-setjmp0-dev zlib1g-dev unzip pkg-config -y

Configuring State Manager ( Apache Zookeeper )

Since Heron only uses Apache Zookeeper for coordination the load on the Zookeeper node is minimum. Because of this it is sufficient to use a single Zookeeper node or if you have an Zookeeper instance running for some other task you can simply use that. Since Apache Aurora already uses an Zookeeper instance we can directly use that instance to execute State Manager tasks of Heron. First you need to configure Heron to work with the Zookeeper instance. You can find meanings of each attribute here. Configurations for State manager are located in the directory /home/vagrant/.heron/conf/aurora.

Open the file statemgr.yaml using vim ( or some other text editor you prefer ) and add/edit the file to include the following.

# local state manager class for managing state in a persistent fashion
heron.class.state.manager: com.twitter.heron.statemgr.zookeeper.curator.CuratorStateManager

# local state manager connection string
heron.statemgr.connection.string:  "127.0.0.1:2181"

# path of the root address to store the state in a local file system
heron.statemgr.root.path: "/heronroot"

# create the zookeeper nodes, if they do not exist
heron.statemgr.zookeeper.is.initialize.tree: True

# timeout in ms to wait before considering zookeeper session is dead
heron.statemgr.zookeeper.session.timeout.ms: 30000

# timeout in ms to wait before considering zookeeper connection is dead
heron.statemgr.zookeeper.connection.timeout.ms: 30000

# timeout in ms to wait before considering zookeeper connection is dead
heron.statemgr.zookeeper.retry.count: 10

# duration of time to wait until the next retry
heron.statemgr.zookeeper.retry.interval.ms: 10000

Creating Paths in Zookeeper

Next we need to create some paths within Zookeeper since some of the paths are not created by Heron automatically. So we need to create them manually. Since Aurora installation already installed Zookeeper we can use the Zookeeper cli to create the manual paths.

$ sudo ./usr/share/zookeeper/bin/zkCli.sh

This will connect to the Zookeeper instance running locally. Then execute the following commands from within the client to create paths /heronroot/topologies and /heron/topologies. Later in "Associating new Aurora cluster into Heron UI" we will see that we only need to create /heronroot/topologies but for now lets create both to make sure we don't get any errors when we run things.

create /heronroot null
create /heronroot/topologies null

create /heron null
create /heron/topologies null

Configuring Scheduler ( Apache Aurora )

Next we need to configure Apache Aurora to be used as the Scheduler for our Heron local cluster.
In order to do this we need to edit the scheduler.yaml file that is also located in /home/vagrant/.heron/conf/aurora. Add/Edit the file to include the following. More information regarding parameters can be found here

# scheduler class for distributing the topology for execution
heron.class.scheduler: com.twitter.heron.scheduler.aurora.AuroraScheduler

# launcher class for submitting and launching the topology
heron.class.launcher: com.twitter.heron.scheduler.aurora.AuroraLauncher

# location of the core package
heron.package.core.uri: file:///home/vagrant/.heron/dist/heron-core.tar.gz

# location of java - pick it up from shell environment
heron.directory.sandbox.java.home: /usr/lib/jvm/java-1.8.0-openjdk-amd64/

# Invoke the IScheduler as a library directly
heron.scheduler.is.service: False

Additionally edit the client.yaml file and change the core uri to make it consistant.

# location of the core package
heron.package.core.uri: file:///home/vagrant/.heron/dist/heron-core.tar.gz

Important Step: Change folder name aurora to devcluster

Next we need to change the folder name of /home/vagrant/.heron/conf/aurora to /home/vagrant/.heron/conf/devcluster. This is because the name of our aurora cluster is devcluster as we noted in a previous step. You can do this with the following commands

$ cd /home/vagrant/.heron/conf/
$ mv aurora devcluster

Now we are almost done :).

Submitting Example Topology to Aurora cluster

Now we can submit a topology to the aurora cluster. this can be done with the following command.

$ heron submit devcluster/pulasthi/devel --config-path ~/.heron/conf/ ~/.heron/examples/heron-examples.jar com.twitter.heron.examples.ExclamationTopology ExclamationTopology

Now you should be able to see the topology in the Aurora UI ( http://192.168.33.7:8081/scheduler/pulasthi ) .

Understanding the parameters

I will try to explain some of the important parameters that are used in this command. the first parameter "devcluster/pulasthi/devel" defines cluster, role and env ( env can have values prod | devel | test | staging ). The cluster is the name of the aurora cluster which is devcluster in our case. You can give something like your name for the role name and for env you need to choose from one of the env values.

--config-path points to the config folder. the program will automatically look for a folder with the cluster name. This is why we had to change the name of the aurora conf folder to devcluster.

Now that everything is working we need to perform one last step to be able to see the typologies that we can see in Aurora UI in Heron UI.

Associating new Aurora cluster into Heron UI.

Heron UI uses information that is gets from the heron tracker when displaying the information in the heron UI interface. So in-order to allow the Heron UI to show Aurora cluster information we need to modify configuration of the Heron tracker so that it can identify the Aurora Cluster.

Heron Tracker configurations are located at /home/vagrant/.herontools/conf the configuration file is named heron_tracker.yaml. By default you should see the following in the file

statemgrs:
  -
    type: "file"
    name: "local"
    rootpath: "~/.herondata/repository/state/local"
    tunnelhost: "localhost"
  -
    type: "zookeeper"
    name: "localzk"
    hostport: "localhost:2181"
    rootpath: "/heron"
    tunnelhost: "localhost"

You can see that there already two entries. Before we had to create paths in Zookeeper for /heron/topologiesthis is because the entry named localzk in this file. If we remove this we will not need to create that path in Zookeeper. Not all we have to is to add a new entry for the aurora cluster into this file ( we will comment out localzk ). Then the file would look like below.

statemgrs:
  -
    type: "file"
    name: "local"
    rootpath: "~/.herondata/repository/state/local"
    tunnelhost: "localhost"
  #-
   #type: "zookeeper"
   # name: "localzk"
   # hostport: "localhost:2181"
   # rootpath: "/heron"
   # tunnelhost: "localhost"
  -
    type: "zookeeper"
    name: "devcluster"
    hostport: "localhost:2181"
    rootpath: "/heronroot"
    tunnelhost: "localhost"

Now you can start Heron tracker and then Heron UI, Now you will be able to see the aurora cluster from the Heron UI ( http://192.168.33.7:8889/topologies ) as below

$ heron-tracker
$ heron-ui

Now you should have everything working. I hope this post was helpful. And leave a comment and share your thoughts and if you notice any mistakes in the post please let me know. I will make sure to correct the post.

Comments

saranJune 7, 2020 at 8:22 PM
"It is actually a great and helpful piece of information. I am satisfied that you simply shared this helpful information with us. Please stay us informed like this. Thanks for sharing.
Digital Marketing Training Course in Chennai | Digital Marketing Training Course in Anna Nagar | Digital Marketing Training Course in OMR | Digital Marketing Training Course in Porur | Digital Marketing Training Course in Tambaram | Digital Marketing Training Course in Velachery

"
ReplyDelete
Replies
vivekvedhaJune 20, 2020 at 9:38 PM
Nice blog,I understood the topic very clearly,And want to study more like this.
Digital Marketing Training Course in Chennai | Digital Marketing Training Course in Anna Nagar | Digital Marketing Training Course in OMR | Digital Marketing Training Course in Porur | Digital Marketing Training Course in Tambaram | Digital Marketing Training Course in Velachery

ReplyDelete
Replies
Ramesh SampangiDecember 9, 2021 at 4:20 AM
Data Science Training in Hyderabad program by AI Patasala, where you can learn practical knowledge with real-time experience.
Data Science Training and Placements in Hyderabad
ReplyDelete
Replies
نهالستان ایرانJanuary 27, 2022 at 1:19 AM
خرید نهال بادام
ReplyDelete
Replies
AnonymousMay 1, 2022 at 5:45 PM
mmorpg
İNSTAGRAM TAKİPÇİ SATİN AL
tiktok jeton hilesi
tiktok jeton hilesi
antalya saç ekimi
referans kimliği nedir
İnstagram Takipçi Satın Al
metin2 pvp serverlar
takipçi
ReplyDelete
Replies
AnonymousMay 17, 2022 at 9:33 AM
Tül perde modelleri
sms onay
mobil ödeme bozdurma
NFTNASİLALİNİR.COM
Ankara evden eve nakliyat
Trafik sigortası
dedektör
web sitesi kurma
aşk kitapları
ReplyDelete
Replies
sms onayOctober 24, 2022 at 11:35 PM
Congratulations on your article, it was very helpful and successful. 34c49f87992975e1d6e56bf32beca663
website kurma
website kurma
numara onay
ReplyDelete
Replies
define dedektörüOctober 28, 2022 at 8:30 AM
Thank you for your explanation, very good content. 837081866fb5ceb2650319a3712fc07f
altın dedektörü
ReplyDelete
Replies
korsan taksiDecember 27, 2022 at 12:37 PM
Good content. You write beautiful things.
hacklink
mrbahis
vbet
sportsbet
mrbahis
hacklink
sportsbet
vbet
taksi
ReplyDelete
Replies
sportsbetgiris.netJanuary 6, 2023 at 9:22 AM
This post is on your page i will follow your new content.
mrbahis
casino siteleri
mrbahis giriş
sportsbet
casino siteleri
mrbahis.co
sportsbet giriş
sportsbet
betgaranti.online
ReplyDelete
Replies
periJanuary 25, 2023 at 8:14 AM
Önemli giriş adreslerine buradan ulaşabilirsiniz.
betturkey giriş
betpark giriş
XAY
ReplyDelete
Replies
senolMarch 21, 2023 at 11:07 AM
betmatik
kralbet
betpark
mobil ödeme bahis
tipobet
slot siteleri
kibris bahis siteleri
poker siteleri
bonus veren siteler
NCTNOK
ReplyDelete
Replies
JoergeSeptember 12, 2023 at 1:00 AM
It is real-time, distributed stream processing engine, and setting up a cluster involves configuring Why Online Games
the necessary components to enable real-time data processing and analysis.
ReplyDelete
Replies
AnonymousOctober 2, 2024 at 4:47 AM
شركة كشف تسربات المياه بالدمام GaUHAuf7eI
ReplyDelete
Replies
AnonymousNovember 2, 2024 at 1:39 PM
شركة تسليك مجاري بالقطيف BEY2GHEhuX
ReplyDelete
Replies
AnonymousNovember 14, 2024 at 1:56 AM
شركة تنظيف سجاد بالجبيل aq6eBAnDqW
ReplyDelete
Replies
AnonymousMay 3, 2025 at 6:08 AM
شركة تنظيف كنب بالقطيف
cSGjlbhhwd
ReplyDelete
Replies
AnonymousMay 12, 2025 at 4:29 AM
عزل خزانات الزلفي
JDnq0VRQoo
ReplyDelete
Replies
AnonymousMay 14, 2025 at 2:02 AM
تركيب جبس بورد جازان
kb4hnWA6S2
ReplyDelete
Replies

Add comment

Search This Blog

How To