multi-node glu deployment

classic Classic list List threaded Threaded
13 messages Options
Reply | Threaded
Open this post in threaded view
|

multi-node glu deployment

gergely
hi! i'm trying to build a multi-node glu deployment for evaluation, i'm using the package 1.5.1 version.  I haven't found much documentation on how to build a setup with a remote Zookeeper, so i've been trying to hack to tutorial.sh provided in the package.

I changed the zookeeper hostname in setup-zookeeper.sh  (GLU_ZK_CONNECT_STRING) to my own ZK server. And since then, I've been receiving this expception:

### Setting up keys and agent configuration...
Exception in thread "main" java.util.concurrent.TimeoutException: timeout reached while waiting on the lock: java.lang.Object@df2d38
        at org.linkedin.util.concurrent.ConcurrentUtils.awaitUntil(ConcurrentUtils.java:117)
        at org.linkedin.zookeeper.client.ZKClient.waitForState(ZKClient.java:465)
        at org.linkedin.zookeeper.client.ZKClient.waitForStart(ZKClient.java:450)
        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
        at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
        at java.lang.reflect.Method.invoke(Method.java:616)
        at org.codehaus.groovy.runtime.callsite.PojoMetaMethodSite$PojoCachedMethodSiteNoUnwrapNoCoerce.invoke(PojoMetaMethodSite.java:229)
        at org.codehaus.groovy.runtime.callsite.PojoMetaMethodSite.call(PojoMetaMethodSite.java:52)
        at org.codehaus.groovy.runtime.callsite.CallSiteArray.defaultCall(CallSiteArray.java:40)
        at org.codehaus.groovy.runtime.callsite.AbstractCallSite.call(AbstractCallSite.java:116)
        at org.codehaus.groovy.runtime.callsite.AbstractCallSite.call(AbstractCallSite.java:124)

Could you suggest what else shall i change?
Reply | Threaded
Open this post in threaded view
|

Re: multi-node glu deployment

gergely
First step towards the solution described here: https://github.com/linkedin/glu/issues#issue/13 . Still working on getting the multi-node system together :)
Reply | Threaded
Open this post in threaded view
|

Re: multi-node glu deployment

frenchyan
Administrator
I saw the issue you posted and fixed it locally. It will make it with the next release. After 'fixing' this issue can you let me know what else you want to know or is not working ?

Generally in a multi node/production environment, you setup zookeeper on one node (recommended 3, you should check the recommended setup for ZooKeeper on the ZooKeeper web site itself).

Then it is a matter of configuring it:

* to load the keys (if you use ssl) and assigning the fabric you use the zk.sh -s <zookeeper string> command (or you can use the setup-zookeeper.sh / setup-agent.sh shell script with -z <zookeeper string>)

* when you start the agent you provide its zookeeper: -z <zookeeper string> (there are other ways to do it but this is the easiest)

* when you start the console, you need to 'define' the fabric and one of the parameters is the zookeeper connection string (Admin tab / Create fabric)

I agree there is a lack of documentation. I just created an issue for it: https://github.com/linkedin/glu/issues/issue/14

So in the meantime feel free to ask other questions on the forum :)

Thanks
Yan
Reply | Threaded
Open this post in threaded view
|

Re: multi-node glu deployment

gergely
Hi Yan,

thank you for the quick and very helpful reply!

I've managed to set up 4 agents, I  have a central zookeeper. All the keys are the same (this a pilot setup, no security concerns).

Then've started the console, set up the fabric, as you said. All the agents show up, no problem. I've got one groovy script installed, this appears on the dashboard. But when I click on the name of the script on the dashboard, I get this exception: http://pastebin.com/BLBB129Z

Do you have an idea how to fix this?

Thanks, Gergely

Reply | Threaded
Open this post in threaded view
|

Re: multi-node glu deployment

frenchyan
Administrator
I am looking at the code and cannot really figure out (based on the exception) what is going on. So let me ask you some questions:

* can you copy/paste the link you are actually clicking on which generates this exception (the stack trace does not show this information) ?
* which version are you using (it is at the bottom right of any page) ?
* did you build it yourself or are you using the prepackaged version ?
* are you running the dev version (from the build directly using gradle and/or grails ?) ?
* can you make sure you have a system loaded in the console ? (go to System tab and make sure there is at least one entry in the table (and the currently selected one is light blue colored)).

Thanks
Yan
Reply | Threaded
Open this post in threaded view
|

Re: multi-node glu deployment

Laura
In reply to this post by frenchyan
Hi,

 I am also trying to set up a multi-node glu environment, but with package 4.6.2.

I was following along the production setup procedure. When I did step 5 (http://linkedin.github.io/glu/docs/latest/html/production-setup.html#step-5-prepare-zookeeper), I get the TimeoutException below.

Zookeeper is up and running on three nodes; I verified it with zkCli.sh. I downloaded version 3.4.5 and followed step 2 to set it up. Do I need to change setup-zookeeper.sh before running it ?

Thank you,
Laura

----------------------------------------------------------------------------------------------------
laura@ubuntu-poc-scc-1:/usr/local/thirdparty/org.linkedin.glu.packaging-all-4.6.2$ ./bin/setup-zookeeper.sh -z ubuntu-poc-scc-1:2181 -f fabric1
Exception in thread "main" java.util.concurrent.TimeoutException: timeout reached while waiting on the lock: java.lang.Object@51e3b325
        at org.linkedin.util.concurrent.ConcurrentUtils.awaitUntil(ConcurrentUtils.java:117)
        at org.linkedin.zookeeper.client.ZKClient.waitForState(ZKClient.java:465)
        at org.linkedin.zookeeper.client.ZKClient.waitForStart(ZKClient.java:450)
        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
        at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
        at java.lang.reflect.Method.invoke(Method.java:601)
        at org.codehaus.groovy.runtime.callsite.PojoMetaMethodSite$PojoCachedMethodSiteNoUnwrapNoCoerce.invoke(PojoMetaMethodSite.java:229)
        at org.codehaus.groovy.runtime.callsite.PojoMetaMethodSite.call(PojoMetaMethodSite.java:52)
        at org.codehaus.groovy.runtime.callsite.CallSiteArray.defaultCall(CallSiteArray.java:40)
        at org.codehaus.groovy.runtime.callsite.AbstractCallSite.call(AbstractCallSite.java:116)
        at org.codehaus.groovy.runtime.callsite.AbstractCallSite.call(AbstractCallSite.java:124)
        at org.linkedin.zookeeper.cli.ClientMain.start(ClientMain.groovy:69)
        at org.linkedin.zookeeper.cli.ClientMain$start.call(Unknown Source)
        at org.codehaus.groovy.runtime.callsite.CallSiteArray.defaultCall(CallSiteArray.java:40)
        at org.codehaus.groovy.runtime.callsite.AbstractCallSite.call(AbstractCallSite.java:116)
        at org.codehaus.groovy.runtime.callsite.AbstractCallSite.call(AbstractCallSite.java:120)
        at org.linkedin.zookeeper.cli.ClientMain.mainNoExit(ClientMain.groovy:122)
        at org.linkedin.zookeeper.cli.ClientMain$mainNoExit.callStatic(Unknown Source)
        at org.codehaus.groovy.runtime.callsite.CallSiteArray.defaultCallStatic(CallSiteArray.java:48)
        at org.codehaus.groovy.runtime.callsite.AbstractCallSite.callStatic(AbstractCallSite.java:165)
        at org.codehaus.groovy.runtime.callsite.AbstractCallSite.callStatic(AbstractCallSite.java:173)
        at org.linkedin.zookeeper.cli.ClientMain.main(ClientMain.groovy:139)
Exception in thread "main" java.util.concurrent.TimeoutException: timeout reached while waiting on the lock
: java.lang.Object@51e3b325
        at org.linkedin.util.concurrent.ConcurrentUtils.awaitUntil(ConcurrentUtils.java:117)
        at org.linkedin.zookeeper.client.ZKClient.waitForState(ZKClient.java:465)
        at org.linkedin.zookeeper.client.ZKClient.waitForStart(ZKClient.java:450)
        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
        at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
        at java.lang.reflect.Method.invoke(Method.java:601)
        at org.codehaus.groovy.runtime.callsite.PojoMetaMethodSite$PojoCachedMethodSiteNoUnwrapNoCoerce.invoke(PojoMetaMethodSite.java:229)
        at org.codehaus.groovy.runtime.callsite.PojoMetaMethodSite.call(PojoMetaMethodSite.java:52)
        at org.codehaus.groovy.runtime.callsite.CallSiteArray.defaultCall(CallSiteArray.java:40)
        at org.codehaus.groovy.runtime.callsite.AbstractCallSite.call(AbstractCallSite.java:116)
        at org.codehaus.groovy.runtime.callsite.AbstractCallSite.call(AbstractCallSite.java:124)
        at org.linkedin.zookeeper.cli.ClientMain.start(ClientMain.groovy:69)
        at org.linkedin.zookeeper.cli.ClientMain$start.call(Unknown Source)
        at org.codehaus.groovy.runtime.callsite.CallSiteArray.defaultCall(CallSiteArray.java:40)
        at org.codehaus.groovy.runtime.callsite.AbstractCallSite.call(AbstractCallSite.java:116)
        at org.codehaus.groovy.runtime.callsite.AbstractCallSite.call(AbstractCallSite.java:120)
        at org.linkedin.zookeeper.cli.ClientMain.mainNoExit(ClientMain.groovy:122)
        at org.linkedin.zookeeper.cli.ClientMain$mainNoExit.callStatic(Unknown Source)
        at org.codehaus.groovy.runtime.callsite.CallSiteArray.defaultCallStatic(CallSiteArray.java:48)
        at org.codehaus.groovy.runtime.callsite.AbstractCallSite.callStatic(AbstractCallSite.java:165)
        at org.codehaus.groovy.runtime.callsite.AbstractCallSite.callStatic(AbstractCallSite.java:173)
        at org.linkedin.zookeeper.cli.ClientMain.main(ClientMain.groovy:139)
Exception in thread "main" java.util.concurrent.TimeoutException: timeout reached while waiting on the lock: java.lang.Object@50295269
        at org.linkedin.util.concurrent.ConcurrentUtils.awaitUntil(ConcurrentUtils.java:117)
        at org.linkedin.zookeeper.client.ZKClient.waitForState(ZKClient.java:465)
        at org.linkedin.zookeeper.client.ZKClient.waitForStart(ZKClient.java:450)
        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
        at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
        at java.lang.reflect.Method.invoke(Method.java:601)
        at org.codehaus.groovy.runtime.callsite.PojoMetaMethodSite$PojoCachedMethodSiteNoUnwrapNoCoerce.invoke(PojoMetaMethodSite.java:229)
        at org.codehaus.groovy.runtime.callsite.PojoMetaMethodSite.call(PojoMetaMethodSite.java:52)
        at org.codehaus.groovy.runtime.callsite.CallSiteArray.defaultCall(CallSiteArray.java:40)
        at org.codehaus.groovy.runtime.callsite.AbstractCallSite.call(AbstractCallSite.java:116)
        at org.codehaus.groovy.runtime.callsite.AbstractCallSite.call(AbstractCallSite.java:124)
        at org.linkedin.zookeeper.cli.ClientMain.start(ClientMain.groovy:69)
        at org.linkedin.zookeeper.cli.ClientMain$start.call(Unknown Source)
        at org.codehaus.groovy.runtime.callsite.CallSiteArray.defaultCall(CallSiteArray.java:40)
        at org.codehaus.groovy.runtime.callsite.AbstractCallSite.call(AbstractCallSite.java:116)
        at org.codehaus.groovy.runtime.callsite.AbstractCallSite.call(AbstractCallSite.java:120)
        at org.linkedin.zookeeper.cli.ClientMain.mainNoExit(ClientMain.groovy:122)
        at org.linkedin.zookeeper.cli.ClientMain$mainNoExit.callStatic(Unknown Source)
        at org.codehaus.groovy.runtime.callsite.CallSiteArray.defaultCallStatic(CallSiteArray.java:48)
        at org.codehaus.groovy.runtime.callsite.AbstractCallSite.callStatic(AbstractCallSite.java:165)
        at org.codehaus.groovy.runtime.callsite.AbstractCallSite.callStatic(AbstractCallSite.java:173)
        at org.linkedin.zookeeper.cli.ClientMain.main(ClientMain.groovy:139)
Reply | Threaded
Open this post in threaded view
|

Re: multi-node glu deployment

frenchyan
Administrator
It is rather weird as setup-zookeeper simply uses the zookeeper cli. Can you change the log4j.xml file which is under setup/org.linkedin.zookeeper-cli-<version>/conf and change the level to info or debug for the org.apache.zookeeper logger (you can also change root to debug if that can help troubleshooting).

Yan
Reply | Threaded
Open this post in threaded view
|

Re: multi-node glu deployment

Laura
Thank you for the help, Yan.

After turning on the debug, I found that client was unable to establish connection w/ zookeeper server. That's when I realized I had rebooted the machine for a different reason and forgot to start the zookeeper server again. Once I did, this step is successful.

Thank you,
Laura
Reply | Threaded
Open this post in threaded view
|

Re: multi-node glu deployment

doshihardikn
In reply to this post by gergely
Hello Everyone,

I have just started off with glu setup for the first time. I am facing similar issue while performing production setup instructions at step 5. Here are the environment details.

1. Glu - JAVA_HOME=jdk1.7.0_21
2. Zookeeper (Uses jdk1.6.0_38) - 3 Different Servers - They are able to talk to each other. Verified with zkCli.sh

   ./zkCli.sh -server 172.22.166.228:2181 works fine.

3. VM environment - Created with Oracle Virtual VM - CentOs 6.x.

Getting following error while executing this command.

./bin/setup-zookeeper.sh -z 172.22.166.228:2181 -f glu-fabric

Exception in thread "main" java.util.concurrent.TimeoutException: timeout reached while waiting on the lock: java.lang.Object@12b7140
        at org.linkedin.util.concurrent.ConcurrentUtils.awaitUntil(ConcurrentUtils.java:117)
        at org.linkedin.zookeeper.client.ZKClient.waitForState(ZKClient.java:534)
        at org.linkedin.zookeeper.client.ZKClient.waitForStart(ZKClient.java:519)
        at org.linkedin.zookeeper.client.ZKClient$waitForStart.call(Unknown Source)
        at org.codehaus.groovy.runtime.callsite.CallSiteArray.defaultCall(CallSiteArray.java:45)
        at org.codehaus.groovy.runtime.callsite.AbstractCallSite.call(AbstractCallSite.java:108)
        at org.codehaus.groovy.runtime.callsite.AbstractCallSite.call(AbstractCallSite.java:116)
        at org.linkedin.zookeeper.cli.ClientMain.start(ClientMain.groovy:70)
        at org.linkedin.zookeeper.cli.ClientMain$start.call(Unknown Source)
        at org.codehaus.groovy.runtime.callsite.CallSiteArray.defaultCall(CallSiteArray.java:45)
        at org.codehaus.groovy.runtime.callsite.AbstractCallSite.call(AbstractCallSite.java:108)
        at org.codehaus.groovy.runtime.callsite.AbstractCallSite.call(AbstractCallSite.java:112)
        at org.linkedin.zookeeper.cli.ClientMain.mainNoExit(ClientMain.groovy:124)
        at org.linkedin.zookeeper.cli.ClientMain$mainNoExit.callStatic(Unknown Source)
        at org.codehaus.groovy.runtime.callsite.CallSiteArray.defaultCallStatic(CallSiteArray.java:53)
        at org.codehaus.groovy.runtime.callsite.AbstractCallSite.callStatic(AbstractCallSite.java:157)
        at org.codehaus.groovy.runtime.callsite.AbstractCallSite.callStatic(AbstractCallSite.java:165)
        at org.linkedin.zookeeper.cli.ClientMain.main(ClientMain.groovy:141)

Please help. Thanks
Reply | Threaded
Open this post in threaded view
|

Re: multi-node glu deployment

doshihardikn
Few More Details

Glu Version 5.0.0
Zookeeper Version 3.4.5
Reply | Threaded
Open this post in threaded view
|

Re: multi-node glu deployment

doshihardikn
Few More Details,

I tried with in-built zookeeper-2.0.0 comes with glu-5.0.0. Then enabled echo in zk.sh on second last line. I got following out put. Does this indicate anything?

[root@glu glu-all-5.0.0]# ./bin/setup-zookeeper.sh -z localhost:2181 -f glu-fabric
/ltapps/jdk/1.7/32-bit/jdk1.7.0_21/bin/java -Dlog4j.configuration=file:/ltapps/glu/glu-all-5.0.0/setup/org.linkedin.zookeeper-cli-2.0.0/conf/log4j.xml -classpath /ltapps/jdk/1.7/32-bit/jdk1.7.0_21/lib/tools.jar:lib/ant-1.8.2.jar:lib/ant-launcher-1.8.2.jar:lib/commons-cli-1.2.jar:lib/groovy-all-2.0.7.jar:lib/jackson-annotations-2.1.4.jar:lib/jackson-core-2.1.4.jar:lib/jackson-databind-2.1.4.jar:lib/jline-0.9.94.jar:lib/json-20090211.jar:lib/jul-to-slf4j-1.6.2.jar:lib/junit-3.8.1.jar:lib/log4j-1.2.16.jar:lib/netty-3.2.2.Final.jar:lib/org.linkedin.util-core-2.0.0.jar:lib/org.linkedin.util-groovy-2.0.0.jar:lib/org.linkedin.zookeeper-cli-impl-2.0.0.jar:lib/org.linkedin.zookeeper-impl-2.0.0.jar:lib/slf4j-api-1.6.2.jar:lib/slf4j-log4j12-1.6.2.jar:lib/zookeeper-3.4.5.jar org.linkedin.zookeeper.cli.ClientMain -s localhost:2181 upload -f /ltapps/glu/glu-all-5.0.0/setup/zookeeper-config/config.properties /org/glu/agents/fabrics/glu-fabric/config/config.properties
/ltapps/jdk/1.7/32-bit/jdk1.7.0_21/bin/java -Dlog4j.configuration=file:/ltapps/glu/glu-all-5.0.0/setup/org.linkedin.zookeeper-cli-2.0.0/conf/log4j.xml -classpath /ltapps/jdk/1.7/32-bit/jdk1.7.0_21/lib/tools.jar:lib/ant-1.8.2.jar:lib/ant-launcher-1.8.2.jar:lib/commons-cli-1.2.jar:lib/groovy-all-2.0.7.jar:lib/jackson-annotations-2.1.4.jar:lib/jackson-core-2.1.4.jar:lib/jackson-databind-2.1.4.jar:lib/jline-0.9.94.jar:lib/json-20090211.jar:lib/jul-to-slf4j-1.6.2.jar:lib/junit-3.8.1.jar:lib/log4j-1.2.16.jar:lib/netty-3.2.2.Final.jar:lib/org.linkedin.util-core-2.0.0.jar:lib/org.linkedin.util-groovy-2.0.0.jar:lib/org.linkedin.zookeeper-cli-impl-2.0.0.jar:lib/org.linkedin.zookeeper-impl-2.0.0.jar:lib/slf4j-api-1.6.2.jar:lib/slf4j-log4j12-1.6.2.jar:lib/zookeeper-3.4.5.jar org.linkedin.zookeeper.cli.ClientMain -s localhost:2181 upload -f /ltapps/glu/glu-all-5.0.0/setup/zookeeper-config/agent.keystore /org/glu/agents/fabrics/glu-fabric/config/agent.keystore
Exception in thread "main" java.util.concurrent.TimeoutException: timeout reached while waiting on the lock: java.lang.Object@1bff563
        at org.linkedin.util.concurrent.ConcurrentUtils.awaitUntil(ConcurrentUtils.java:117)
        at org.linkedin.zookeeper.client.ZKClient.waitForState(ZKClient.java:534)
        at org.linkedin.zookeeper.client.ZKClient.waitForStart(ZKClient.java:519)
        at org.linkedin.zookeeper.client.ZKClient$waitForStart.call(Unknown Source)
        at org.codehaus.groovy.runtime.callsite.CallSiteArray.defaultCall(CallSiteArray.java:45)
        at org.codehaus.groovy.runtime.callsite.AbstractCallSite.call(AbstractCallSite.java:108)
        at org.codehaus.groovy.runtime.callsite.AbstractCallSite.call(AbstractCallSite.java:116)
        at org.linkedin.zookeeper.cli.ClientMain.start(ClientMain.groovy:70)
        at org.linkedin.zookeeper.cli.ClientMain$start.call(Unknown Source)
        at org.codehaus.groovy.runtime.callsite.CallSiteArray.defaultCall(CallSiteArray.java:45)
        at org.codehaus.groovy.runtime.callsite.AbstractCallSite.call(AbstractCallSite.java:108)
        at org.codehaus.groovy.runtime.callsite.AbstractCallSite.call(AbstractCallSite.java:112)
        at org.linkedin.zookeeper.cli.ClientMain.mainNoExit(ClientMain.groovy:124)
        at org.linkedin.zookeeper.cli.ClientMain$mainNoExit.callStatic(Unknown Source)
        at org.codehaus.groovy.runtime.callsite.CallSiteArray.defaultCallStatic(CallSiteArray.java:53)
        at org.codehaus.groovy.runtime.callsite.AbstractCallSite.callStatic(AbstractCallSite.java:157)
        at org.codehaus.groovy.runtime.callsite.AbstractCallSite.callStatic(AbstractCallSite.java:165)
        at org.linkedin.zookeeper.cli.ClientMain.main(ClientMain.groovy:141)
/ltapps/jdk/1.7/32-bit/jdk1.7.0_21/bin/java -Dlog4j.configuration=file:/ltapps/glu/glu-all-5.0.0/setup/org.linkedin.zookeeper-cli-2.0.0/conf/log4j.xml -classpath /ltapps/jdk/1.7/32-bit/jdk1.7.0_21/lib/tools.jar:lib/ant-1.8.2.jar:lib/ant-launcher-1.8.2.jar:lib/commons-cli-1.2.jar:lib/groovy-all-2.0.7.jar:lib/jackson-annotations-2.1.4.jar:lib/jackson-core-2.1.4.jar:lib/jackson-databind-2.1.4.jar:lib/jline-0.9.94.jar:lib/json-20090211.jar:lib/jul-to-slf4j-1.6.2.jar:lib/junit-3.8.1.jar:lib/log4j-1.2.16.jar:lib/netty-3.2.2.Final.jar:lib/org.linkedin.util-core-2.0.0.jar:lib/org.linkedin.util-groovy-2.0.0.jar:lib/org.linkedin.zookeeper-cli-impl-2.0.0.jar:lib/org.linkedin.zookeeper-impl-2.0.0.jar:lib/slf4j-api-1.6.2.jar:lib/slf4j-log4j12-1.6.2.jar:lib/zookeeper-3.4.5.jar org.linkedin.zookeeper.cli.ClientMain -s localhost:2181 upload -f /ltapps/glu/glu-all-5.0.0/setup/zookeeper-config/console.truststore /org/glu/agents/fabrics/glu-fabric/config/console.truststore
Exception in thread "main" java.util.concurrent.TimeoutException: timeout reached while waiting on the lock: java.lang.Object@12090ab
        at org.linkedin.util.concurrent.ConcurrentUtils.awaitUntil(ConcurrentUtils.java:117)
        at org.linkedin.zookeeper.client.ZKClient.waitForState(ZKClient.java:534)
        at org.linkedin.zookeeper.client.ZKClient.waitForStart(ZKClient.java:519)
        at org.linkedin.zookeeper.client.ZKClient$waitForStart.call(Unknown Source)
        at org.codehaus.groovy.runtime.callsite.CallSiteArray.defaultCall(CallSiteArray.java:45)
        at org.codehaus.groovy.runtime.callsite.AbstractCallSite.call(AbstractCallSite.java:108)
        at org.codehaus.groovy.runtime.callsite.AbstractCallSite.call(AbstractCallSite.java:116)
        at org.linkedin.zookeeper.cli.ClientMain.start(ClientMain.groovy:70)
        at org.linkedin.zookeeper.cli.ClientMain$start.call(Unknown Source)
        at org.codehaus.groovy.runtime.callsite.CallSiteArray.defaultCall(CallSiteArray.java:45)
        at org.codehaus.groovy.runtime.callsite.AbstractCallSite.call(AbstractCallSite.java:108)
        at org.codehaus.groovy.runtime.callsite.AbstractCallSite.call(AbstractCallSite.java:112)
        at org.linkedin.zookeeper.cli.ClientMain.mainNoExit(ClientMain.groovy:124)
        at org.linkedin.zookeeper.cli.ClientMain$mainNoExit.callStatic(Unknown Source)
        at org.codehaus.groovy.runtime.callsite.CallSiteArray.defaultCallStatic(CallSiteArray.java:53)
        at org.codehaus.groovy.runtime.callsite.AbstractCallSite.callStatic(AbstractCallSite.java:157)
        at org.codehaus.groovy.runtime.callsite.AbstractCallSite.callStatic(AbstractCallSite.java:165)
        at org.linkedin.zookeeper.cli.ClientMain.main(ClientMain.groovy:141)
Reply | Threaded
Open this post in threaded view
|

Re: multi-node glu deployment

sodul
In reply to this post by doshihardikn
You say you have 3 zookeeper instances but only list one in your zookeeper string:
./bin/setup-zookeeper.sh -z 172.22.166.228:2181 -f glu-fabric

You need to have all 3 listed, for example: "-z 172.22.166.228:2181,172.22.166.229:2181,172.22.166.230:2181"

Also make sure that the myid file is unique for each of the zk instances.
Reply | Threaded
Open this post in threaded view
|

Re: multi-node glu deployment

frenchyan
Administrator

When you are running ./bin/zkCli.sh (the one that you say works) are you running it from the same machine that you are running the setup from? Or are you running it directly on the machine itself?

In the config file: setup/org.linkedin.zookeeper-cli-2.0.0/conf/log4j.xml in the following block 

  <logger name="org.apache.zookeeper" additivity="false">
      <level value="error"/>
      <appender-ref ref="Console2Appender"/>
  </logger>

replace error with debug

See what you get...

Yan