JGroups took almost 5-6 min to join all the nodes.

2013-05-14
  • Sreenivasa Reddy

    JGroups Version: 2.12.2.Final

    Description: JGroups took 5-6 minutes to join all the nodes.

    Log snippet:

    09.05.2013 21.33.03:637 431 WARNING {main}JGROUPS_GMS join(10.240.11.160-smsrouter) sent to 10.47.24.44-smsrouter timed out (after 3000 ms), retrying
    09.05.2013 21.33.06:670 435 WARNING {main}JGROUPS_GMS join(10.240.11.160-smsrouter) sent to 10.47.24.44-smsrouter timed out (after 3000 ms), retrying
    09.05.2013 21.33.09:693 439 WARNING {main}JGROUPS_GMS join(10.240.11.160-smsrouter) sent to 10.47.24.44-smsrouter timed out (after 3000 ms), retrying
    09.05.2013 21.33.12:717 443 WARNING {main}JGROUPS_GMS join(10.240.11.160-smsrouter) sent to 10.47.24.44-smsrouter timed out (after 3000 ms), retrying
    09.05.2013 21.33.15:750 447 WARNING {main}JGROUPS_GMS join(10.240.11.160-smsrouter) sent to 10.47.24.44-smsrouter timed out (after 3000 ms), retrying
    .
    .
    .
    .
    .
    09.05.2013 21.38.28:248 880 WARNING {main}JGROUPS_GMS join(10.240.11.160-smsrouter) sent to 10.47.24.44-smsrouter timed out (after 3000 ms), retrying
    09.05.2013 21.38.31:301 884 WARNING {main}JGROUPS_GMS join(10.240.11.160-smsrouter) sent to 10.47.24.44-smsrouter timed out (after 3000 ms), retrying
    09.05.2013 21.38.34:325 888 WARNING {main}JGROUPS_GMS join(10.240.11.160-smsrouter) sent to 10.47.24.44-smsrouter timed out (after 3000 ms), retrying
    .
    .
    09.05.2013 21.38.34:346 891 DEBUG {main}JGROUPS_GMS sending handleJoin(10.240.11.160-smsrouter) to 10.47.24.44-smsrouter
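
To put a number on the delay, the gap between the first and last join timeout in the snippet can be measured with a short script. This is only a sketch: the timestamp layout (`dd.MM.yyyy HH.mm.ss:SSS`) is inferred from the log lines above, and the helper names `parse_ts` and `join_delay_seconds` are made up for illustration, not part of JGroups.

```python
from datetime import datetime

# Timestamp layout inferred from the log lines (assumption): dd.MM.yyyy HH.mm.ss:SSS
LOG_TS_FORMAT = "%d.%m.%Y %H.%M.%S:%f"


def parse_ts(line):
    """Parse the leading timestamp, i.e. the first two whitespace-separated fields."""
    date_part, time_part = line.split()[:2]
    return datetime.strptime(f"{date_part} {time_part}", LOG_TS_FORMAT)


def join_delay_seconds(lines):
    """Return the seconds between the first and last GMS join timeout warning."""
    stamps = [
        parse_ts(line)
        for line in lines
        if "JGROUPS_GMS join(" in line and "timed out" in line
    ]
    if not stamps:
        raise ValueError("no join timeout warnings found")
    return (stamps[-1] - stamps[0]).total_seconds()


# First and last timeout warnings copied from the log snippet above.
sample = [
    "09.05.2013 21.33.03:637 431 WARNING {main}JGROUPS_GMS join(10.240.11.160-smsrouter) "
    "sent to 10.47.24.44-smsrouter timed out (after 3000 ms), retrying",
    "09.05.2013 21.38.34:325 888 WARNING {main}JGROUPS_GMS join(10.240.11.160-smsrouter) "
    "sent to 10.47.24.44-smsrouter timed out (after 3000 ms), retrying",
]

if __name__ == "__main__":
    delay = join_delay_seconds(sample)
    print(f"join retries spanned {delay:.3f} s ({delay / 60:.1f} min)")
```

Run against the full log file, the same function would give the total time spent retrying the join against 10.47.24.44-smsrouter.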

    What is causing JGroups to take so long to join? Can anyone help me identify the cause?

     
  • Sreenivasa Reddy

    The following is the configuration in my jgroupsmanager.xml:
    <tcpconfig>
    <config xmlns="urn:org:jgroups" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="urn:org:jgroups http://www.jgroups.org/schema/JGroups-2.12.xsd">

        <TCP bind_port="7800"
             loopback="false"
             recv_buf_size="20M"
             send_buf_size="640K"
             discard_incompatible_packets="true"
             max_bundle_size="64K"
             max_bundle_timeout="30"
             enable_bundling="false"
             enable_diagnostics="false"
             use_send_queues="true"
             sock_conn_timeout="300"    
             timer_type="new"
             timer.min_threads="4"
             timer.max_threads="10"
             timer.keep_alive_time="3000"
             timer.queue_max_size="500"             
             thread_pool.enabled="true"
             thread_pool.min_threads="1"
             thread_pool.max_threads="10"
             thread_pool.keep_alive_time="5000"
             thread_pool.queue_enabled="false"
             thread_pool.queue_max_size="100"
             thread_pool.rejection_policy="discard"    
             oob_thread_pool.enabled="true"
             oob_thread_pool.min_threads="1"
             oob_thread_pool.max_threads="8"
             oob_thread_pool.keep_alive_time="5000"
             oob_thread_pool.queue_enabled="false"
             oob_thread_pool.queue_max_size="100"
             oob_thread_pool.rejection_policy="discard"/>                             
        <TCPPING timeout="5000"
                 initial_hosts="10.47.24.44[7800],10.47.24.46[7800],10.47.24.53[7800],10.47.24.57[7800],10.240.11.81[7800],10.240.11.83[7800],10.240.11.160[7800],10.240.11.161[7800]"
                 port_range="3"
                 num_initial_members="9"/>
        <MERGE2  min_interval="10000"
                 max_interval="30000"/>
        <FD_SOCK start_port="7830"/>
        <FD timeout="3000" max_tries="3" />
        <VERIFY_SUSPECT timeout="1500"  />
        <BARRIER />
        <pbcast.NAKACK
                       use_mcast_xmit="false" gc_lag="0"
                       retransmit_timeout="300,600,1200,2400,4800"
                       discard_delivered_msgs="true"/>
        <UNICAST timeout="300,600,1200" />
        <pbcast.STABLE stability_delay="1000" desired_avg_gossip="50000"
                       max_bytes="4M"/>
        <pbcast.GMS print_local_addr="true" join_timeout="3000"
                    view_bundling="true"/>
        <UFC max_credits="2M"
             min_threshold="0.4"/>
        <MFC max_credits="2M"
             min_threshold="0.4"/>
        <FRAG2 frag_size="60K"  />
        <pbcast.STREAMING_STATE_TRANSFER/>
        <!-- <pbcast.STATE_TRANSFER/> -->  
    </config>
    

    </tcpconfig>

     
    Last edit: Sreenivasa Reddy 2013-05-15