From: Woodruff, R. J <wo...@co...> - 2004-04-26 16:37:01
Looks like you are unable to query the SM. Is OpenSM up and running?
SA query callback failed status IB_REMOTE_ERROR
> > > --> DsINMG: query SA found no record
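The two lines quoted above are buried deep in a long call trace. As a rough aid, here is a minimal Python sketch that pulls the failure lines out of a saved dapltest log. The function name and the choice of patterns are illustrative assumptions, not part of dapltest or IBAL.

```python
import re

# Patterns chosen by looking at the failure lines in the trace below.
# They are illustrative assumptions, not an official list of error markers.
ERROR_PATTERNS = [
    re.compile(r"!ERROR!"),
    re.compile(r"^-->"),
    re.compile(r"error|fail|cannot", re.IGNORECASE),
]


def find_error_lines(log_text):
    """Return (line_number, text) for each line that looks like a failure."""
    hits = []
    for number, raw in enumerate(log_text.splitlines(), start=1):
        # Drop mail quote prefixes such as "> > > " before matching.
        line = re.sub(r"^(?:>\s*)+", "", raw).strip()
        if any(pattern.search(line) for pattern in ERROR_PATTERNS):
            hits.append((number, line))
    return hits
```

Run against the client output, this should surface the SA query failure first, which is the point where the connection attempt goes wrong.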
-----Original Message-----
From: inf...@li... [mailto:inf...@li...] On Behalf Of moral moral
Sent: Friday, April 23, 2004 12:32 AM
To: inf...@li...
Subject: [Infiniband-general] udapl:al:query_req_cb() !ERROR!: query failed:
I am a Chinese student. I have two questions:
1) I can run ibal and opensm, and it shows "subnet up". I can ping port1 from port1 on the same computer. When I ping another port in the subnet, the computer dies.
2) After starting ibal and opensm, I run a server that waits on score1.ustb.cn. When I run a client on score2.ustb.cn, I get these errors:
> > >
> > > [root@score2 dapltest]# ./bw.sh
> > > Usage: bw.sh [hostname [size [device]]]
> > > [root@score2 dapltest]# ./bw.sh score1.ustb.cn-ib0
> > > UAL: _init: Entered UAL Shell Initialization
> > > cl_open_device: opening device /dev/al
> > > uvp_get_interface() [
> > > mlnx_get_pd_interface() [
> > > mlnx_get_pd_interface() ]
> > > mlnx_get_qp_interface() [
> > > mlnx_get_qp_interface() ]
> > > mlnx_get_cq_interface() [
> > > mlnx_get_cq_interface() ]
> > > mlnx_get_eec_interface() [
> > > mlnx_get_eec_interface() ]
> > > mlnx_get_mrw_interface() [
> > > mlnx_get_mrw_interface() ]
> > > mlnx_get_mcast_interface() [
> > > mlnx_get_mcast_interface() ]
> > > mlnx_get_errh_interface() [
> > > mlnx_get_errh_interface() ]
> > > uvp_get_interface() ]
> > > mlnx_pre_open_ca() [
> > > mlnx_pre_open_ca() ]
> > > mlnx_post_open_ca() [
> > > cl_open_device: opening device /dev/mlx8cd30a0
> > > mlnx_pre_query_ca() [
> > > mlnx_pre_query_ca(): priv_op = 1
> > > mlnx_pre_query_ca() ]
> > > mlnx_post_query_ca() [
> > > mlnx_post_query_ca() ]
> > > mlnx_pre_query_ca() [
> > > mlnx_pre_query_ca(): priv_op = 1
> > > mlnx_pre_query_ca() ]
> > > mlnx_post_query_ca() [
> > > mlnx_post_query_ca(): got ul resource size 48
> > > mlnx_post_query_ca() ]
> > > mlnx_post_open_ca(): comm 27904 buf_size 48
> > > mlnx_post_open_ca() ]
> > > mlnx_pre_query_ca() [
> > > mlnx_pre_query_ca(): priv_op = 0
> > > mlnx_pre_query_ca() ]
> > > mlnx_post_query_ca() [
> > > mlnx_post_query_ca() ]
> > > mlnx_pre_query_ca() [
> > > mlnx_pre_query_ca(): priv_op = 0
> > > mlnx_pre_query_ca() ]
> > > mlnx_post_query_ca() [
> > > mlnx_post_query_ca(): got ul resource size 48
> > > mlnx_post_query_ca() ]
> > > mlnx_pre_query_ca() [
> > > mlnx_pre_query_ca(): priv_op = 0
> > > mlnx_pre_query_ca() ]
> > > mlnx_post_query_ca() [
> > > mlnx_post_query_ca() ]
> > > mlnx_pre_query_ca() [
> > > mlnx_pre_query_ca(): priv_op = 0
> > > mlnx_pre_query_ca() ]
> > > mlnx_post_query_ca() [
> > > mlnx_post_query_ca(): got ul resource size 48
> > > mlnx_post_query_ca() ]
> > > UAL: _init: Initialization completed successfully
> > > -------------------------------------
> > > PerfCmd.server_name : score1.ustb.cn-ib0
> > > PerfCmd.dapl_name : IbalHca0
> > > PerfCmd.mode : POLLING
> > > PerfCmd.num_iterations : 1024
> > > PerfCmd.pipeline_len : 16
> > > PerfCmd.op.transfer_type : RDMA_WRITE
> > > PerfCmd.op.num_segs : 1
> > > PerfCmd.op.seg_size : 65536
> > > DT_cs_Client: Starting Test ...
> > > Server Name: score1.ustb.cn-ib0
> > > Server Net Address: 192.168.67.0
> > > mlnx_pre_query_ca() [
> > > mlnx_pre_query_ca(): priv_op = 0
> > > mlnx_pre_query_ca() ]
> > > mlnx_post_query_ca() [
> > > mlnx_post_query_ca() ]
> > > mlnx_pre_query_ca() [
> > > mlnx_pre_query_ca(): priv_op = 0
> > > mlnx_pre_query_ca() ]
> > > mlnx_post_query_ca() [
> > > mlnx_post_query_ca(): got ul resource size 48
> > > mlnx_post_query_ca() ]
> > > WARNING: <score2-ib0> not registered in DNS, using dummy IP value
> > > mlnx_pre_query_ca() [
> > > mlnx_pre_query_ca(): priv_op = 0
> > > mlnx_pre_query_ca() ]
> > > mlnx_post_query_ca() [
> > > mlnx_post_query_ca(): got ul resource size 48
> > > mlnx_post_query_ca() ]
> > > mlnx_pre_query_ca() [
> > > mlnx_pre_query_ca(): priv_op = 0
> > > mlnx_pre_query_ca() ]
> > > mlnx_post_query_ca() [
> > > mlnx_post_query_ca() ]
> > > mlnx_pre_query_ca() [
> > > mlnx_pre_query_ca(): priv_op = 0
> > > mlnx_pre_query_ca() ]
> > > mlnx_post_query_ca() [
> > > mlnx_post_query_ca(): got ul resource size 48
> > > mlnx_post_query_ca() ]
> > > cl_open_device: opening device /dev/mvdapl8cd30a0
> > > cl_open_dev: error opening /dev/mvdapl8cd30a0 (No such file or directory)
> > > --> DsMI: Init MRDB failed = 0x1
> > > DT_cs_Client: IA IbalHca0 opened
> > > mlnx_pre_allocate_pd() [
> > > mlnx_pre_allocate_pd(): umv_buf->input_size 20, pd_ul_res_sz 16
> > > mlnx_pre_allocate_pd() ]
> > > mlnx_post_allocate_pd() [
> > > mlnx_post_allocate_pd() ]
> > > mlnx_pre_create_cq() [
> > > mlnx_pre_create_cq(): The created cq_size 15 different than *p_size 8
> > > mlnx_pre_create_cq() ]
> > > mlnx_post_create_cq() [
> > > mlnx_post_create_cq(): Newly created CQ cq_idx 0x88
> > > mlnx_post_create_cq() ]
> > > mlnx_enable_cq_notify() [
> > > mlnx_enable_cq_notify() ]
> > > mlnx_pre_create_cq() [
> > > mlnx_pre_create_cq(): The created cq_size 15 different than *p_size 8
> > > mlnx_pre_create_cq() ]
> > > mlnx_post_create_cq() [
> > > mlnx_post_create_cq(): Newly created CQ cq_idx 0x8a
> > > mlnx_post_create_cq() ]
> > > mlnx_enable_cq_notify() [
> > > mlnx_enable_cq_notify() ]
> > > mlnx_pre_create_qp() [
> > > mlnx_pre_create_qp() ]
> > > mlnx_post_create_qp() [
> > > mlnx_post_create_qp(): Newly created QP qp_idx 0x10019
> > > mlnx_post_create_qp() ]
> > > mlnx_pre_modify_qp() [
> > > mlnx_pre_modify_qp() ]
> > > mlnx_post_modify_qp() [
> > > mlnx_post_modify_qp(): Committed to modify QP to state 1
> > > mlnx_post_modify_qp() ]
> > > DT_cs_Client: EP created
> > > mlnx_pre_query_ca() [
> > > mlnx_pre_query_ca(): priv_op = 0
> > > mlnx_pre_query_ca() ]
> > > mlnx_post_query_ca() [
> > > mlnx_post_query_ca(): got ul resource size 48
> > > mlnx_post_query_ca() ]
> > > ***** DAPL Characteristics *****
> > > Provider: IbalHca0 Version 1.0 DAPL 1.1
> > > Adapter: InfiniHost (Tavor) by Mellanox Technolgy Inc. Version 23108.161
> > > Supporting:
> > > 65512 EPs with 65535 DTOs and 4 RDMA/RDs each
> > > 16256 EVDs of up to 131071 entries (default S/R size is 16/16)
> > > IOVs of up to 60 elements
> > > 131056 LMRs (and 262128 RMRs) of up to 0xffffffffffffffff bytes
> > > Maximum MTU 0x80000000 bytes, RDMA 0x80000000 bytes
> > > Maximum Private data size 92 bytes
> > > Local IP address 4.3.2.1
> > > ***** ***** ***** ***** ***** *****
> > > DT_cs_Client: Posting 1 recv buffer
> > > mlnx_post_recv() [
> > > mlnx_post_recv() ]
> > > DT_cs_Client: Connect Endpoint
> > > al:query_req_cb() !ERROR!: query failed: IB_REMOTE_ERROR
> > > --> DiISQC: SA query callback failed status IB_REMOTE_ERROR
> > > --> DsINMG: query SA found no record
> > > --> DsIC: fail to map remote_ia_addr (sa_family 2) to gid
> > > DT_cs_Client: Cannot connect Endpoint DAT_INVALID_PARAMETER
> > > DT_cs_Client: Cleaning Up ...
> > > DT_cs_Client: dat_ep_disconnect (abrupt) error: DAT_INVALID_STATE
> > > mlnx_poll_cq() [
> > > mlnx_poll_cq() ]
> > > mlnx_pre_modify_qp() [
> > > mlnx_pre_modify_qp() ]
> > > mlnx_post_modify_qp() [
> > > mlnx_post_modify_qp(): Committed to modify QP to state 0
> > > mlnx_post_modify_qp() ]
> > > mlnx_pre_destroy_qp() [
> > > mlnx_pre_destroy_qp() ]
> > > mlnx_post_destroy_qp() [
> > > mlnx_post_destroy_qp() ]
> > > mlnx_pre_destroy_cq() [
> > > mlnx_pre_destroy_cq() ]
> > > mlnx_post_destroy_cq() [
> > > mlnx_post_destroy_cq() ]
> > > mlnx_pre_destroy_cq() [
> > > mlnx_pre_destroy_cq() ]
> > > mlnx_post_destroy_cq() [
> > > mlnx_post_destroy_cq() ]
> > > mlnx_pre_deallocate_pd() [
> > > mlnx_pre_deallocate_pd() ]
> > > mlnx_post_deallocate_pd() [
> > > mlnx_post_deallocate_pd() ]
> > > DT_cs_Client: IA IbalHca0 closed
> > > DT_cs_Client: ========== End of Work -- Client Exiting
> > > mlnx_pre_close_ca() [
> > > mlnx_pre_close_ca() ]
> > > mlnx_post_close_ca() [
> > > mlnx_post_close_ca(): PRE block
> > > mlnx_post_close_ca(): comm 27905 buf_size 48
> > > mlnx_post_close_ca(): POST block
> > > mlnx_post_close_ca() ]
> > > al:sync_destroy_obj() !ERROR!: Error waiting for references to be released. Forcing shutdown now. Ref_cnt = 1
> > > al:sync_destroy_obj() !ERROR!: 0x8077200(AL_OBJ_TYPE_H_AL)
> > > dapltest: al_common.c:504: async_destroy_cb: Assertion `!p_obj->ref_cnt' failed.
> > > ./bw.sh: line 22: 2392 Aborted ./dapltest -T P -d -i 1024 -s ${host} -D ${device} -p 16 -m p RW ${size} 1
> > >
> > > I am a beginner with IBA. Please explain in detail. Thanks. wangjue
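The trace also warns that <score2-ib0> is not registered in DNS, and the connect step fails to map remote_ia_addr (sa_family 2, which is AF_INET on Linux) to a GID. Whether name resolution contributes to the failure is not established here. As a quick sanity check, a minimal Python sketch for testing whether a host name resolves to an IPv4 address might look like this. The helper name resolve_ipv4 is hypothetical, and the sketch only exercises the standard resolver, not the SA query itself.

```python
import socket


def resolve_ipv4(hostname):
    """Return the IPv4 address for hostname, or None if it does not resolve."""
    try:
        # gethostbyname uses the system resolver (hosts file, then DNS).
        return socket.gethostbyname(hostname)
    except (OSError, UnicodeError):
        # socket.gaierror is a subclass of OSError; malformed names can
        # raise UnicodeError during encoding.
        return None
```

Calling resolve_ipv4("score1.ustb.cn-ib0") on the client and resolve_ipv4("score2-ib0") on the server would show whether each side can look up the other's IB interface name. A None result only means the resolver has no entry for that name.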
_______________________________________________
Infiniband-general mailing list Inf...@li...
https://lists.sourceforge.net/lists/listinfo/infiniband-general