From: Vladislav B. <vs...@vl...> - 2006-05-22 12:18:52
|
Thanks. Could you try with the attached patch, please? (On top of the latest CVS) Brad Johnson wrote: > I tried it with the latest code from CVS (as of 8:00 AM CDT 05/19/2006). > It goes a while longer before crashing this time, and fails in a > different place. This test again used scst_disk handler. I will next try > using the scst_fileio module and will let you know the results. > Here is the GDB output for this failure: > > Program received signal SIGILL, Illegal instruction. > __scst_process_active_cmd (cmd=0xea340d14, context=<value optimized > out>, > pflags=0xf1488fd4, left_locked=1) > at /root/mid-level/cvs_version/src/scst_targ.c:2494 > 2494 BUG(); > (gdb) bt > #0 __scst_process_active_cmd (cmd=0xea340d14, context=<value optimized > out>, > pflags=0xf1488fd4, left_locked=1) > at /root/mid-level/cvs_version/src/scst_targ.c:2494 > #1 0xf8d88b8c in scst_do_job_active (active_cmd_list=0xf8dab8f0, > pflags=0xf1488fd4, context=268435459) > at /root/mid-level/cvs_version/src/scst_targ.c:54 > #2 0xf8d88f2b in scst_cmd_thread (arg=<value optimized out>) > at /root/mid-level/cvs_version/src/scst_targ.c:2662 > #3 0xc01024d9 in kernel_thread_helper () at > arch/i386/kernel/process.c:298 > > (gdb) print *cmd > $2 = {cmd_list_entry = {next = 0xf8dab900, prev = 0xf8dab900}, > sess = 0xe92800a8, state = 9, sent_to_midlev = 0, ua_ignore = 0, > atomic = 0, non_atomic_only = 1, internal = 0, retry = 0, blocking = > 0, > data_buf_alloced = 0, expected_values_set = 1, processible_env = 1, > cmd_flags = 0, tgtt = 0xf8c39f40, dev = 0xf5b1a340, lun = 0, > tgt_dev = 0xd1b2309c, scsi_req = 0x0, sn = 20668, > search_cmd_list_entry = { > next = 0xe92800d0, prev = 0xe92800d0}, > cdb = "(\000\003\uffff\000?\000\000\b\000\000\000\000\000\000", > cdb_len = 10, > queue_type = SCST_CMD_QUEUE_UNTAGGED, timeout = 15000, retries = 0, > data_direction = DMA_FROM_DEVICE, > expected_data_direction = DMA_FROM_DEVICE, expected_transfer_len = > 4096, > data_len = 4096, scst_cmd_done = 0xf8d83080 <scst_cmd_done_local>, > sgv = 0xecc606c0, bufflen = 4096, buffer = 0xecc606d4, use_sg = 1, > get_sg_buf_entry_num = 0, status = 0 '\0', masked_status = 0 '\0', > msg_status = 0 '\0', host_status = 0, driver_status = 0, > sense_buffer = '\0' <repeats 95 times>, tgt_dev_saved = 0x0, > tgt_resp_flags = 2, resp_data_len = 4096, mgmt_cmd = 0x0, > extra_cmd_list_entry = {next = 0x100100, prev = 0x200200}, cmd_saved = > 0x0, > tag = 18824, tgt_specific = 0xe32b09bc, tgt_dev_specific = 0x0} > (gdb) > > > ...Brad > > > On Thu, 2006-05-18 at 18:38 -0400, Ming Zhang wrote: > >>1) can u try latest code from scst cvs >> >>2) can u try to export it via scst_fileio module? see if oops in same >>place. >> >>ming >> >> >> >> >>On Thu, 2006-05-18 at 17:20 -0500, Brad Johnson wrote: >> >>>The system running scst crashes when doing I/O to target from remote >>>system. >>> >>>Here is my setup: >>>My target system has 2 Intel Xeon processors (3.2 MHz) and 1 GB RAM. >>>It is running Linux 2.6.15.7. >>>It has scst-0.9.4 and qla2x00-target-26-0.9.3.8 installed. >>>It has a Qlogic 2312 HBA connected to a switch. This is my FC target >>>host. (My FC Initiator is another x86 system with a Qlogic HBA also >>>connected to the switch.) >>>For back-end devices it has an LSI FC949X HBA connected to a Hitachi >>>Fibre-channel drive. >>> >>>Here is my start script: >>>-------------------------------------------------------- >>>modprobe -v qla2x00tgt >>>modprobe -v scst_disk >>>echo "add 2:0:3:0 0" >/proc/scsi_tgt/groups/Default/devices >>>echo "1" >/sys/class/scsi_host/host5/target_mode_enabled >>>-------------------------------------------------------- >>> >>>In the script, 2:0:3:0 refers to my Hitachi drive, host5 refers to my >>>Qlogic target-mode port. Everything starts successfully (including >>>scsi_tgt module since it is a dependency of scst_disk). >>> >>>>From my initiator system I see the one drive I have exposed. I >>>successfully partition that drive and do mkfs. At this point everything >>>is still fine. I then mount the file system and copy a big file to it. >>>The copy seems to work fine but at some point shortly after that my >>>target system crashes. There is no oops output to the system log. So I >>>did it again with a remote kgdb attached. Here is the gdb output: >>> >>> >>>Program received signal SIGILL, Illegal instruction. >>>__free_pages (page=0xc190a22c, order=0) at mm/page_alloc.c:1055 >>>1055 if (put_page_testzero(page)) { >>>(gdb) bt >>>#0 __free_pages (page=0xc190a22c, order=0) at mm/page_alloc.c:1055 >>>#1 0xf8d61efd in scst_release_space (cmd=0xf40a4e58) >>> at /root/mid-level/scst-0.9.4/src/scst_lib.c:1430 >>>#2 0xf8d60b2a in scst_free_cmd (cmd=0xf40a4e58, check_retry=1) >>> at /root/mid-level/scst-0.9.4/src/scst_lib.c:956 >>>#3 0xf8d599ee in scst_finish_cmd (cmd=0xf40a4e58) >>> at /root/mid-level/scst-0.9.4/src/scst_targ.c:2212 >>>#4 0xf8d5a7df in __scst_process_active_cmd (cmd=0xf40a4e58, >>> context=<value optimized out>, pflags=0xc046cfb8, >>> left_locked=<value optimized out>) >>> at /root/mid-level/scst-0.9.4/src/scst_targ.c:2461 >>>#5 0xf8d5aa81 in scst_do_job_active (active_cmd_list=0xf8d756d0, >>> pflags=0xc046cfb8, context=268435457) >>> at /root/mid-level/scst-0.9.4/src/scst_targ.c:54 >>>#6 0xf8d5af99 in scst_cmd_tasklet (p=<value optimized out>) >>> at /root/mid-level/scst-0.9.4/src/scst_targ.c:2672 >>>#7 0xc012d905 in tasklet_action (a=<value optimized out>) >>> at kernel/softirq.c:267 >>>#8 0xc012d552 in __do_softirq () at kernel/softirq.c:95 >>>#9 0xc010619e in do_softirq () at arch/i386/kernel/irq.c:187 >>>#10 0xc012d689 in irq_exit () at kernel/softirq.c:169 >>>#11 0xc010604e in do_IRQ (regs=0xc1cf4f48) at arch/i386/kernel/irq.c:110 >>>#12 0xc010499e in common_interrupt () at thread_info.h:91 >>>#13 0xc1cf4000 in ?? () >>>#14 0x00000000 in ?? () >>> >>> >>>I can reproduce this easily every time. Let me know if you want any >>>further information about this. >>> >>>...Brad Johnson >>> >>> >>> >>> >>>------------------------------------------------------- >>>Using Tomcat but need to do more? Need to support web services, security? >>>Get stuff done quickly with pre-integrated technology to make your job easier >>>Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo >>>http://sel.as-us.falkag.net/sel?cmd=lnk&kid=120709&bid=263057&dat=121642 >>>_______________________________________________ >>>Scst-devel mailing list >>>Scs...@li... >>>https://lists.sourceforge.net/lists/listinfo/scst-devel >> >> > > > > ------------------------------------------------------- > Using Tomcat but need to do more? Need to support web services, security? > Get stuff done quickly with pre-integrated technology to make your job easier > Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo > http://sel.as-us.falkag.net/sel?cmd=lnk&kid=120709&bid=263057&dat=121642 > _______________________________________________ > Scst-devel mailing list > Scs...@li... > https://lists.sourceforge.net/lists/listinfo/scst-devel > |