From: Brad J. <bjo...@pr...> - 2006-05-20 16:55:37
|
Vlad, I tried it 3 times and every time it failed in the exact same place. ...Brad On Fri, 2006-05-19 at 21:45 +0400, Vladislav Bolkhovitin wrote: > Brad, > > Thanks for the report. Could you please try several times more to see if > it crashes in the same or different points? > > Vlad > > Brad Johnson wrote: > > I tried it with the latest code from CVS (as of 8:00 AM CDT 05/19/2006). > > It goes a while longer before crashing this time, and fails in a > > different place. This test again used scst_disk handler. I will next try > > using the scst_fileio module and will let you know the results. > > Here is the GDB output for this failure: > > > > Program received signal SIGILL, Illegal instruction. > > __scst_process_active_cmd (cmd=0xea340d14, context=<value optimized > > out>, > > pflags=0xf1488fd4, left_locked=1) > > at /root/mid-level/cvs_version/src/scst_targ.c:2494 > > 2494 BUG(); > > (gdb) bt > > #0 __scst_process_active_cmd (cmd=0xea340d14, context=<value optimized > > out>, > > pflags=0xf1488fd4, left_locked=1) > > at /root/mid-level/cvs_version/src/scst_targ.c:2494 > > #1 0xf8d88b8c in scst_do_job_active (active_cmd_list=0xf8dab8f0, > > pflags=0xf1488fd4, context=268435459) > > at /root/mid-level/cvs_version/src/scst_targ.c:54 > > #2 0xf8d88f2b in scst_cmd_thread (arg=<value optimized out>) > > at /root/mid-level/cvs_version/src/scst_targ.c:2662 > > #3 0xc01024d9 in kernel_thread_helper () at > > arch/i386/kernel/process.c:298 > > > > (gdb) print *cmd > > $2 = {cmd_list_entry = {next = 0xf8dab900, prev = 0xf8dab900}, > > sess = 0xe92800a8, state = 9, sent_to_midlev = 0, ua_ignore = 0, > > atomic = 0, non_atomic_only = 1, internal = 0, retry = 0, blocking = > > 0, > > data_buf_alloced = 0, expected_values_set = 1, processible_env = 1, > > cmd_flags = 0, tgtt = 0xf8c39f40, dev = 0xf5b1a340, lun = 0, > > tgt_dev = 0xd1b2309c, scsi_req = 0x0, sn = 20668, > > search_cmd_list_entry = { > > next = 0xe92800d0, prev = 0xe92800d0}, > > cdb = "(\000\003\uffff\000?\000\000\b\000\000\000\000\000\000", > > cdb_len = 10, > > queue_type = SCST_CMD_QUEUE_UNTAGGED, timeout = 15000, retries = 0, > > data_direction = DMA_FROM_DEVICE, > > expected_data_direction = DMA_FROM_DEVICE, expected_transfer_len = > > 4096, > > data_len = 4096, scst_cmd_done = 0xf8d83080 <scst_cmd_done_local>, > > sgv = 0xecc606c0, bufflen = 4096, buffer = 0xecc606d4, use_sg = 1, > > get_sg_buf_entry_num = 0, status = 0 '\0', masked_status = 0 '\0', > > msg_status = 0 '\0', host_status = 0, driver_status = 0, > > sense_buffer = '\0' <repeats 95 times>, tgt_dev_saved = 0x0, > > tgt_resp_flags = 2, resp_data_len = 4096, mgmt_cmd = 0x0, > > extra_cmd_list_entry = {next = 0x100100, prev = 0x200200}, cmd_saved = > > 0x0, > > tag = 18824, tgt_specific = 0xe32b09bc, tgt_dev_specific = 0x0} > > (gdb) > > > > > > ...Brad > > > > > > On Thu, 2006-05-18 at 18:38 -0400, Ming Zhang wrote: > > > >>1) can u try latest code from scst cvs > >> > >>2) can u try to export it via scst_fileio module? see if oops in same > >>place. > >> > >>ming > >> > >> > >> > >> > >>On Thu, 2006-05-18 at 17:20 -0500, Brad Johnson wrote: > >> > >>>The system running scst crashes when doing I/O to target from remote > >>>system. > >>> > >>>Here is my setup: > >>>My target system has 2 Intel Xeon processors (3.2 MHz) and 1 GB RAM. > >>>It is running Linux 2.6.15.7. > >>>It has scst-0.9.4 and qla2x00-target-26-0.9.3.8 installed. > >>>It has a Qlogic 2312 HBA connected to a switch. This is my FC target > >>>host. (My FC Initiator is another x86 system with a Qlogic HBA also > >>>connected to the switch.) > >>>For back-end devices it has an LSI FC949X HBA connected to a Hitachi > >>>Fibre-channel drive. > >>> > >>>Here is my start script: > >>>-------------------------------------------------------- > >>>modprobe -v qla2x00tgt > >>>modprobe -v scst_disk > >>>echo "add 2:0:3:0 0" >/proc/scsi_tgt/groups/Default/devices > >>>echo "1" >/sys/class/scsi_host/host5/target_mode_enabled > >>>-------------------------------------------------------- > >>> > >>>In the script, 2:0:3:0 refers to my Hitachi drive, host5 refers to my > >>>Qlogic target-mode port. Everything starts successfully (including > >>>scsi_tgt module since it is a dependency of scst_disk). > >>> > >>>>From my initiator system I see the one drive I have exposed. I > >>>successfully partition that drive and do mkfs. At this point everything > >>>is still fine. I then mount the file system and copy a big file to it. > >>>The copy seems to work fine but at some point shortly after that my > >>>target system crashes. There is no oops output to the system log. So I > >>>did it again with a remote kgdb attached. Here is the gdb output: > >>> > >>> > >>>Program received signal SIGILL, Illegal instruction. > >>>__free_pages (page=0xc190a22c, order=0) at mm/page_alloc.c:1055 > >>>1055 if (put_page_testzero(page)) { > >>>(gdb) bt > >>>#0 __free_pages (page=0xc190a22c, order=0) at mm/page_alloc.c:1055 > >>>#1 0xf8d61efd in scst_release_space (cmd=0xf40a4e58) > >>> at /root/mid-level/scst-0.9.4/src/scst_lib.c:1430 > >>>#2 0xf8d60b2a in scst_free_cmd (cmd=0xf40a4e58, check_retry=1) > >>> at /root/mid-level/scst-0.9.4/src/scst_lib.c:956 > >>>#3 0xf8d599ee in scst_finish_cmd (cmd=0xf40a4e58) > >>> at /root/mid-level/scst-0.9.4/src/scst_targ.c:2212 > >>>#4 0xf8d5a7df in __scst_process_active_cmd (cmd=0xf40a4e58, > >>> context=<value optimized out>, pflags=0xc046cfb8, > >>> left_locked=<value optimized out>) > >>> at /root/mid-level/scst-0.9.4/src/scst_targ.c:2461 > >>>#5 0xf8d5aa81 in scst_do_job_active (active_cmd_list=0xf8d756d0, > >>> pflags=0xc046cfb8, context=268435457) > >>> at /root/mid-level/scst-0.9.4/src/scst_targ.c:54 > >>>#6 0xf8d5af99 in scst_cmd_tasklet (p=<value optimized out>) > >>> at /root/mid-level/scst-0.9.4/src/scst_targ.c:2672 > >>>#7 0xc012d905 in tasklet_action (a=<value optimized out>) > >>> at kernel/softirq.c:267 > >>>#8 0xc012d552 in __do_softirq () at kernel/softirq.c:95 > >>>#9 0xc010619e in do_softirq () at arch/i386/kernel/irq.c:187 > >>>#10 0xc012d689 in irq_exit () at kernel/softirq.c:169 > >>>#11 0xc010604e in do_IRQ (regs=0xc1cf4f48) at arch/i386/kernel/irq.c:110 > >>>#12 0xc010499e in common_interrupt () at thread_info.h:91 > >>>#13 0xc1cf4000 in ?? () > >>>#14 0x00000000 in ?? () > >>> > >>> > >>>I can reproduce this easily every time. Let me know if you want any > >>>further information about this. > >>> > >>>...Brad Johnson > >>> > >>> > >>> > >>> > >>>------------------------------------------------------- > >>>Using Tomcat but need to do more? Need to support web services, security? > >>>Get stuff done quickly with pre-integrated technology to make your job easier > >>>Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo > >>>http://sel.as-us.falkag.net/sel?cmd=lnk&kid=120709&bid=263057&dat=121642 > >>>_______________________________________________ > >>>Scst-devel mailing list > >>>Scs...@li... > >>>https://lists.sourceforge.net/lists/listinfo/scst-devel > >> > >> > > > > > > > > ------------------------------------------------------- > > Using Tomcat but need to do more? Need to support web services, security? > > Get stuff done quickly with pre-integrated technology to make your job easier > > Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo > > http://sel.as-us.falkag.net/sel?cmd=lnk&kid=120709&bid=263057&dat=121642 > > _______________________________________________ > > Scst-devel mailing list > > Scs...@li... > > https://lists.sourceforge.net/lists/listinfo/scst-devel > > > > |