From: Salazar De T. M. <sal...@ll...> - 2018-03-18 19:35:31
Hello,

I am running a large problem (1,601,777 elements) on 100 processors, using a DistributedMesh. At some point I call MeshRefinement::refine_and_coarsen_elements() to do AMR, but I get these assertion errors (running in debug mode):

Assertion `min_id == node->processor_id()' failed.
min_id = 11
node->processor_id() = 57

Assertion `max_id == node->processor_id()' failed.
max_id = 57
node->processor_id() = 11

Assertion `max_id == node->processor_id()' failed.
max_id = 57
node->processor_id() = 11

I also get traceout files: traceout_57_85461.txt, traceout_11_104555.txt, and traceout_8_142118.txt. Their contents are similar and look like this:

[New LWP 142203]
[Thread debugging using libthread_db enabled]
Using host libthread_db library "/usr/lib64/libthread_db.so.1".
0x00002aaaba58fe09 in __libc_waitpid (pid=143882, stat_loc=stat_loc@entry=0x7fffffff4a90, options=options@entry=0) at ../sysdeps/unix/sysv/linux/waitpid.c:40
40 int result = INLINE_SYSCALL (wait4, 4, pid, stat_loc, options, NULL);
To enable execution of this file add
add-auto-load-safe-path /usr/tce/packages/gcc/gcc-4.9.3/lib64/libstdc++.so.6.0.20-gdb.py
line to your configuration file "/g/g92/miguel/.gdbinit".
To completely disable this security protection add
set auto-load safe-path /
line to your configuration file "/g/g92/miguel/.gdbinit".
For more information about this security protection see the
"Auto-loading safe path" section in the GDB manual. E.g., run from the shell:
info "(gdb)Auto-loading safe path"
#0 0x00002aaaba58fe09 in __libc_waitpid (pid=143882, stat_loc=stat_loc@entry=0x7fffffff4a90, options=options@entry=0) at ../sysdeps/unix/sysv/linux/waitpid.c:40
#1 0x00002aaaba512cc2 in do_system (line=line@entry=0x1e3ac078 "gdb -p 142118 -batch -ex bt -ex detach 2>/dev/null 1>temp_print_trace.4jiwUH") at ../sysdeps/posix/system.c:148
#2 0x00002aaaba513071 in __libc_system (line=0x1e3ac078 "gdb -p 142118 -batch -ex bt -ex detach 2>/dev/null 1>temp_print_trace.4jiwUH") at ../sysdeps/posix/system.c:189
#3 0x00002aaaad67e17b in (anonymous namespace)::gdb_backtrace (out_stream=...) at src/base/print_trace.C:162
#4 0x00002aaaad6806ab in libMesh::print_trace (out_stream=...) at src/base/print_trace.C:209
#5 0x00002aaaad67f7f4 in libMesh::write_traceout () at src/base/print_trace.C:239
#6 0x00002aaaad6769fb in libMesh::MacroFunctions::report_error (file=0x2320a <Address 0x2320a out of bounds>, line=-46448, date=0x0, time=0xffffffffffffffff <Address 0xffffffffffffffff out of bounds>) at src/base/libmesh_common.C:89
#7 0x00002aaaaebe174e in libMesh::MeshTools::libmesh_assert_parallel_consistent_procids<libMesh::Node> (mesh=...) at src/mesh/mesh_tools.C:1608
#8 0x00002aaaaeba931e in libMesh::MeshTools::correct_node_proc_ids (mesh=...) at src/mesh/mesh_tools.C:1844
#9 0x00002aaaae69a0ce in libMesh::MeshCommunication::make_new_nodes_parallel_consistent (this=0x2320a, mesh=...) at src/mesh/mesh_communication.C:1776
#10 0x00002aaaaea95919 in libMesh::MeshRefinement::_refine_elements (this=0x2320a) at src/mesh/mesh_refinement.C:1601
#11 0x00002aaaaea6a4d1 in libMesh::MeshRefinement::refine_and_coarsen_elements (this=0x2320a) at src/mesh/mesh_refinement.C:578
#12 0x00002aaab9d69dcd in OptiProblem::solve (this=0x7fffffffabd8) at /g/g92/miguel/code/topsm/src/opti_problem.C:370
#13 0x00000000004371b8 in main (argc=4, argv=0x7fffffffb798) at /g/g92/miguel/code/topsm/test/3D_stress_constraint/linear_stress_opti.C:196

Are there other things I can do to debug this?

Thanks,
Miguel
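
For anyone reading the assertion output above, here is a minimal sketch of the kind of check that the messages suggest: each rank compares the owner it has stored for a node against the minimum and maximum owner reported by all ranks holding that node. This is not libMesh code and uses no libMesh or MPI calls; the names LocalView, Mismatch, and find_inconsistent_owners are hypothetical, and the "reduction" is emulated serially over a vector of per-rank maps.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <map>
#include <utility>
#include <vector>

// Hypothetical stand-in for one rank's view of the mesh:
// node id -> processor id that this rank believes owns the node.
using LocalView = std::map<std::size_t, unsigned int>;

// One disagreement between a rank's local belief and the global min/max.
struct Mismatch {
  std::size_t node_id;
  unsigned int rank;         // rank whose local view disagrees
  unsigned int local_owner;  // owner stored on that rank
  unsigned int min_owner;    // minimum owner over all ranks holding the node
  unsigned int max_owner;    // maximum owner over all ranks holding the node
};

// Serial emulation of a parallel min/max reduction followed by a local check.
// views[r] is the (assumed) local view of rank r.
std::vector<Mismatch> find_inconsistent_owners(const std::vector<LocalView>& views)
{
  // Pass 1: per node, min and max of the owner over every rank that has a copy.
  std::map<std::size_t, std::pair<unsigned int, unsigned int>> range;
  for (const LocalView& view : views) {
    for (const auto& entry : view) {
      auto it = range.find(entry.first);
      if (it == range.end()) {
        range.emplace(entry.first, std::make_pair(entry.second, entry.second));
      } else {
        it->second.first = std::min(it->second.first, entry.second);
        it->second.second = std::max(it->second.second, entry.second);
      }
    }
  }

  // Pass 2: a rank is consistent for a node only if min == max == its own value.
  std::vector<Mismatch> mismatches;
  for (std::size_t r = 0; r < views.size(); ++r) {
    for (const auto& entry : views[r]) {
      const std::pair<unsigned int, unsigned int>& mm = range.at(entry.first);
      assert(mm.first <= mm.second);
      if (mm.first != entry.second || mm.second != entry.second) {
        mismatches.push_back(Mismatch{entry.first, static_cast<unsigned int>(r),
                                      entry.second, mm.first, mm.second});
      }
    }
  }
  return mismatches;
}
```

Under this reading, the reported values (min_id = 11 where the local owner is 57, and max_id = 57 where the local owner is 11) would be consistent with two ranks each holding a different owner for the same node, though the traceout files alone do not confirm which node is involved.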