From: BENGAOUER A. DENSAC-DM2S-S. <BENGAOUER@DRNSAC.CEA.FR> - 2004-06-21 08:27:51
|
Thank you for your answer, I began working on these points and : - changed the ksp to cg : -ksp_type cg - added the ksp monitor : -ksp_monitor - increased the mesh size : 30*30*30, I can't do much more with the 1Mb / proc I got if I wand it to=20 run on 1 processor. What I find is : for 1 proc : max time =3D 306.2 s cg iterations # =3D 91 for 2 procs : max time =3D 272.5 s cg iterations # =3D 117 for 4 procs : max time =3D 278.8 s cg iterations # =3D 130 for 8 procs : max time =3D 250.2 s cg iterations # =3D 126 I have problems with the load balance as the ratio max/min is really poor= : for 1 proc : max/min =3D 1 (Yes) for 2 proc : max/min =3D 5.9 for 4 proc : max/min =3D 9.9 for 8 proc : max/min =3D 14.5 So far, I did not see that when using Petsc alone, is there a way to moni= tor the load balance within LibMesh just to check ? Another question, do you have some results (articles,...) on scalability = or large scale // applications with libMesh, maybe it could help me ? =20 Any ideas will be appriciated. Sincerely Alain Here are the results I get (I'm sorry, it is rather long as I give result= s for 1, 2, 4 and 8 procs) *** FOR 1 PROC *** The output (if any) follows: Warning: no access to tty; thus no job control in this shell... stty: tcgetattr: Not a typewriter vous etes sur le HP/SC nickel13 Running /home/bengao/libmesh/examples/ex4/ex4 -d 3 /home/bengao/libmesh/reference_elements/3D/one_hex27.xda toto.ucd -ksp_ty= pe cg -ksp_monitor -log_summary Mesh Information: mesh_dimension()=3D3 spatial_dimension()=3D3 n_nodes()=3D226981 n_elem()=3D27000 n_local_elem()=3D27000 n_active_elem()=3D27000 n_subdomains()=3D1 n_processors()=3D1 processor_id()=3D0 EquationSystems n_systems()=3D1 System "Poisson" Type "Implicit" Variables=3D"u"=20 Finite Element Types=3D"0"=20 Approximation Orders=3D"2"=20 n_dofs()=3D226981 n_local_dofs()=3D226981 n_constrained_dofs()=3D0 n_vectors()=3D1 n_parameters()=3D2 Parameters: "linear solver maximum iterations"=3D5000 "linear solver tolerance"=3D1e-12 =20 -------------------------------------------------------------------------= --- | Time: Mon Jun 21 09:29:00 2004 | OS: OSF1 | HostName: nickel29 | OS Release V5.1 | OS Version: 1885 | Machine: alpha | Username: bengao =20 -------------------------------------------------------------------------= --- =20 -------------------------------------------------------------------------= --- | Matrix Assembly Performance: Alive time=3D24.7958, Active time=3D24.510= 7 =20 -------------------------------------------------------------------------= --- | Event nCalls Total Avg Percent o= f | | Time Time Active Ti= me | |------------------------------------------------------------------------= --- -| | | | BCs 27000 3.7900 0.000140 15.46 | | Fe 27000 1.5352 0.000057 6.26 | | Ke 27000 14.3164 0.000530 58.41 | | elem init 27000 2.6172 0.000097 10.68 | | matrix insertion 27000 2.2520 0.000083 9.19 | =20 -------------------------------------------------------------------------= --- | Totals: 135000 24.5107 100.00 | =20 -------------------------------------------------------------------------= --- 0 KSP Residual norm 4.905564573071e+01=20 1 KSP Residual norm 1.076755676335e+01=20 2 KSP Residual norm 6.459098415445e+00=20 3 KSP Residual norm 5.071241285193e+00=20 4 KSP Residual norm 4.348990847409e+00=20 5 KSP Residual norm 2.840435615142e+00=20 6 KSP Residual norm 3.116246983530e+00=20 7 KSP Residual norm 2.313023449529e+00=20 8 KSP Residual norm 2.422099892855e+00=20 9 KSP Residual norm 1.918367065076e+00=20 10 KSP Residual norm 1.746032186556e+00=20 11 KSP Residual norm 1.491974843661e+00=20 12 KSP Residual norm 1.117069563956e+00=20 13 KSP Residual norm 1.041314213618e+00=20 14 KSP Residual norm 7.359760391376e-01=20 15 KSP Residual norm 6.609753830879e-01=20 16 KSP Residual norm 4.775908128288e-01=20 17 KSP Residual norm 3.560774301139e-01=20 18 KSP Residual norm 1.396852226610e-01=20 19 KSP Residual norm 4.284491002328e-02=20 20 KSP Residual norm 5.377220781814e-03=20 21 KSP Residual norm 1.491406093026e-03=20 22 KSP Residual norm 8.229376635088e-04=20 23 KSP Residual norm 6.623394986488e-04=20 24 KSP Residual norm 4.013747183360e-04=20 25 KSP Residual norm 3.352389608270e-04=20 26 KSP Residual norm 2.334253836263e-04=20 27 KSP Residual norm 2.114319386557e-04=20 28 KSP Residual norm 1.611366287333e-04=20 29 KSP Residual norm 1.512553201542e-04=20 30 KSP Residual norm 1.208801297465e-04=20 31 KSP Residual norm 1.127240049147e-04=20 32 KSP Residual norm 9.032668536030e-05=20 33 KSP Residual norm 8.374648150145e-05=20 34 KSP Residual norm 6.627583220969e-05=20 35 KSP Residual norm 5.526159094527e-05=20 36 KSP Residual norm 5.455559664837e-05=20 37 KSP Residual norm 6.592787208749e-05=20 38 KSP Residual norm 7.816092040534e-05=20 39 KSP Residual norm 4.079412349334e-05=20 40 KSP Residual norm 2.331183191511e-05=20 41 KSP Residual norm 1.892429986577e-05=20 42 KSP Residual norm 1.597271453337e-05=20 43 KSP Residual norm 1.357617756008e-05=20 44 KSP Residual norm 1.229879876594e-05=20 45 KSP Residual norm 9.750207600347e-06=20 46 KSP Residual norm 8.382066865312e-06=20 47 KSP Residual norm 6.776899784613e-06=20 48 KSP Residual norm 5.713607328564e-06=20 49 KSP Residual norm 4.749792327532e-06=20 50 KSP Residual norm 4.017978025701e-06=20 51 KSP Residual norm 3.390972429694e-06=20 52 KSP Residual norm 2.947744878417e-06=20 53 KSP Residual norm 2.559306183057e-06=20 54 KSP Residual norm 2.338460661520e-06=20 55 KSP Residual norm 1.906719235333e-06=20 56 KSP Residual norm 1.484540882490e-06=20 57 KSP Residual norm 9.379893727616e-07=20 58 KSP Residual norm 6.453471268597e-07=20 59 KSP Residual norm 3.601453898114e-07=20 60 KSP Residual norm 2.406265320430e-07=20 61 KSP Residual norm 1.564662494533e-07=20 62 KSP Residual norm 1.135461005643e-07=20 63 KSP Residual norm 8.807342101775e-08=20 64 KSP Residual norm 6.986122777938e-08=20 65 KSP Residual norm 6.031254843942e-08=20 66 KSP Residual norm 4.790988813425e-08=20 67 KSP Residual norm 4.020922954520e-08=20 68 KSP Residual norm 3.037015053828e-08=20 69 KSP Residual norm 2.459446964751e-08=20 70 KSP Residual norm 2.046130210485e-08=20 71 KSP Residual norm 1.778277525542e-08=20 72 KSP Residual norm 1.499708585118e-08=20 73 KSP Residual norm 1.006921785745e-08=20 74 KSP Residual norm 6.482569378516e-09=20 75 KSP Residual norm 3.850672962122e-09=20 76 KSP Residual norm 2.572747769877e-09=20 77 KSP Residual norm 1.812084515219e-09=20 78 KSP Residual norm 1.422450022677e-09=20 79 KSP Residual norm 1.086372842555e-09=20 80 KSP Residual norm 7.871196065284e-10=20 81 KSP Residual norm 6.174838447260e-10=20 82 KSP Residual norm 4.141830963047e-10=20 83 KSP Residual norm 3.349092075053e-10=20 84 KSP Residual norm 2.507537185461e-10=20 85 KSP Residual norm 2.013404627030e-10=20 86 KSP Residual norm 1.559815976852e-10=20 87 KSP Residual norm 1.186422597429e-10=20 88 KSP Residual norm 9.937251095282e-11=20 89 KSP Residual norm 7.073300075327e-11=20 90 KSP Residual norm 4.917609149610e-11=20 91 KSP Residual norm 2.645635455730e-11=20 *************************************************************************= *** ******************************************** *** WIDEN YOUR WINDOW TO 120 CHARACTERS. Use 'enscript -r -fCourier9' to print this document *** *************************************************************************= *** ******************************************** ---------------------------------------------- PETSc Performance Summary: ---------------------------------------------- /home/bengao/libmesh/examples/ex4/ex4 on a alpha named nickel29 with 1 processor, by bengao Mon Jun 21 09:33:35 2004 Using Petsc Version 2.1.6, Patch 0, Released Aug 05, 2003 Max Max/Min Avg Total=20 Time (sec): 3.062e+02 1.00000 3.062e+02 Objects: 0.000e+00 0.00000 0.000e+00 Flops: 6.055e+09 1.00000 6.055e+09 6.055e+09 Flops/sec: 1.977e+07 1.00000 1.977e+07 1.977e+07 MPI Messages: 0.000e+00 0.00000 0.000e+00 0.000e+00 MPI Message Lengths: 0.000e+00 0.00000 0.000e+00 0.000e+00 MPI Reductions: 4.000e+00 1.00000 Flop counting convention: 1 flop =3D 1 real number operation of type (multiply/divide/add/subtract) e.g., VecAXPY() for real vectors of length N = --> 2N flops and VecAXPY() for complex vectors of length N --> 8N flops Summary of Stages: ----- Time ------ ----- Flops ----- --- Messages -= -- -- Message Lengths -- -- Reductions -- Avg %Total Avg %Total counts %Tot= al Avg %Total counts %Total=20 0: Main Stage: 3.0620e+02 100.0% 6.0546e+09 100.0% 0.000e+00 0.= 0% 0.000e+00 0.0% 4.000e+00 100.0%=20 -------------------------------------------------------------------------= --- -------------------------------------------- See the 'Profiling' chapter of the users' manual for details on interpret= ing output. Phase summary info: Count: number of times phase was executed Time and Flops/sec: Max - maximum over all processors Ratio - ratio of maximum to minimum over all processors Mess: number of messages sent Avg. len: average message length Reduct: number of global reductions Global: entire computation Stage: stages of a computation. Set stages with PetscLogStagePush() an= d PetscLogStagePop(). %T - percent time in this phase %F - percent flops in this phase %M - percent messages in this phase %L - percent message length= s in this phase %R - percent reductions in this phase Total Mflop/s: 10e-6 * (sum of flops over all processors)/(max time ov= er all processors) -------------------------------------------------------------------------= --- -------------------------------------------- ########################################################## # # # WARNING!!! # # # # This code was run without the PreLoadBegin() # # macros. To get timing results we always recommend # # preloading. otherwise timing numbers may be # # meaningless. # ########################################################## Event Count Time (sec) Flops/sec --- Global --- --- Stage --- Total Max Ratio Max Ratio Max Ratio Mess Avg len Reduct %T %F %M %L %R %T %F %M %L %R Mflop/s -------------------------------------------------------------------------= --- -------------------------------------------- --- Event Stage 0: Main Stage VecDot 183 1.0 2.4902e-01 1.0 3.34e+08 1.0 0.0e+00 0.0e+00 0.0e+00 0 1 0 0 0 0 1 0 0 0 334 VecNorm 92 1.0 1.3770e-01 1.0 3.03e+08 1.0 0.0e+00 0.0e+00 0.0e+00 0 1 0 0 0 0 1 0 0 0 303 VecCopy 1 1.0 9.7704e-04 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecSet 4 1.0 3.9070e-03 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecAXPY 182 1.0 2.4805e-01 1.0 3.33e+08 1.0 0.0e+00 0.0e+00 0.0e+00 0 1 0 0 0 0 1 0 0 0 333 VecAYPX 91 1.0 1.1133e-01 1.0 3.71e+08 1.0 0.0e+00 0.0e+00 0.0e+00 0 1 0 0 0 0 1 0 0 0 371 VecAssemblyBegin 2 1.0 0.0000e+00 0.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecAssemblyEnd 2 1.0 0.0000e+00 0.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecScatterBegin 1 1.0 4.8831e-03 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatMult 92 1.0 9.0517e+00 1.0 2.82e+08 1.0 0.0e+00 0.0e+00 0.0e+00 3 42 0 0 0 3 42 0 0 0 282 MatSolve 92 1.0 2.6776e+01 1.0 9.54e+07 1.0 0.0e+00 0.0e+00 0.0e+00 9 42 0 0 0 9 42 0 0 0 95 MatLUFactorNum 1 1.0 2.2051e+00 1.0 3.16e+08 1.0 0.0e+00 0.0e+00 0.0e+00 1 12 0 0 0 1 12 0 0 0 316 MatILUFactorSym 1 1.0 4.5312e-01 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 1.0e+00 0 0 0 0 25 0 0 0 0 25 0 MatAssemblyBegin 1 1.0 0.0000e+00 0.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatAssemblyEnd 1 1.0 4.0039e-02 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatGetOrdering 1 1.0 3.9060e-03 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 2.0e+00 0 0 0 0 50 0 0 0 0 50 0 MatZeroEntries 3 1.0 3.8183e-01 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 PCSetUp 1 1.0 2.6621e+00 1.0 2.62e+08 1.0 0.0e+00 0.0e+00 3.0e+00 1 12 0 0 75 1 12 0 0 75 262 PCApply 92 1.0 2.6778e+01 1.0 9.54e+07 1.0 0.0e+00 0.0e+00 0.0e+00 9 42 0 0 0 9 42 0 0 0 95 SLESSetup 1 1.0 2.6641e+00 1.0 2.61e+08 1.0 0.0e+00 0.0e+00 3.0e+00 1 12 0 0 75 1 12 0 0 75 261 SLESSolve 1 1.0 3.6589e+01 1.0 1.46e+08 1.0 0.0e+00 0.0e+00 0.0e+00 12 88 0 0 0 12 88 0 0 0 146 -------------------------------------------------------------------------= --- -------------------------------------------- Memory usage is given in bytes: Object Type Creations Destructions Memory Descendants' Mem. --- Event Stage 0: Main Stage Viewer 1 1 0 0 Index Set 4 4 3632624 0 Map 10 10 2000 0 Vec 6 6 3408 0 Vec Scatter 1 1 1816208 0 Matrix 2 2 343205976 0 Krylov Solver 1 1 200 1704 Preconditioner 1 1 152 1.73419e+08 SLES 1 1 0 1.73421e+08 =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D *** FOR 2 PROCS *** The output (if any) follows: Warning: no access to tty; thus no job control in this shell... stty: tcgetattr: Not a typewriter vous etes sur le HP/SC nickel13 Running /home/bengao/libmesh/examples/ex4/ex4 -d 3 /home/bengao/libmesh/reference_elements/3D/one_hex27.xda toto.ucd -ksp_ty= pe cg -ksp_monitor -log_summary Mesh Information: mesh_dimension()=3D3 spatial_dimension()=3D3 n_nodes()=3D226981 n_elem()=3D27000 n_local_elem()=3D13500 n_active_elem()=3D27000 n_subdomains()=3D1 n_processors()=3D2 processor_id()=3D0 EquationSystems n_systems()=3D1 System "Poisson" Type "Implicit" Variables=3D"u"=20 Finite Element Types=3D"0"=20 Approximation Orders=3D"2"=20 n_dofs()=3D226981 n_local_dofs()=3D115477 n_constrained_dofs()=3D0 n_vectors()=3D1 n_parameters()=3D2 Parameters: "linear solver maximum iterations"=3D5000 "linear solver tolerance"=3D1e-12 =20 -------------------------------------------------------------------------= --- | Processor id: 0 | Num Processors: 2 | Time: Mon Jun 21 09:35:53 2004 | OS: OSF1 | HostName: nickel32 | OS Release V5.1 | OS Version: 1885 | Machine: alpha | Username: bengao =20 -------------------------------------------------------------------------= --- =20 -------------------------------------------------------------------------= --- | Matrix Assembly Performance: Alive time=3D11.3212, Active time=3D11.169= 9 =20 -------------------------------------------------------------------------= --- | Event nCalls Total Avg Percent o= f | | Time Time Active Ti= me | |------------------------------------------------------------------------= --- -| | | | BCs 13500 1.8281 0.000135 16.37 | | Fe 13500 0.7705 0.000057 6.90 | | Ke 13500 6.1064 0.000452 54.67 | | elem init 13500 1.3447 0.000100 12.04 | | matrix insertion 13500 1.1201 0.000083 10.03 | =20 -------------------------------------------------------------------------= --- | Totals: 67500 11.1699 100.00 | =20 -------------------------------------------------------------------------= --- 0 KSP Residual norm 4.905563983049e+01=20 1 KSP Residual norm 1.076753183677e+01=20 2 KSP Residual norm 6.459055312958e+00=20 3 KSP Residual norm 5.071183461797e+00=20 4 KSP Residual norm 4.348924975504e+00=20 5 KSP Residual norm 2.840330529776e+00=20 6 KSP Residual norm 3.116120905915e+00=20 7 KSP Residual norm 2.312887277824e+00=20 8 KSP Residual norm 2.422035479497e+00=20 9 KSP Residual norm 1.919302928005e+00=20 10 KSP Residual norm 1.754210285204e+00=20 11 KSP Residual norm 1.558378799959e+00=20 12 KSP Residual norm 1.516324516281e+00=20 13 KSP Residual norm 1.209689067402e+00=20 14 KSP Residual norm 9.381129063310e-01=20 15 KSP Residual norm 7.617725313902e-01=20 16 KSP Residual norm 6.917665506510e-01=20 17 KSP Residual norm 5.252253730064e-01=20 18 KSP Residual norm 3.354632119771e-01=20 19 KSP Residual norm 1.692356330282e-01=20 20 KSP Residual norm 8.518745573407e-02=20 21 KSP Residual norm 2.847487603667e-02=20 22 KSP Residual norm 1.449054494080e-02=20 23 KSP Residual norm 1.099450289639e-02=20 24 KSP Residual norm 9.540441399873e-03=20 25 KSP Residual norm 8.671198934935e-03=20 26 KSP Residual norm 7.744971397418e-03=20 27 KSP Residual norm 6.174931311750e-03=20 28 KSP Residual norm 5.936376544350e-03=20 29 KSP Residual norm 5.442135090432e-03=20 30 KSP Residual norm 4.536639726178e-03=20 31 KSP Residual norm 3.524948476144e-03=20 32 KSP Residual norm 3.348420053799e-03=20 33 KSP Residual norm 2.923047262975e-03=20 34 KSP Residual norm 2.495755422145e-03=20 35 KSP Residual norm 1.978533362871e-03=20 36 KSP Residual norm 1.879303053445e-03=20 37 KSP Residual norm 1.778394308848e-03=20 38 KSP Residual norm 1.661901010711e-03=20 39 KSP Residual norm 1.362531330784e-03=20 40 KSP Residual norm 1.149004822051e-03=20 41 KSP Residual norm 8.554856191656e-04=20 42 KSP Residual norm 7.106228390228e-04=20 43 KSP Residual norm 6.405644545799e-04=20 44 KSP Residual norm 6.205072057381e-04=20 45 KSP Residual norm 6.117652664525e-04=20 46 KSP Residual norm 6.616008503173e-04=20 47 KSP Residual norm 6.202644194775e-04=20 48 KSP Residual norm 5.655107288808e-04=20 49 KSP Residual norm 4.582346778069e-04=20 50 KSP Residual norm 4.018171014190e-04=20 51 KSP Residual norm 3.125006083531e-04=20 52 KSP Residual norm 2.613808049085e-04=20 53 KSP Residual norm 2.165899033256e-04=20 54 KSP Residual norm 1.800308411710e-04=20 55 KSP Residual norm 1.486147698223e-04=20 56 KSP Residual norm 1.289235810468e-04=20 57 KSP Residual norm 1.027136279169e-04=20 58 KSP Residual norm 7.779333853153e-05=20 59 KSP Residual norm 6.077526169393e-05=20 60 KSP Residual norm 5.442658335399e-05=20 61 KSP Residual norm 4.425921552950e-05=20 62 KSP Residual norm 3.194890831293e-05=20 63 KSP Residual norm 2.278270536844e-05=20 64 KSP Residual norm 1.769507403356e-05=20 65 KSP Residual norm 1.157482698069e-05=20 66 KSP Residual norm 8.486483296752e-06=20 67 KSP Residual norm 6.633264784703e-06=20 68 KSP Residual norm 5.144564472765e-06=20 69 KSP Residual norm 3.879034138343e-06=20 70 KSP Residual norm 3.152006982282e-06=20 71 KSP Residual norm 2.417113333597e-06=20 72 KSP Residual norm 1.998450570455e-06=20 73 KSP Residual norm 1.716709546317e-06=20 74 KSP Residual norm 1.515764489015e-06=20 75 KSP Residual norm 1.258812262804e-06=20 76 KSP Residual norm 1.044945445893e-06=20 77 KSP Residual norm 8.526527347316e-07=20 78 KSP Residual norm 6.769858792235e-07=20 79 KSP Residual norm 4.902820914912e-07=20 80 KSP Residual norm 3.486598817995e-07=20 81 KSP Residual norm 2.329787535200e-07=20 82 KSP Residual norm 1.689495080001e-07=20 83 KSP Residual norm 1.185956959887e-07=20 84 KSP Residual norm 9.214808293293e-08=20 85 KSP Residual norm 7.202518030568e-08=20 86 KSP Residual norm 5.630812010234e-08=20 87 KSP Residual norm 4.254480261538e-08=20 88 KSP Residual norm 3.196608906247e-08=20 89 KSP Residual norm 2.342722296177e-08=20 90 KSP Residual norm 1.689734897046e-08=20 91 KSP Residual norm 1.158561601034e-08=20 92 KSP Residual norm 7.789686628770e-09=20 93 KSP Residual norm 5.244703428322e-09=20 94 KSP Residual norm 3.723134750035e-09=20 95 KSP Residual norm 2.770596388554e-09=20 96 KSP Residual norm 2.542500539877e-09=20 97 KSP Residual norm 2.571050442982e-09=20 98 KSP Residual norm 2.445425893657e-09=20 99 KSP Residual norm 2.049558848116e-09=20 100 KSP Residual norm 1.522442382110e-09=20 101 KSP Residual norm 1.039412579858e-09=20 102 KSP Residual norm 8.028063052804e-10=20 103 KSP Residual norm 6.087022017824e-10=20 104 KSP Residual norm 4.556971426635e-10=20 105 KSP Residual norm 3.529123052752e-10=20 106 KSP Residual norm 3.041524005592e-10=20 107 KSP Residual norm 2.588647965092e-10=20 108 KSP Residual norm 2.069179197158e-10=20 109 KSP Residual norm 1.624897203021e-10=20 110 KSP Residual norm 1.286631259353e-10=20 111 KSP Residual norm 1.006544053271e-10=20 112 KSP Residual norm 9.063110940807e-11=20 113 KSP Residual norm 8.724273143176e-11=20 114 KSP Residual norm 7.732765757024e-11=20 115 KSP Residual norm 6.732705171712e-11=20 116 KSP Residual norm 5.659946715217e-11=20 117 KSP Residual norm 4.787522473240e-11=20 *************************************************************************= *** ******************************************** *** WIDEN YOUR WINDOW TO 120 CHARACTERS. Use 'enscript -r -fCourier9' to print this document *** *************************************************************************= *** ******************************************** ---------------------------------------------- PETSc Performance Summary: ---------------------------------------------- /home/bengao/libmesh/examples/ex4/ex4 on a alpha named nickel32 with 2 processors, by bengao Mon Jun 21 09:40:10 2004 Using Petsc Version 2.1.6, Patch 0, Released Aug 05, 2003 Max Max/Min Avg Total=20 Time (sec): 2.725e+02 5.96231 1.591e+02 Objects: 0.000e+00 0.00000 0.000e+00 Flops: 3.831e+09 1.04612 3.746e+09 7.492e+09 Flops/sec: 8.011e+07 5.69945 4.708e+07 9.416e+07 MPI Messages: 1.230e+02 1.00000 1.230e+02 2.460e+02 MPI Message Lengths: 7.685e+06 1.00000 6.248e+04 1.537e+07 MPI Reductions: 1.880e+02 1.00000 Flop counting convention: 1 flop =3D 1 real number operation of type (multiply/divide/add/subtract) e.g., VecAXPY() for real vectors of length N = --> 2N flops and VecAXPY() for complex vectors of length N --> 8N flops Summary of Stages: ----- Time ------ ----- Flops ----- --- Messages -= -- -- Message Lengths -- -- Reductions -- Avg %Total Avg %Total counts %Tot= al Avg %Total counts %Total=20 0: Main Stage: 1.5913e+02 100.0% 7.4923e+09 100.0% 2.460e+02 100.= 0% 6.248e+04 100.0% 3.760e+02 100.0%=20 -------------------------------------------------------------------------= --- -------------------------------------------- See the 'Profiling' chapter of the users' manual for details on interpret= ing output. Phase summary info: Count: number of times phase was executed Time and Flops/sec: Max - maximum over all processors Ratio - ratio of maximum to minimum over all processors Mess: number of messages sent Avg. len: average message length Reduct: number of global reductions Global: entire computation Stage: stages of a computation. Set stages with PetscLogStagePush() an= d PetscLogStagePop(). %T - percent time in this phase %F - percent flops in this phase %M - percent messages in this phase %L - percent message length= s in this phase %R - percent reductions in this phase Total Mflop/s: 10e-6 * (sum of flops over all processors)/(max time ov= er all processors) -------------------------------------------------------------------------= --- -------------------------------------------- ########################################################## # # # WARNING!!! # # # # This code was run without the PreLoadBegin() # # macros. To get timing results we always recommend # # preloading. otherwise timing numbers may be # # meaningless. # ########################################################## Event Count Time (sec) Flops/sec --- Global --- --- Stage --- Total Max Ratio Max Ratio Max Ratio Mess Avg len Reduct %T %F %M %L %R %T %F %M %L %R Mflop/s -------------------------------------------------------------------------= --- -------------------------------------------- --- Event Stage 0: Main Stage VecDot 235 1.0 2.1680e-01 1.4 3.61e+08 1.5 0.0e+00 0.0e+00 2.4e+02 0 1 0 0 62 0 1 0 0 62 492 VecNorm 118 1.0 2.9765e+0032.4 2.97e+0833.6 0.0e+00 0.0e+00 1.2e+02 1 1 0 0 31 1 1 0 0 31 18 VecCopy 1 1.0 0.0000e+00 0.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecSet 122 1.0 2.7347e-02 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecAXPY 234 1.0 1.5234e-01 1.0 3.55e+08 1.0 0.0e+00 0.0e+00 0.0e+00 0 1 0 0 0 0 1 0 0 0 697 VecAYPX 117 1.0 8.3009e-02 1.1 3.38e+08 1.0 0.0e+00 0.0e+00 0.0e+00 0 1 0 0 0 0 1 0 0 0 640 VecAssemblyBegin 2 1.0 1.9531e-03 0.0 0.00e+00 0.0 2.0e+00 5.2e+04 6.0e+00 0 0 1 1 2 0 0 1 1 2 0 VecAssemblyEnd 2 1.0 0.0000e+00 0.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecScatterBegin 119 1.0 1.9532e-02 2.0 0.00e+00 0.0 2.4e+02 4.8e+04 0.0e+00 0 0 97 74 0 0 0 97 74 0 0 VecScatterEnd 119 1.0 6.3280e-0121.6 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatMult 118 1.0 7.7128e+00 1.0 2.19e+08 1.1 2.4e+02 4.8e+04 0.0e+00 5 44 96 73 0 5 44 96 73 0 425 MatSolve 118 1.0 1.9387e+01 1.2 9.55e+07 1.1 0.0e+00 0.0e+00 0.0e+00 11 43 0 0 0 11 43 0 0 0 166 MatLUFactorNum 1 1.0 1.1084e+00 1.1 3.21e+08 1.0 0.0e+00 0.0e+00 0.0e+00 1 9 0 0 0 1 9 0 0 0 610 MatILUFactorSym 1 1.0 2.6269e-01 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 1.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatAssemblyBegin 1 1.0 6.1523e-0215.8 0.00e+00 0.0 2.0e+00 1.9e+06 2.0e+00 0 0 1 24 1 0 0 1 24 1 0 MatAssemblyEnd 1 1.0 9.1796e-02 1.2 0.00e+00 0.0 2.0e+00 2.4e+04 7.0e+00 0 0 1 0 2 0 0 1 0 2 0 MatGetOrdering 1 1.0 1.9531e-03 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 2.0e+00 0 0 0 0 1 0 0 0 0 1 0 MatZeroEntries 3 1.0 2.4707e-01 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 PCSetUp 2 1.0 1.3779e+00 1.1 2.58e+08 1.0 0.0e+00 0.0e+00 3.0e+00 1 9 0 0 1 1 9 0 0 1 491 PCSetUpOnBlocks 1 1.0 1.3740e+00 1.1 2.58e+08 1.0 0.0e+00 0.0e+00 3.0e+00 1 9 0 0 1 1 9 0 0 1 492 PCApply 118 1.0 1.9424e+01 1.2 9.53e+07 1.1 0.0e+00 0.0e+00 0.0e+00 11 43 0 0 0 11 43 0 0 0 166 SLESSetup 2 1.0 1.3799e+00 1.1 2.57e+08 1.0 0.0e+00 0.0e+00 3.0e+00 1 9 0 0 1 1 9 0 0 1 490 SLESSolve 1 1.0 2.7646e+01 1.0 1.26e+08 1.0 2.4e+02 4.8e+04 3.5e+02 17 91 96 73 94 17 91 96 73 94 247 -------------------------------------------------------------------------= --- -------------------------------------------- Memory usage is given in bytes: Object Type Creations Destructions Memory Descendants' Mem. --- Event Stage 0: Main Stage Viewer 1 1 0 0 Index Set 6 6 1910568 0 Map 15 15 3000 0 Vec 10 10 5549104 0 Vec Scatter 2 2 2680164 0 Matrix 4 4 175370020 8.9259e+07 Krylov Solver 2 2 200 2.77342e+06 Preconditioner 2 2 240 1.74359e+08 SLES 2 2 0 1.77133e+08 =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D *** FOR 4 PROCS *** The output (if any) follows: Warning: no access to tty; thus no job control in this shell... stty: tcgetattr: Not a typewriter vous etes sur le HP/SC nickel13 Running /home/bengao/libmesh/examples/ex4/ex4 -d 3 /home/bengao/libmesh/reference_elements/3D/one_hex27.xda toto.ucd -ksp_ty= pe cg -ksp_monitor -log_summary Mesh Information: mesh_dimension()=3D3 spatial_dimension()=3D3 n_nodes()=3D226981 n_elem()=3D27000 n_local_elem()=3D6750 n_active_elem()=3D27000 n_subdomains()=3D1 n_processors()=3D4 processor_id()=3D0 EquationSystems n_systems()=3D1 System "Poisson" Type "Implicit" Variables=3D"u"=20 Finite Element Types=3D"0"=20 Approximation Orders=3D"2"=20 n_dofs()=3D226981 n_local_dofs()=3D58703 n_constrained_dofs()=3D0 n_vectors()=3D1 n_parameters()=3D2 Parameters: "linear solver maximum iterations"=3D5000 "linear solver tolerance"=3D1e-12 =20 -------------------------------------------------------------------------= --- | Processor id: 0 | Num Processors: 4 | Time: Mon Jun 21 09:33:48 2004 | OS: OSF1 | HostName: nickel37 | OS Release V5.1 | OS Version: 1885 | Machine: alpha | Username: bengao =20 -------------------------------------------------------------------------= --- =20 -------------------------------------------------------------------------= --- | Matrix Assembly Performance: Alive time=3D5.58985, Active time=3D5.5332 =20 -------------------------------------------------------------------------= --- | Event nCalls Total Avg Percent o= f | | Time Time Active Ti= me | |------------------------------------------------------------------------= --- -| | | | BCs 6750 0.8516 0.000126 15.39 | | Fe 6750 0.3662 0.000054 6.62 | | Ke 6750 3.1103 0.000461 56.21 | | elem init 6750 0.6563 0.000097 11.86 | | matrix insertion 6750 0.5488 0.000081 9.92 | =20 -------------------------------------------------------------------------= --- | Totals: 33750 5.5332 100.00 | =20 -------------------------------------------------------------------------= --- 0 KSP Residual norm 4.896131757435e+01=20 1 KSP Residual norm 1.100876561049e+01=20 2 KSP Residual norm 6.775230395898e+00=20 3 KSP Residual norm 5.421227453913e+00=20 4 KSP Residual norm 4.550815454230e+00=20 5 KSP Residual norm 4.050199843076e+00=20 6 KSP Residual norm 3.506321657952e+00=20 7 KSP Residual norm 3.200829140388e+00=20 8 KSP Residual norm 2.479183724922e+00=20 9 KSP Residual norm 2.348913021830e+00=20 10 KSP Residual norm 2.319779191697e+00=20 11 KSP Residual norm 1.675533966477e+00=20 12 KSP Residual norm 1.791625494749e+00=20 13 KSP Residual norm 1.552906296718e+00=20 14 KSP Residual norm 1.201223139419e+00=20 15 KSP Residual norm 1.197478376832e+00=20 16 KSP Residual norm 8.714890479448e-01=20 17 KSP Residual norm 7.975427805646e-01=20 18 KSP Residual norm 6.709401225048e-01=20 19 KSP Residual norm 4.901471423830e-01=20 20 KSP Residual norm 4.289375144473e-01=20 21 KSP Residual norm 2.791850701607e-01=20 22 KSP Residual norm 1.697806204806e-01=20 23 KSP Residual norm 1.051793164352e-01=20 24 KSP Residual norm 4.187513566338e-02=20 25 KSP Residual norm 3.067464416651e-02=20 26 KSP Residual norm 2.384494381924e-02=20 27 KSP Residual norm 2.246595888151e-02=20 28 KSP Residual norm 1.901604886403e-02=20 29 KSP Residual norm 1.342795946096e-02=20 30 KSP Residual norm 1.174143886576e-02=20 31 KSP Residual norm 9.257508189739e-03=20 32 KSP Residual norm 8.185162562390e-03=20 33 KSP Residual norm 7.187054624181e-03=20 34 KSP Residual norm 6.489947948072e-03=20 35 KSP Residual norm 5.518480762652e-03=20 36 KSP Residual norm 4.234010448247e-03=20 37 KSP Residual norm 3.942492806808e-03=20 38 KSP Residual norm 3.219422955615e-03=20 39 KSP Residual norm 3.253554954896e-03=20 40 KSP Residual norm 3.055765204308e-03=20 41 KSP Residual norm 3.085694616956e-03=20 42 KSP Residual norm 3.156205113536e-03=20 43 KSP Residual norm 2.775177033514e-03=20 44 KSP Residual norm 2.688192355483e-03=20 45 KSP Residual norm 2.199192933891e-03=20 46 KSP Residual norm 1.863143690788e-03=20 47 KSP Residual norm 1.585199566382e-03=20 48 KSP Residual norm 1.404881562830e-03=20 49 KSP Residual norm 1.305622725852e-03=20 50 KSP Residual norm 9.783382246650e-04=20 51 KSP Residual norm 8.057787970110e-04=20 52 KSP Residual norm 6.337396221032e-04=20 53 KSP Residual norm 5.465008796248e-04=20 54 KSP Residual norm 4.947900816949e-04=20 55 KSP Residual norm 4.189041337483e-04=20 56 KSP Residual norm 3.832617974778e-04=20 57 KSP Residual norm 3.161868870560e-04=20 58 KSP Residual norm 2.894943819077e-04=20 59 KSP Residual norm 2.641053062872e-04=20 60 KSP Residual norm 2.345103366939e-04=20 61 KSP Residual norm 1.975348820202e-04=20 62 KSP Residual norm 1.484180553685e-04=20 63 KSP Residual norm 1.215078165764e-04=20 64 KSP Residual norm 9.452797632834e-05=20 65 KSP Residual norm 8.000728119770e-05=20 66 KSP Residual norm 6.626243212876e-05=20 67 KSP Residual norm 5.504942181226e-05=20 68 KSP Residual norm 4.727175766164e-05=20 69 KSP Residual norm 3.671923367400e-05=20 70 KSP Residual norm 3.071804686573e-05=20 71 KSP Residual norm 2.223876567022e-05=20 72 KSP Residual norm 1.667941303405e-05=20 73 KSP Residual norm 1.284458582752e-05=20 74 KSP Residual norm 9.936129760532e-06=20 75 KSP Residual norm 8.284563526677e-06=20 76 KSP Residual norm 6.893633796265e-06=20 77 KSP Residual norm 6.676223645034e-06=20 78 KSP Residual norm 5.705912142394e-06=20 79 KSP Residual norm 4.520066338564e-06=20 80 KSP Residual norm 3.472170792283e-06=20 81 KSP Residual norm 2.624572931321e-06=20 82 KSP Residual norm 2.147880411479e-06=20 83 KSP Residual norm 1.624227703320e-06=20 84 KSP Residual norm 1.132711148925e-06=20 85 KSP Residual norm 7.564663568018e-07=20 86 KSP Residual norm 5.367463248234e-07=20 87 KSP Residual norm 4.036600917496e-07=20 88 KSP Residual norm 3.036547049302e-07=20 89 KSP Residual norm 2.311652382121e-07=20 90 KSP Residual norm 1.737486025209e-07=20 91 KSP Residual norm 1.329047997975e-07=20 92 KSP Residual norm 1.090521349204e-07=20 93 KSP Residual norm 8.629799207728e-08=20 94 KSP Residual norm 6.553819207685e-08=20 95 KSP Residual norm 4.795391265983e-08=20 96 KSP Residual norm 3.727327854823e-08=20 97 KSP Residual norm 3.028882414259e-08=20 98 KSP Residual norm 2.514878951710e-08=20 99 KSP Residual norm 2.241151710121e-08=20 100 KSP Residual norm 1.817884616175e-08=20 101 KSP Residual norm 1.539571140708e-08=20 102 KSP Residual norm 1.270554408416e-08=20 103 KSP Residual norm 1.126920822873e-08=20 104 KSP Residual norm 9.707580025244e-09=20 105 KSP Residual norm 8.033834812573e-09=20 106 KSP Residual norm 6.871712402202e-09=20 107 KSP Residual norm 5.641604552360e-09=20 108 KSP Residual norm 5.528332141817e-09=20 109 KSP Residual norm 4.819348789790e-09=20 110 KSP Residual norm 4.260706591839e-09=20 111 KSP Residual norm 3.236889575582e-09=20 112 KSP Residual norm 2.337927583836e-09=20 113 KSP Residual norm 1.853653654390e-09=20 114 KSP Residual norm 1.441274431871e-09=20 115 KSP Residual norm 1.274027549670e-09=20 116 KSP Residual norm 9.988752773127e-10=20 117 KSP Residual norm 7.802728143715e-10=20 118 KSP Residual norm 5.737570915046e-10=20 119 KSP Residual norm 4.388722398501e-10=20 120 KSP Residual norm 3.558986823234e-10=20 121 KSP Residual norm 2.667048116622e-10=20 122 KSP Residual norm 2.164405577302e-10=20 123 KSP Residual norm 1.593296054935e-10=20 124 KSP Residual norm 1.332353895486e-10=20 125 KSP Residual norm 1.071178419360e-10=20 126 KSP Residual norm 9.824754872103e-11=20 127 KSP Residual norm 8.432821739954e-11=20 128 KSP Residual norm 6.683635832017e-11=20 129 KSP Residual norm 5.444399136953e-11=20 130 KSP Residual norm 3.967788055607e-11=20 *************************************************************************= *** ******************************************** *** WIDEN YOUR WINDOW TO 120 CHARACTERS. Use 'enscript -r -fCourier9' to print this document *** *************************************************************************= *** ******************************************** ---------------------------------------------- PETSc Performance Summary: ---------------------------------------------- /home/bengao/libmesh/examples/ex4/ex4 on a alpha named nickel37 with 4 processors, by bengao Mon Jun 21 09:38:17 2004 Using Petsc Version 2.1.6, Patch 0, Released Aug 05, 2003 Max Max/Min Avg Total=20 Time (sec): 2.788e+02 9.89239 9.086e+01 Objects: 0.000e+00 0.00000 0.000e+00 Flops: 2.131e+09 1.09487 2.040e+09 8.161e+09 Flops/sec: 7.245e+07 9.47878 5.537e+07 2.215e+08 MPI Messages: 4.080e+02 1.00000 4.080e+02 1.632e+03 MPI Message Lengths: 9.231e+06 1.07981 2.155e+04 3.517e+07 MPI Reductions: 1.038e+02 1.00000 Flop counting convention: 1 flop =3D 1 real number operation of type (multiply/divide/add/subtract) e.g., VecAXPY() for real vectors of length N = --> 2N flops and VecAXPY() for complex vectors of length N --> 8N flops Summary of Stages: ----- Time ------ ----- Flops ----- --- Messages -= -- -- Message Lengths -- -- Reductions -- Avg %Total Avg %Total counts %Tot= al Avg %Total counts %Total=20 0: Main Stage: 9.0855e+01 100.0% 8.1610e+09 100.0% 1.632e+03 100.= 0% 2.155e+04 100.0% 4.150e+02 100.0%=20 -------------------------------------------------------------------------= --- -------------------------------------------- See the 'Profiling' chapter of the users' manual for details on interpret= ing output. Phase summary info: Count: number of times phase was executed Time and Flops/sec: Max - maximum over all processors Ratio - ratio of maximum to minimum over all processors Mess: number of messages sent Avg. len: average message length Reduct: number of global reductions Global: entire computation Stage: stages of a computation. Set stages with PetscLogStagePush() an= d PetscLogStagePop(). %T - percent time in this phase %F - percent flops in this phase %M - percent messages in this phase %L - percent message length= s in this phase %R - percent reductions in this phase Total Mflop/s: 10e-6 * (sum of flops over all processors)/(max time ov= er all processors) -------------------------------------------------------------------------= --- -------------------------------------------- ########################################################## # # # WARNING!!! # # # # This code was run without the PreLoadBegin() # # macros. To get timing results we always recommend # # preloading. otherwise timing numbers may be # # meaningless. # ########################################################## Event Count Time (sec) Flops/sec --- Global --- --- Stage --- Total Max Ratio Max Ratio Max Ratio Mess Avg len Reduct %T %F %M %L %R %T %F %M %L %R Mflop/s -------------------------------------------------------------------------= --- -------------------------------------------- --- Event Stage 0: Main Stage VecDot 261 1.0 1.4453e-01 1.7 3.41e+08 1.7 0.0e+00 0.0e+00 2.6e+02 0 1 0 0 63 0 1 0 0 63 820 VecNorm 131 1.0 9.2579e-0114.1 2.35e+0815.2 0.0e+00 0.0e+00 1.3e+02 0 1 0 0 32 0 1 0 0 32 64 VecCopy 1 1.0 0.0000e+00 0.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecSet 135 1.0 1.1720e-02 1.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecAXPY 260 1.0 1.1816e-01 1.1 2.61e+08 1.0 0.0e+00 0.0e+00 0.0e+00 0 1 0 0 0 0 1 0 0 0 999 VecAYPX 130 1.0 7.3239e-02 1.1 2.10e+08 1.0 0.0e+00 0.0e+00 0.0e+00 0 1 0 0 0 0 1 0 0 0 806 VecAssemblyBegin 2 1.0 2.9290e-03 3.0 0.00e+00 0.0 1.2e+01 2.0e+04 6.0e+00 0 0 1 1 1 0 0 1 1 1 0 VecAssemblyEnd 2 1.0 0.0000e+00 0.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecScatterBegin 132 1.0 3.2226e-02 1.6 0.00e+00 0.0 1.6e+03 1.6e+04 0.0e+00 0 0 97 74 0 0 0 97 74 0 0 VecScatterEnd 132 1.0 5.1171e-01 3.8 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatMult 131 1.0 5.6289e+00 1.0 1.70e+08 1.1 1.6e+03 1.6e+04 0.0e+00 6 45 96 73 0 6 45 96 73 0 646 MatSolve 131 1.0 1.0762e+01 1.1 8.58e+07 1.0 0.0e+00 0.0e+00 0.0e+00 11 43 0 0 0 11 43 0 0 0 326 MatLUFactorNum 1 1.0 6.4942e-01 1.3 3.10e+08 1.2 0.0e+00 0.0e+00 0.0e+00 1 8 0 0 0 1 8 0 0 0 1011 MatILUFactorSym 1 1.0 1.5625e-01 1.2 0.00e+00 0.0 0.0e+00 0.0e+00 1.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatAssemblyBegin 1 1.0 1.2500e-01 9.1 0.00e+00 0.0 1.2e+01 7.1e+05 2.0e+00 0 0 1 24 0 0 0 1 24 0 0 MatAssemblyEnd 1 1.0 7.4219e-02 1.4 0.00e+00 0.0 1.2e+01 1.0e+04 7.0e+00 0 0 1 0 2 0 0 1 0 2 0 MatGetOrdering 1 1.0 1.9529e-03 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 2.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatZeroEntries 3 1.0 1.3379e-01 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 PCSetUp 2 1.0 8.1152e-01 1.3 2.42e+08 1.2 0.0e+00 0.0e+00 3.0e+00 1 8 0 0 1 1 8 0 0 1 809 PCSetUpOnBlocks 1 1.0 8.0762e-01 1.3 2.44e+08 1.2 0.0e+00 0.0e+00 3.0e+00 1 8 0 0 1 1 8 0 0 1 813 PCApply 131 1.0 1.0777e+01 1.1 8.56e+07 1.0 0.0e+00 0.0e+00 0.0e+00 11 43 0 0 0 11 43 0 0 0 326 SLESSetup 2 1.0 8.1250e-01 1.3 2.42e+08 1.2 0.0e+00 0.0e+00 3.0e+00 1 8 0 0 1 1 8 0 0 1 808 SLESSolve 1 1.0 1.6802e+01 1.0 1.17e+08 1.1 1.6e+03 1.6e+04 3.9e+02 18 92 96 73 94 18 92 96 73 94 447 -------------------------------------------------------------------------= --- -------------------------------------------- Memory usage is given in bytes: Object Type Creations Destructions Memory Descendants' Mem. --- Event Stage 0: Main Stage Viewer 1 1 0 0 Index Set 6 6 1002280 0 Map 15 15 3000 0 Vec 10 10 2823952 0 Vec Scatter 2 2 1546436 0 Matrix 4 4 88821100 4.56315e+07 Krylov Solver 2 2 200 1.41084e+06 Preconditioner 2 2 240 8.76106e+07 SLES 2 2 0 8.90219e+07 =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D *** FOR 8 PROCS *** The output (if any) follows: Warning: no access to tty; thus no job control in this shell... stty: tcgetattr: Not a typewriter vous etes sur le HP/SC nickel41 Running /home/bengao/libmesh/examples/ex4/ex4 -d 3 /home/bengao/libmesh/reference_elements/3D/one_hex27.xda toto.ucd -ksp_ty= pe cg -ksp_monitor -log_summary Mesh Information: mesh_dimension()=3D3 spatial_dimension()=3D3 n_nodes()=3D226981 n_elem()=3D27000 n_local_elem()=3D3375 n_active_elem()=3D27000 n_subdomains()=3D1 n_processors()=3D8 processor_id()=3D0 EquationSystems n_systems()=3D1 System "Poisson" Type "Implicit" Variables=3D"u"=20 Finite Element Types=3D"0"=20 Approximation Orders=3D"2"=20 n_dofs()=3D226981 n_local_dofs()=3D29869 n_constrained_dofs()=3D0 n_vectors()=3D1 n_parameters()=3D2 Parameters: "linear solver maximum iterations"=3D5000 "linear solver tolerance"=3D1e-12 =20 -------------------------------------------------------------------------= --- | Processor id: 0 | Num Processors: 8 | Time: Mon Jun 21 09:37:24 2004 | OS: OSF1 | HostName: nickel29 | OS Release V5.1 | OS Version: 1885 | Machine: alpha | Username: bengao =20 -------------------------------------------------------------------------= --- =20 -------------------------------------------------------------------------= --- | Matrix Assembly Performance: Alive time=3D2.8457, Active time=3D2.80273 =20 -------------------------------------------------------------------------= --- | Event nCalls Total Avg Percent o= f | | Time Time Active Ti= me | |------------------------------------------------------------------------= --- -| | | | BCs 3375 0.4141 0.000123 14.77 | | Fe 3375 0.1953 0.000058 6.97 | | Ke 3375 1.5625 0.000463 55.75 | | elem init 3375 0.3457 0.000102 12.33 | | matrix insertion 3375 0.2852 0.000084 10.17 | =20 -------------------------------------------------------------------------= --- | Totals: 16875 2.8027 100.00 | =20 -------------------------------------------------------------------------= --- 0 KSP Residual norm 4.874234999505e+01=20 1 KSP Residual norm 1.095551642521e+01=20 2 KSP Residual norm 6.799764554040e+00=20 3 KSP Residual norm 5.395222869687e+00=20 4 KSP Residual norm 4.612763217936e+00=20 5 KSP Residual norm 4.067868962130e+00=20 6 KSP Residual norm 3.617296658520e+00=20 7 KSP Residual norm 3.231291377657e+00=20 8 KSP Residual norm 2.747549370604e+00=20 9 KSP Residual norm 2.157406686288e+00=20 10 KSP Residual norm 2.363099264334e+00=20 11 KSP Residual norm 1.828503557092e+00=20 12 KSP Residual norm 1.746369995507e+00=20 13 KSP Residual norm 1.662266998466e+00=20 14 KSP Residual norm 1.224245645949e+00=20 15 KSP Residual norm 1.248650192257e+00=20 16 KSP Residual norm 1.034141960224e+00=20 17 KSP Residual norm 8.135578462730e-01=20 18 KSP Residual norm 7.801248957258e-01=20 19 KSP Residual norm 5.386403826987e-01=20 20 KSP Residual norm 5.049169116763e-01=20 21 KSP Residual norm 3.562901342253e-01=20 22 KSP Residual norm 2.434126892107e-01=20 23 KSP Residual norm 1.604276603594e-01=20 24 KSP Residual norm 7.718052486690e-02=20 25 KSP Residual norm 4.778689962783e-02=20 26 KSP Residual norm 2.927574070871e-02=20 27 KSP Residual norm 2.045970283218e-02=20 28 KSP Residual norm 1.602199325900e-02=20 29 KSP Residual norm 1.198949531551e-02=20 30 KSP Residual norm 9.017555602726e-03=20 31 KSP Residual norm 7.423162134296e-03=20 32 KSP Residual norm 5.417578106363e-03=20 33 KSP Residual norm 4.745602148775e-03=20 34 KSP Residual norm 3.949421718494e-03=20 35 KSP Residual norm 3.413731628081e-03=20 36 KSP Residual norm 2.785529661044e-03=20 37 KSP Residual norm 1.988106619119e-03=20 38 KSP Residual norm 1.647565131489e-03=20 39 KSP Residual norm 1.252852539263e-03=20 40 KSP Residual norm 1.113829153240e-03=20 41 KSP Residual norm 9.452264489887e-04=20 42 KSP Residual norm 8.852205211820e-04=20 43 KSP Residual norm 8.009087237445e-04=20 44 KSP Residual norm 7.051316196940e-04=20 45 KSP Residual norm 7.626284612089e-04=20 46 KSP Residual norm 7.692325522536e-04=20 47 KSP Residual norm 8.105306935719e-04=20 48 KSP Residual norm 6.754572656079e-04=20 49 KSP Residual norm 4.904761764611e-04=20 50 KSP Residual norm 3.986621922083e-04=20 51 KSP Residual norm 3.476228078546e-04=20 52 KSP Residual norm 3.426597098251e-04=20 53 KSP Residual norm 3.120971098789e-04=20 54 KSP Residual norm 2.769936953562e-04=20 55 KSP Residual norm 2.621096963954e-04=20 56 KSP Residual norm 2.241929758730e-04=20 57 KSP Residual norm 1.840445795331e-04=20 58 KSP Residual norm 1.477394166025e-04=20 59 KSP Residual norm 1.141612356893e-04=20 60 KSP Residual norm 9.574701472168e-05=20 61 KSP Residual norm 7.874246069364e-05=20 62 KSP Residual norm 7.271945256830e-05=20 63 KSP Residual norm 6.297632771997e-05=20 64 KSP Residual norm 5.315979795168e-05=20 65 KSP Residual norm 4.912135833590e-05=20 66 KSP Residual norm 4.338720350772e-05=20 67 KSP Residual norm 4.120480903721e-05=20 68 KSP Residual norm 3.386177891258e-05=20 69 KSP Residual norm 2.586130736809e-05=20 70 KSP Residual norm 2.003228591233e-05=20 71 KSP Residual norm 1.679283232536e-05=20 72 KSP Residual norm 1.542413646819e-05=20 73 KSP Residual norm 1.186018534649e-05=20 74 KSP Residual norm 8.343995820358e-06=20 75 KSP Residual norm 6.086261466222e-06=20 76 KSP Residual norm 4.608351541787e-06=20 77 KSP Residual norm 3.797253978459e-06=20 78 KSP Residual norm 2.868950787685e-06=20 79 KSP Residual norm 2.331260891788e-06=20 80 KSP Residual norm 1.859180984181e-06=20 81 KSP Residual norm 1.534788901092e-06=20 82 KSP Residual norm 1.339912560564e-06=20 83 KSP Residual norm 1.155184749519e-06=20 84 KSP Residual norm 9.574312235474e-07=20 85 KSP Residual norm 7.780536934462e-07=20 86 KSP Residual norm 6.095908349272e-07=20 87 KSP Residual norm 4.962137565767e-07=20 88 KSP Residual norm 3.708874256466e-07=20 89 KSP Residual norm 2.725826596255e-07=20 90 KSP Residual norm 2.062916200867e-07=20 91 KSP Residual norm 1.638859470273e-07=20 92 KSP Residual norm 1.412242451472e-07=20 93 KSP Residual norm 1.059207061699e-07=20 94 KSP Residual norm 7.279706905383e-08=20 95 KSP Residual norm 5.057536869338e-08=20 96 KSP Residual norm 3.347167334479e-08=20 97 KSP Residual norm 2.523730940216e-08=20 98 KSP Residual norm 1.757339482700e-08=20 99 KSP Residual norm 1.326066585113e-08=20 100 KSP Residual norm 9.783508454797e-09=20 101 KSP Residual norm 7.807276727035e-09=20 102 KSP Residual norm 6.777052346483e-09=20 103 KSP Residual norm 6.052049718370e-09=20 104 KSP Residual norm 6.099599169807e-09=20 105 KSP Residual norm 5.380840211341e-09=20 106 KSP Residual norm 4.615959801862e-09=20 107 KSP Residual norm 3.568742963809e-09=20 108 KSP Residual norm 2.728235142822e-09=20 109 KSP Residual norm 2.349399439668e-09=20 110 KSP Residual norm 2.110611680219e-09=20 111 KSP Residual norm 2.047622000438e-09=20 112 KSP Residual norm 1.700284864512e-09=20 113 KSP Residual norm 1.459066735798e-09=20 114 KSP Residual norm 1.201333053830e-09=20 115 KSP Residual norm 9.939314874190e-10=20 116 KSP Residual norm 8.036659229074e-10=20 117 KSP Residual norm 5.468476504634e-10=20 118 KSP Residual norm 3.989675492675e-10=20 119 KSP Residual norm 2.827500432789e-10=20 120 KSP Residual norm 2.255148366420e-10=20 121 KSP Residual norm 1.819546867303e-10=20 122 KSP Residual norm 1.349356037949e-10=20 123 KSP Residual norm 1.020111434046e-10=20 124 KSP Residual norm 7.357691674430e-11=20 125 KSP Residual norm 6.079294109786e-11=20 126 KSP Residual norm 4.797031426284e-11=20 *************************************************************************= *** ******************************************** *** WIDEN YOUR WINDOW TO 120 CHARACTERS. Use 'enscript -r -fCourier9' to print this document *** *************************************************************************= *** ******************************************** ---------------------------------------------- PETSc Performance Summary: ---------------------------------------------- /home/bengao/libmesh/examples/ex4/ex4 on a alpha named nickel29 with 8 processors, by bengao Mon Jun 21 09:41:27 2004 Using Petsc Version 2.1.6, Patch 0, Released Aug 05, 2003 Max Max/Min Avg Total=20 Time (sec): 2.502e+02 14.47542 4.641e+01 Objects: 0.000e+00 0.00000 0.000e+00 Flops: 1.049e+09 1.14386 9.824e+08 7.859e+09 Flops/sec: 5.826e+07 13.89204 4.970e+07 3.976e+08 MPI Messages: 9.220e+02 1.74291 7.898e+02 6.318e+03 MPI Message Lengths: 8.019e+06 1.32256 8.590e+03 5.427e+07 MPI Reductions: 5.038e+01 1.00000 Flop counting convention: 1 flop =3D 1 real number operation of type (multiply/divide/add/subtract) e.g., VecAXPY() for real vectors of length N = --> 2N flops and VecAXPY() for complex vectors of length N --> 8N flops Summary of Stages: ----- Time ------ ----- Flops ----- --- Messages -= -- -- Message Lengths -- -- Reductions -- Avg %Total Avg %Total counts %Tot= al Avg %Total counts %Total=20 0: Main Stage: 4.6415e+01 100.0% 7.8589e+09 100.0% 6.318e+03 100.= 0% 8.590e+03 100.0% 4.030e+02 100.0%=20 -------------------------------------------------------------------------= --- -------------------------------------------- See the 'Profiling' chapter of the users' manual for details on interpret= ing output. Phase summary info: Count: number of times phase was executed Time and Flops/sec: Max - maximum over all processors Ratio - ratio of maximum to minimum over all processors Mess: number of messages sent Avg. len: average message length Reduct: number of global reductions Global: entire computation Stage: stages of a computation. Set stages with PetscLogStagePush() an= d PetscLogStagePop(). %T - percent time in this phase %F - percent flops in this phase %M - percent messages in this phase %L - percent message length= s in this phase %R - percent reductions in this phase Total Mflop/s: 10e-6 * (sum of flops over all processors)/(max time ov= er all processors) -------------------------------------------------------------------------= --- -------------------------------------------- ########################################################## # # # WARNING!!! # # # # This code was run without the PreLoadBegin() # # macros. To get timing results we always recommend # # preloading. otherwise timing numbers may be # # meaningless. # ########################################################## Event Count Time (sec) Flops/sec --- Global --- --- Stage --- Total Max Ratio Max Ratio Max Ratio Mess Avg len Reduct %T %F %M %L %R %T %F %M %L %R Mflop/s -------------------------------------------------------------------------= --- -------------------------------------------- --- Event Stage 0: Main Stage VecDot 253 1.0 8.3008e-01 7.5 1.31e+08 7.7 0.0e+00 0.0e+00 2.5e+02 1 1 0 0 63 1 1 0 0 63 138 VecNorm 127 1.0 1.8330e+00 5.6 2.32e+07 6.0 0.0e+00 0.0e+00 1.3e+02 3 1 0 0 32 3 1 0 0 32 31 VecCopy 1 1.0 9.7704e-04 0.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecSet 131 1.0 1.1720e-02 4.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecAXPY 252 1.0 4.2974e-02 1.3 4.38e+08 1.3 0.0e+00 0.0e+00 0.0e+00 0 1 0 0 0 0 1 0 0 0 2662 VecAYPX 126 1.0 2.3436e-02 1.4 4.23e+08 1.4 0.0e+00 0.0e+00 0.0e+00 0 1 0 0 0 0 1 0 0 0 2441 VecAssemblyBegin 2 1.0 2.8320e-02 1.0 0.00e+00 0.0 4.2e+01 1.0e+04 6.0e+00 0 0 1 1 1 0 0 1 1 1 0 VecAssemblyEnd 2 1.0 3.9060e-03 0.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecScatterBegin 128 1.0 7.1286e-02 1.8 0.00e+00 0.0 6.1e+03 6.2e+03 0.0e+00 0 0 97 70 0 0 0 97 70 0 0 VecScatterEnd 128 1.0 2.0899e-01 7.6 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatMult 127 1.0 2.4433e+00 1.4 2.43e+08 1.3 6.1e+03 6.2e+03 0.0e+00 5 45 96 69 0 5 45 96 69 0 1443 MatSolve 127 1.0 5.3310e+00 1.4 1.07e+08 1.3 0.0e+00 0.0e+00 0.0e+00 10 43 0 0 0 10 43 0 0 0 628 MatLUFactorNum 1 1.0 2.8418e-01 1.2 3.17e+08 1.1 0.0e+00 0.0e+00 0.0e+00 1 8 0 0 0 1 8 0 0 0 2254 MatILUFactorSym 1 1.0 6.1522e-02 1.2 0.00e+00 0.0 0.0e+00 0.0e+00 1.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatAssemblyBegin 1 1.0 2.4805e-0112.1 0.00e+00 0.0 4.2e+01 3.6e+05 2.0e+00 0 0 1 28 0 0 0 1 28 0 0 MatAssemblyEnd 1 1.0 1.3281e-01 1.2 0.00e+00 0.0 4.8e+01 5.3e+03 7.0e+00 0 0 1 0 2 0 0 1 0 2 0 MatGetOrdering 1 1.0 1.9541e-03 0.0 0.00e+00 0.0 0.0e+00 0.0e+00 2.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatZeroEntries 3 1.0 5.0781e-02 1.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 PCSetUp 2 1.0 3.3887e-01 1.1 2.63e+08 1.1 0.0e+00 0.0e+00 3.0e+00 1 8 0 0 1 1 8 0 0 1 1890 PCSetUpOnBlocks 1 1.0 3.3789e-01 1.1 2.63e+08 1.1 0.0e+00 0.0e+00 3.0e+00 1 8 0 0 1 1 8 0 0 1 1896 PCApply 127 1.0 5.3447e+00 1.4 1.06e+08 1.3 0.0e+00 ... [truncated message content] |