Benchmark
This page gives an overview of the solver's performance.
Version v1.0.0.
A reference performance page is available here. Some results are also given in the last appendix of this PhD thesis.
Notes:
- In the results tables, values in parentheses were obtained with 4 threads; values outside parentheses with 1 thread. CPU_user, CPU_real and CPU_kernel are the user, wall-clock and kernel times.
- GMRES uses a tolerance of 1e-5 and 50 restarts, and is preconditioned with an incomplete LU factorization (fill-in factor of 2 and drop tolerance of 1e-6).
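The GMRES settings quoted in the notes can be illustrated with a minimal sketch. It uses SciPy as a stand-in and is not the solver's own implementation. The tridiagonal test matrix is purely illustrative, and the sketch assumes that "50 restarts" refers to a restart length of 50 Krylov vectors.

```python
import numpy as np
import scipy.sparse as sp
import scipy.sparse.linalg as spla

# Illustrative stand-in system (1D Poisson matrix), not the solver's Jacobian.
n = 200
A = sp.diags([-1.0, 2.0, -1.0], offsets=[-1, 0, 1], shape=(n, n), format="csc")
b = np.ones(n)

# Incomplete LU with the fill-in factor and drop tolerance quoted in the note.
ilu = spla.spilu(A, drop_tol=1e-6, fill_factor=2)
M = spla.LinearOperator(A.shape, matvec=ilu.solve)

# Assumption: "50 restarts" is read as a restart length of 50.
# SciPy's default relative tolerance is 1e-5, so it is not passed explicitly.
x, info = spla.gmres(A, b, M=M, restart=50)

relative_residual = np.linalg.norm(A @ x - b) / np.linalg.norm(b)
print(f"info = {info}, relative residual = {relative_residual:.2e}")
```

An `info` value of 0 indicates that SciPy's GMRES reported convergence within the requested tolerance.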
List of machines:
Id | name | operating system | machine type | computing unit (frequency) | memory | compiler |
---|---|---|---|---|---|---|
01 | - | Ubuntu 20 | virtual | i7-7700HQ (2.8 GHz) | 8 GB | GNU gcc |
02 | - | Windows 10 / MSYS2 | laptop | i7-7700HQ (2.8 GHz) | 16 GB | GNU MinGW |
03 | - | Ubuntu 20 | virtual | i7-4790K (4.0 GHz) | 8 GB | GNU gcc |
04 | - | Windows 10 | desktop | i7-4790K (4.0 GHz) | 12 GB | VS 2019 |
Test case:
The Onera M6 wing (dart/benchmark/onera.py) is the standard test/benchmark case for steady transonic flow computations.
Results
v1.0.2 (unreleased)
Mesh size: 711976 elements (556008 in the volume) and 135233 nodes.
Machine id | BLAS | Solver | iterations | threads | CPU_user | CPU_real | CPU_kernel |
---|---|---|---|---|---|---|---|
01 | Intel MKL | PARDISO | 8 | 1 (4) | 89 (116) s | 93 (45) s | 4.2 (5.8) s |
02 | OpenBLAS | GMRES | 8 | 1 (4) | 115 (148) s | 118 (68) s | 2.4 (4.0) s |
03 | Intel MKL | PARDISO | 8 | 1 (4) | 82 (103) s | 84 (39) s | 2.6 (4.0) s |
04 | Intel MKL | PARDISO | ? | 1 (4) | ? (?) s | ? (?) s | ? (?) s |
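
The 1-thread and 4-thread wall-clock times in the v1.0.2 table can be turned into a parallel speedup and efficiency. The short sketch below does this arithmetic, assuming the values in parentheses are the 4-thread timings; the function and variable names are illustrative only.

```python
# Wall-clock (CPU_real) times in seconds from the v1.0.2 table: (1 thread, 4 threads).
# Machine 04 is omitted because its timings are not reported.
real_times = {
    "01": (93.0, 45.0),
    "02": (118.0, 68.0),
    "03": (84.0, 39.0),
}
THREADS = 4


def speedup_and_efficiency(serial, parallel, threads):
    """Return (speedup, parallel efficiency) for one pair of wall-clock times."""
    speedup = serial / parallel
    return speedup, speedup / threads


results = {
    machine: speedup_and_efficiency(serial, parallel, THREADS)
    for machine, (serial, parallel) in real_times.items()
}

for machine, (speedup, efficiency) in results.items():
    print(f"machine {machine}: speedup {speedup:.2f}, efficiency {efficiency:.0%}")
```

With these figures the speedups on 4 threads are about 2.07, 1.74 and 2.15 for machines 01, 02 and 03.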
v1.0.1 (September 2021)
Mesh size: 772197 elements (602199 in the volume) and 146744 nodes.
Machine id | BLAS | Solver | iterations | threads | CPU_user | CPU_real | CPU_kernel |
---|---|---|---|---|---|---|---|
01 | Intel MKL | PARDISO | 15 | 1 (4) | 152 (202) s | 157 (79) s | 5.5 (8.5) s |
02 | OpenBLAS | GMRES | 15 | 1 (4) | 191 (263) s | 195 (127) s | 3.5 (7.7) s |
03 | Intel MKL | PARDISO | 15 | 1 (4) | 138 (194) s | 142 (74) s | 5.5 (4.4) s |
04 | Intel MKL | PARDISO | 15 | 1 (4) | 204 (312) s | 211 (116) s | 6.4 (21.6) s |
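
Because the two versions were run on different meshes and needed a different number of iterations, total times are not directly comparable. One possible normalization, sketched below for machine 01 on 1 thread, is the wall-clock time per iteration, optionally scaled by the number of mesh elements. The choice of normalization and the names used are assumptions made for illustration.

```python
# (iterations, CPU_real on 1 thread in seconds, mesh elements) for machine 01,
# copied from the v1.0.1 and v1.0.2 tables.
runs = {
    "v1.0.1": (15, 157.0, 772197),
    "v1.0.2": (8, 93.0, 711976),
}


def cost_per_iteration(iterations, wall_time, elements):
    """Return seconds per iteration, and seconds per iteration per million elements."""
    per_iteration = wall_time / iterations
    return per_iteration, per_iteration / (elements / 1e6)


costs = {version: cost_per_iteration(*run) for version, run in runs.items()}

for version, (per_iteration, per_million) in costs.items():
    print(
        f"{version}: {per_iteration:.2f} s/iteration, "
        f"{per_million:.2f} s/iteration per million elements"
    )
```

On these figures the cost of a single iteration is about 10.5 s for v1.0.1 and about 11.6 s for v1.0.2, so the lower total time of v1.0.2 coincides with the lower iteration count rather than with cheaper iterations.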