Name | hadam3prm3pm2t_eu_oqd7_2002_1_009829438_0 |
Workunit | 9885364 |
Created | 7 May 2015, 15:37:17 UTC |
Sent | 1 Nov 2015, 10:05:48 UTC |
Report deadline | 13 Oct 2016, 15:25:48 UTC |
Received | 16 Nov 2015, 9:54:21 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1377973 |
Run time | 9 days 17 hours 50 min 53 sec |
CPU time | 5 days 15 hours 43 min 59 sec |
Validate state | Invalid |
Credit | 2,392.12 |
Device peak FLOPS | 2.06 GFLOPS |
Application version | UK Met Office HadAM3P and HadRM3P model with MOSES II and TRIFFID Europe v7.01 i686-pc-linux-gnu |
Stderr | <core_client_version>7.0.27</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1044, selfPID=1044, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1044, selfPID=1045, iMonCtr=1 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13144, selfPID=13063, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13636, selfPID=13552, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... SIGSEGV: segmentation violation Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=20230, selfPID=20146, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=15051, selfPID=15052, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=15051, selfPID=15051, iMonCtr=1 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=18176, selfPID=18148, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=23479, selfPID=23433, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 17:14:18 (24650): No heartbeat from client for 30 sec - exiting 17:14:18 (24650): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:00:45 (25055): No heartbeat from client for 30 sec - exiting 02:00:46 (25055): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:00:47 (25055): No heartbeat from client for 30 sec - exiting 02:00:48 (25055): timer handler: client dead, exiting CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 18:12:28 (27707): No heartbeat from client for 30 sec - exiting 18:12:44 (27707): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:12:45 (27707): No heartbeat from client for 30 sec - exiting 18:12:45 (27707): timer handler: client dead, exiting 18:12:46 (27707): No heartbeat from client for 30 sec - exiting 18:13:26 (27707): timer handler: client dead, exiting 18:13:27 (27707): No heartbeat from client for 30 sec - exiting 18:13:32 (27707): timer handler: client dead, exiting 18:13:33 (27707): No heartbeat from client for 30 sec - exiting 18:13:38 (27707): timer handler: client dead, exiting 18:13:39 (27707): No heartbeat from client for 30 sec - exiting 18:13:41 (27707): timer handler: client dead, exiting 18:13:42 (27707): No heartbeat from client for 30 sec - exiting 18:13:43 (27707): timer handler: client dead, exiting 18:13:44 (27707): No heartbeat from client for 30 sec - exiting 18:13:46 (27707): timer handler: client dead, exiting 18:13:47 (27707): No heartbeat from client for 30 sec - exiting 18:13:49 (27707): timer handler: client dead, exiting 18:14:20 (27707): No heartbeat from client for 30 sec - exiting 18:14:20 (27707): timer handler: client dead, exiting 18:14:21 (27707): No heartbeat from client for 30 sec - exiting 18:14:21 (27707): timer handler: client dead, exiting 18:14:22 (27707): No heartbeat from client for 30 sec - exiting 18:14:22 (27707): timer handler: client dead, exiting 18:14:23 (27707): No heartbeat from client for 30 sec - exiting 18:14:24 (27707): timer handler: client dead, exiting 18:14:25 (27707): No heartbeat from client for 30 sec - exiting 18:14:26 (27707): timer handler: client dead, exiting 18:14:27 (27707): No heartbeat from client for 30 sec - exiting 18:14:28 (27707): timer handler: client dead, exiting 18:14:29 (27707): No heartbeat from client for 30 sec - exiting 18:14:29 (27707): timer handler: client dead, exiting 18:14:30 (27707): No heartbeat from client for 30 sec - exiting 18:14:31 (27707): timer handler: client dead, exiting 18:14:32 (27707): No heartbeat from client for 30 sec - exiting 18:14:33 (27707): timer handler: client dead, exiting 18:14:34 (27707): No heartbeat from client for 30 sec - exiting 18:14:35 (27707): timer handler: client dead, exiting 18:14:36 (27707): No heartbeat from client for 30 sec - exiting 18:14:36 (27707): timer handler: client dead, exiting 18:14:37 (27707): No heartbeat from client for 30 sec - exiting 18:14:37 (27707): timer handler: client dead, exiting 18:14:38 (27707): No heartbeat from client for 30 sec - exiting 18:14:40 (27707): timer handler: client dead, exiting 18:14:41 (27707): No heartbeat from client for 30 sec - exiting 18:14:41 (27707): timer handler: client dead, exiting 18:14:42 (27707): No heartbeat from client for 30 sec - exiting 18:14:44 (27707): timer handler: client dead, exiting 18:14:45 (27707): No heartbeat from client for 30 sec - exiting 18:14:45 (27707): timer handler: client dead, exiting 18:14:46 (27707): No heartbeat from client for 30 sec - exiting 18:14:50 (27707): timer handler: client dead, exiting 18:14:51 (27707): No heartbeat from client for 30 sec - exiting 18:14:53 (27707): timer handler: client dead, exiting 18:14:54 (27707): No heartbeat from client for 30 sec - exiting 18:14:56 (27707): timer handler: client dead, exiting 18:14:57 (27707): No heartbeat from client for 30 sec - exiting 18:15:02 (27707): timer handler: client dead, exiting 20:16:49 (28063): No heartbeat from client for 30 sec - exiting 20:16:50 (28063): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:16:51 (28063): No heartbeat from client for 30 sec - exiting 20:16:51 (28063): timer handler: client dead, exiting 04:01:48 (28180): No heartbeat from client for 30 sec - exiting 04:01:52 (28180): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:01:53 (28180): No heartbeat from client for 30 sec - exiting 04:01:53 (28180): timer handler: client dead, exiting 04:01:54 (28180): No heartbeat from client for 30 sec - exiting 04:01:55 (28180): timer handler: client dead, exiting 04:01:56 (28180): No heartbeat from client for 30 sec - exiting 04:01:56 (28180): timer handler: client dead, exiting 04:01:57 (28180): No heartbeat from client for 30 sec - exiting 04:01:57 (28180): timer handler: client dead, exiting 04:01:58 (28180): No heartbeat from client for 30 sec - exiting 04:01:59 (28180): timer handler: client dead, exiting 04:02:00 (28180): No heartbeat from client for 30 sec - exiting 04:02:00 (28180): timer handler: client dead, exiting 04:02:01 (28180): No heartbeat from client for 30 sec - exiting 04:02:02 (28180): timer handler: client dead, exiting 06:11:25 (28344): No heartbeat from client for 30 sec - exiting 06:11:27 (28344): timer handler: client dead, exiting CPDN Monitor - No 'heartbeat' from BOINC... execv: No such file or directory </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
15 Nov 2015 06:32:01 | 1377973 | 18415747 | hadam3prm3pm2t_eu_oqd7_2002_1_009829438_0 | 34,667 | 430,905 | 12.4298 |
12 Nov 2015 16:01:59 | 1377973 | 18415747 | hadam3prm3pm2t_eu_oqd7_2002_1_009829438_0 | 23,339 | 297,024 | 12.7265 |
09 Nov 2015 22:22:07 | 1377973 | 18415747 | hadam3prm3pm2t_eu_oqd7_2002_1_009829438_0 | 11,819 | 147,630 | 12.4909 |
©2024 climateprediction.net