climateprediction.net home page
Task 18513761

Task 18513761

Name hadam3prm3pm2t_eu_pq4d_2002_1_009830120_1
Workunit 9886046
Created 30 May 2015, 0:38:18 UTC
Sent 29 Oct 2015, 8:15:47 UTC
Report deadline 10 Oct 2016, 13:35:47 UTC
Received 16 Nov 2015, 9:54:21 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1377973
Run time 12 days 21 hours 53 min 35 sec
CPU time 9 days 3 hours 58 min 58 sec
Validate state Invalid
Credit 3,995.19
Device peak FLOPS 2.06 GFLOPS
Application version UK Met Office HadAM3P and HadRM3P model with MOSES II and TRIFFID Europe v7.01
i686-pc-linux-gnu
Stderr
<core_client_version>7.0.27</core_client_version>
<![CDATA[
<message>
process exited with code 22 (0x16, -234)
</message>
<stderr_txt>
17:22:26 (25211): No heartbeat from client for 30 sec - exiting
17:22:26 (25211): timer handler: client dead, exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1000, selfPID=1000, iMonCtr=1
Signal 3 received, exiting...
10:16:05 (1001): called boinc_finish
10:16:38 (1164): No heartbeat from client for 30 sec - exiting
10:16:38 (1164): timer handler: client dead, exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13158, selfPID=13065, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=13586, selfPID=13537, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=20117, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=21106, selfPID=21106, iMonCtr=1
Signal 3 received, exiting...
07:25:38 (21107): called boinc_finish
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=15047, selfPID=15048, iMonCtr=1
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=15047, selfPID=15047, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=18218, selfPID=18133, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=23467, selfPID=23404, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
17:14:18 (24635): No heartbeat from client for 30 sec - exiting
17:14:18 (24635): timer handler: client dead, exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:00:45 (25040): No heartbeat from client for 30 sec - exiting
02:00:46 (25040): timer handler: client dead, exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:00:47 (25040): No heartbeat from client for 30 sec - exiting
02:00:47 (25040): timer handler: client dead, exiting
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
18:12:28 (27692): No heartbeat from client for 30 sec - exiting
18:12:44 (27692): timer handler: client dead, exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:12:45 (27692): No heartbeat from client for 30 sec - exiting
18:12:45 (27692): timer handler: client dead, exiting
18:12:46 (27692): No heartbeat from client for 30 sec - exiting
18:13:26 (27692): timer handler: client dead, exiting
18:13:27 (27692):18:14:34 (28033): No heartbeat from client for 30 sec - exiting
18:14:34 (28033): timer handler: client dead, exiting
18:14:35 (28033): No heartbeat from client for 30 sec - exiting
18:14:35 (28033): timer handler: client dead, exiting
18:14:36 (28033): No heartbeat from client for 30 sec - exiting
18:14:36 (28033): timer handler: client dead, exiting
18:14:37 (28033): No heartbeat from client for 30 sec - exiting
18:14:37 (28033): timer handler: client dead, exiting
18:14:38 (28033): No heartbeat from client for 30 sec - exiting
18:14:40 (28033): timer handler: client dead, exiting
18:14:41 (28033): No heartbeat from client for 30 sec - exiting
18:14:41 (28033): timer handler: client dead, exiting
18:14:42 (28033): No heartbeat from client for 30 sec - exiting
18:14:44 (28033): timer handler: client dead, exiting
18:14:45 (28033): No heartbeat from client for 30 sec - exiting
18:14:45 (28033): timer handler: client dead, exiting
18:14:46 (28033): No heartbeat from client for 30 sec - exiting
18:14:50 (28033): timer handler: client dead, exiting
18:14:51 (28033): No heartbeat from client for 30 sec - exiting
18:14:53 (28033): timer handler: client dead, exiting
18:14:54 (28033): No heartbeat from client for 30 sec - exiting
18:14:56 (28033): timer handler: client dead, exiting
18:14:57 (28033): No heartbeat from client for 30 sec - exiting
18:15:02 (28033): timer handler: client dead, exiting
18:15:03 (28033): No heartbeat from client for 30 sec - exiting
18:15:06 (28033): timer handler: client dead, exiting
18:15:07 (28033): No heartbeat from client for 30 sec - exiting
18:15:08 (28033): timer handler: client dead, exiting
18:15:09 (28033): No heartbeat from client for 30 sec - exiting
18:15:09 (28033): timer handler: client dead, exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:16:49 (28071): No heartbeat from client for 30 sec - exiting
20:16:51 (28071): timer handler: client dead, exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:01:48 (28182): No heartbeat from client for 30 sec - exiting
04:01:52 (28182): timer handler: client dead, exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:01:53 (28182): No heartbeat from client for 30 sec - exiting
04:01:53 (28182): timer handler: client dead, exiting
04:01:54 (28182):00:03:22 (28346): No heartbeat from client for 30 sec - exiting
00:03:23 (28346): timer handler: client dead, exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:11:25 (28736): No heartbeat from client for 30 sec - exiting
06:11:27 (28736): timer handler: client dead, exiting
CPDN Monitor - No 'heartbeat' from BOINC...
execv: No such file or directory

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
14 Nov 2015 13:45:50 1377973 18513761 hadam3prm3pm2t_eu_pq4d_2002_1_009830120_1 57,899 697,209 12.0418
11 Nov 2015 17:41:09 1377973 18513761 hadam3prm3pm2t_eu_pq4d_2002_1_009830120_1 46,379 553,749 11.9396
08 Nov 2015 20:16:56 1377973 18513761 hadam3prm3pm2t_eu_pq4d_2002_1_009830120_1 34,859 404,544 11.6052
06 Nov 2015 08:19:10 1377973 18513761 hadam3prm3pm2t_eu_pq4d_2002_1_009830120_1 23,339 257,568 11.0359
05 Nov 2015 14:20:06 1377973 18513761 hadam3prm3pm2t_eu_pq4d_2002_1_009830120_1 11,819 120,218 10.1716


©2024 cpdn.org