Name | hadam3p_pnw_bv3c_1975_1_007926599_1 |
Workunit | 8081711 |
Created | 5 May 2012, 17:19:09 UTC |
Sent | 5 May 2012, 18:05:48 UTC |
Report deadline | 17 Apr 2013, 23:25:48 UTC |
Received | 8 Jun 2012, 20:49:15 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | -226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS |
Computer ID | 984314 |
Run time | 5 days 21 hours 56 min 29 sec |
CPU time | 5 days 21 hours 56 min 29 sec |
Validate state | Invalid |
Credit | 2,755.56 |
Device peak FLOPS | 2.19 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Pacific North West v6.09 windows_intelx86 |
Stderr | <core_client_version>6.2.28</core_client_version> <![CDATA[ <message> too many exit(0)s </message> <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1736, selfPID=5404, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4652, selfPID=3904, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 1 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4188, selfPID=5592, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5296, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 2 CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4716, selfPID=4996, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 3 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5452, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 3 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5936, selfPID=5284, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 4 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5364, selfPID=5872, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2420, selfPID=5976, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 5 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1872, selfPID=4484, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3036, selfPID=4504, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4988, selfPID=4396, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1092, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5460, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5380, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5384, selfPID=4188, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 9 CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5040, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4912, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2212, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=168, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 9 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4264, selfPID=2192, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 10 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=288, selfPID=2836, iMonCtr=1 Model crash detected, will try to restart... 20:27:32 (4664): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3288, selfPID=4432, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
06 Jun 2012 17:35:40 | 984314 | 14631753 | hadam3p_pnw_bv3c_1975_1_007926599_1 | 126,816 | 493,300 | 3.8899 |
05 Jun 2012 20:04:35 | 984314 | 14631753 | hadam3p_pnw_bv3c_1975_1_007926599_1 | 115,296 | 449,054 | 3.8948 |
02 Jun 2012 17:57:08 | 984314 | 14631753 | hadam3p_pnw_bv3c_1975_1_007926599_1 | 103,776 | 404,783 | 3.9005 |
30 May 2012 05:49:29 | 984314 | 14631753 | hadam3p_pnw_bv3c_1975_1_007926599_1 | 92,256 | 360,358 | 3.9061 |
21 May 2012 19:54:50 | 984314 | 14631753 | hadam3p_pnw_bv3c_1975_1_007926599_1 | 80,736 | 316,151 | 3.9159 |
20 May 2012 10:54:05 | 984314 | 14631753 | hadam3p_pnw_bv3c_1975_1_007926599_1 | 69,216 | 271,836 | 3.9274 |
19 May 2012 11:22:42 | 984314 | 14631753 | hadam3p_pnw_bv3c_1975_1_007926599_1 | 57,696 | 227,384 | 3.9411 |
17 May 2012 13:00:34 | 984314 | 14631753 | hadam3p_pnw_bv3c_1975_1_007926599_1 | 46,176 | 182,472 | 3.9517 |
13 May 2012 11:05:19 | 984314 | 14631753 | hadam3p_pnw_bv3c_1975_1_007926599_1 | 34,656 | 136,342 | 3.9342 |
11 May 2012 19:34:09 | 984314 | 14631753 | hadam3p_pnw_bv3c_1975_1_007926599_1 | 23,136 | 91,205 | 3.9421 |
06 May 2012 20:27:01 | 984314 | 14631753 | hadam3p_pnw_bv3c_1975_1_007926599_1 | 11,616 | 45,998 | 3.9599 |
©2024 cpdn.org