Name | hadam3p_eu_2iny_1996_1_007398979_2 |
Workunit | 7596409 |
Created | 23 Aug 2011, 21:20:55 UTC |
Sent | 23 Aug 2011, 21:21:03 UTC |
Report deadline | 5 Aug 2012, 2:41:03 UTC |
Received | 31 Aug 2011, 2:23:38 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | -226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS |
Computer ID | 1160433 |
Run time | 5 days 0 hours 10 min 53 sec |
CPU time | 4 days 10 hours 48 min 37 sec |
Validate state | Invalid |
Credit | 1,988.94 |
Device peak FLOPS | 2.41 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Europe v6.09 windows_intelx86 |
Stderr | <core_client_version>6.10.18</core_client_version>
<![CDATA[
<message>
too many exit(0)s
</message>
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=12788, selfPID=12988, iMonCtr=1
Model crash detected, will try to restart...
15:35:55 (5036): No heartbeat from core client for 30 sec - exiting
15:35:56 (5036): No heartbeat from core client for 30 sec - exiting
15:35:57 (5036): No heartbeat from core client for 30 sec - exiting
15:35:58 (5036): No heartbeat from core client for 30 sec - exiting
15:35:59 (5036): No heartbeat from core client for 30 sec - exiting
15:36:00 (5036): No heartbeat from core client for 30 sec - exiting
15:36:01 (5036): No heartbeat from core client for 30 sec - exiting
15:36:02 (5036): No heartbeat from core client for 30 sec - exiting
15:36:03 (5036): No heartbeat from core client for 30 sec - exiting
15:36:04 (5036): No heartbeat from core client for 30 sec - exiting
15:36:05 (5036): No heartbeat from core client for 30 sec - exiting
15:36:06 (5036): No heartbeat from core client for 30 sec - exiting
15:36:07 (5036): No heartbeat from core client for 30 sec - exiting
15:36:08 (5036): No heartbeat from core client for 30 sec - exiting
15:36:09 (5036): No heartbeat from core client for 30 sec - exiting
15:36:10 (5036): No heartbeat from core client for 30 sec - exiting
15:36:11 (5036): No heartbeat from core client for 30 sec - exiting
15:36:12 (5036): No heartbeat from core client for 30 sec - exiting
15:36:13 (5036): No heartbeat from core client for 30 sec - exiting
15:36:14 (5036): No heartbeat from core client for 30 sec - exiting
15:36:15 (5036): No heartbeat from core client for 30 sec - exiting
15:36:16 (5036): No heartbeat from core client for 30 sec - exiting
15:36:17 (5036): No heartbeat from core client for 30 sec - exiting
15:36:18 (5036): No heartbeat from core client for 30 sec - exiting
15:36:19 (5036): No heartbeat from core client for 30 sec - exiting
15:36:20 (5036): No heartbeat from core client for 30 sec - exiting
15:36:21 (5036): No heartbeat from core client for 30 sec - exiting
15:36:22 (5036): No heartbeat from core client for 30 sec - exiting
15:36:23 (5036): No heartbeat from core client for 30 sec - exiting
15:36:24 (5036): No heartbeat from core client for 30 sec - exiting
15:36:25 (5036): No heartbeat from core client for 30 sec - exiting
15:36:26 (5036): No heartbeat from core client for 30 sec - exiting
15:36:27 (5036): No heartbeat from core client for 30 sec - exiting
15:36:28 (5036): No heartbeat from core client for 30 sec - exiting
15:36:29 (5036): No heartbeat from core client for 30 sec - exiting
15:36:30 (5036): No heartbeat from core client for 30 sec - exiting
15:36:31 (5036): No heartbeat from core client for 30 sec - exiting
15:36:32 (5036): No heartbeat from core client for 30 sec - exiting
15:36:33 (5036): No heartbeat from core client for 30 sec - exiting
15:36:34 (5036): No heartbeat from core client for 30 sec - exiting
15:36:35 (5036): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:39:06 (1348): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5856, selfPID=5856, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2824, selfPID=2824, iMonCtr=2
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5200, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2540, iMonCtr=2
Model crash detected, will try to restart...
</stderr_txt>
]]> |
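
The Exit status row gives the same code twice: as a signed integer (-226) and as an unsigned 32-bit hex value (0xFFFFFF1E). A minimal sketch of how the two forms relate, assuming ordinary two's-complement interpretation; the helper name `to_signed32` is illustrative and not part of BOINC:

```python
def to_signed32(value: int) -> int:
    """Interpret an unsigned 32-bit value as a two's-complement signed integer."""
    value &= 0xFFFFFFFF                # keep only the low 32 bits
    if value & 0x80000000:             # sign bit set, so the value is negative
        return value - 0x100000000
    return value


# Hex form copied from the Exit status row above.
print(to_signed32(0xFFFFFF1E))  # prints -226
```

Both forms therefore refer to the same error code, which the page labels ERR_TOO_MANY_EXITS.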
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
30 Aug 2011 16:12:45 | 1160433 | 13285270 | hadam3p_eu_2iny_1996_1_007398979_2 | 115,296 | 352,530 | 3.0576 |
30 Aug 2011 06:05:55 | 1160433 | 13285270 | hadam3p_eu_2iny_1996_1_007398979_2 | 103,776 | 317,589 | 3.0603 |
29 Aug 2011 18:51:37 | 1160433 | 13285270 | hadam3p_eu_2iny_1996_1_007398979_2 | 92,256 | 282,416 | 3.0612 |
29 Aug 2011 08:00:47 | 1160433 | 13285270 | hadam3p_eu_2iny_1996_1_007398979_2 | 80,736 | 247,189 | 3.0617 |
26 Aug 2011 16:56:06 | 1160433 | 13285270 | hadam3p_eu_2iny_1996_1_007398979_2 | 69,216 | 212,250 | 3.0665 |
26 Aug 2011 06:15:59 | 1160433 | 13285270 | hadam3p_eu_2iny_1996_1_007398979_2 | 57,696 | 177,208 | 3.0714 |
25 Aug 2011 19:19:16 | 1160433 | 13285270 | hadam3p_eu_2iny_1996_1_007398979_2 | 46,176 | 141,670 | 3.0680 |
25 Aug 2011 07:50:02 | 1160433 | 13285270 | hadam3p_eu_2iny_1996_1_007398979_2 | 34,656 | 105,879 | 3.0551 |
24 Aug 2011 20:22:31 | 1160433 | 13285270 | hadam3p_eu_2iny_1996_1_007398979_2 | 23,136 | 70,755 | 3.0582 |
24 Aug 2011 09:33:26 | 1160433 | 13285270 | hadam3p_eu_2iny_1996_1_007398979_2 | 11,616 | 35,682 | 3.0718 |
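
The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count. The sketch below checks that reading against the rows of the trickle table; the values are copied from the table, while the names `trickles` and `sec_per_timestep` are illustrative only:

```python
# (timestep, cpu_time_sec, reported_average) copied from the trickle table above.
trickles = [
    (115296, 352530, 3.0576),
    (103776, 317589, 3.0603),
    (92256, 282416, 3.0612),
    (80736, 247189, 3.0617),
    (69216, 212250, 3.0665),
    (57696, 177208, 3.0714),
    (46176, 141670, 3.0680),
    (34656, 105879, 3.0551),
    (23136, 70755, 3.0582),
    (11616, 35682, 3.0718),
]


def sec_per_timestep(cpu_time_sec: float, timestep: int) -> float:
    """Average CPU seconds spent per model timestep so far."""
    return cpu_time_sec / timestep


for timestep, cpu_time_sec, reported in trickles:
    computed = sec_per_timestep(cpu_time_sec, timestep)
    # Compare with the reported value, allowing for rounding to 4 decimals.
    matches = abs(computed - reported) < 5e-4
    print(f"{timestep:>7}  computed={computed:.4f}  reported={reported:.4f}  match={matches}")
```

Under this assumption every row matches to four decimal places, and the rate stays close to 3.06 sec/TS throughout the run.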
©2024 cpdn.org