Name | hadcm3n_o6wd_2020_40_008374226_0 |
Workunit | 8525085 |
Created | 29 May 2013, 21:03:04 UTC |
Sent | 29 May 2013, 21:11:16 UTC |
Report deadline | 29 Aug 2013, 4:38:27 UTC |
Received | 25 Aug 2013, 18:48:17 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 255 (0x000000FF) Unknown error code |
Computer ID | 1169010 |
Run time | 14 days 23 hours 11 min 48 sec |
CPU time | 8 days 22 hours 30 min 1 sec |
Validate state | Invalid |
Credit | 3,110.40 |
Device peak FLOPS | 2.46 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.64</core_client_version> <![CDATA[ <message> Les attributs étendus (EA) sont incohérents. (0xff) - exit code 255 (0xff) </message> <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4876, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5088, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5620, iMonCtr=1 Model crash detected, will try to restart... 12:56:01 (4696): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:21:27 (5304): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:21:28 (5304): No heartbeat from core client for 30 sec - exiting 13:21:29 (5304): No heartbeat from core client for 30 sec - exiting 13:21:30 (5304): No heartbeat from core client for 30 sec - exiting 13:21:31 (5304): No heartbeat from core client for 30 sec - exiting 13:21:32 (5304): No heartbeat from core client for 30 sec - exiting 13:21:33 (5304): No heartbeat from core client for 30 sec - exiting 13:21:34 (5304): No heartbeat from core client for 30 sec - exiting 13:21:35 (5304): No heartbeat from core client for 30 sec - exiting 13:21:36 (5304): No heartbeat from core client for 30 sec - exiting CSuspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4852, iMonCtr=1 Model crash detected, will try to restart... 19:04:19 (2892): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:04:20 (2892): No heartbeat from core client for 30 sec - exiting 19:04:21 (2892): No heartbeat from core client for 30 sec - exiting 19:04:22 (2892): No heartbeat from core client for 30 sec - exiting 19:04:23 (2892): No heartbeat from core client for 30 sec - exiting 19:04:24 (2892): No heartbeat from core client for 30 sec - exiting 19:04:25 (2892): No heartbeat from core client for 30 sec - exiting 19:04:27 (2892): No heartbeat from core client for 30 sec - exiting 19:04:28 (2892): No heartbeat from core client for 30 sec - exiting 19:04:29 (2892): No heartbeat from core client for 30 sec - exiting Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4708, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5964, iMonCtr=1 Model crash detected, will try to restart... C19:38:17 (4780): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7940, iMonCtr=1 Model crash detected, will try to restart... 18:26:50 (5124): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 12:21:29 (5024): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:40:40 (5644): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:37:32 (4728): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:37:38 (4728): No heartbeat from core client for 30 sec - exiting 22:37:39 (4728): No heartbeat from core client for 30 sec - exiting 22:37:40 (4728): No heartbeat from core client for 30 sec - exiting 22:37:41 (4728): No heartbeat from core client for 30 sec - exiting 23:04:15 (5132): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:05:43 (2428): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 19:17:44 (4948): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:56:54 (4812): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:02:55 (6068): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:02:56 (6068): No heartbeat from core client for 30 sec - exiting 19:02:57 (6068): No heartbeat from core client for 30 sec - exiting 19:02:59 (6068): No heartbeat from core client for 30 sec - exiting 19:03:00 (6068): No heartbeat from core client for 30 sec - exiting 19:03:01 (6068): No heartbeat from core client for 30 sec - exiting 19:03:02 (6068): No heartbeat from core client for 30 sec - exiting 19:03:03 (6068): No heartbeat from core client for 30 sec - exiting 19:03:04 (6068): No heartbeat from core client for 30 sec - exiting 19:03:05 (6068): No heartbeat from core client for 30 sec - exiting 19:03:06 (6068): No heartbeat from core client for 30 sec - exiting C20:26:29 (1676): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 20:56:10 (5088): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5104, iMonCtr=1 Model crash detected, will try to restart... 19:28:50 (5644): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5752, iMonCtr=1 Model crash detected, will try to restart... 18:32:43 (5944): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 19:33:51 (2592): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6020, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5180, iMonCtr=1 Model crash detected, will try to restart... 20:42:14 (2884): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:42:15 (2884): No heartbeat from core client for 30 sec - exiting 20:42:16 (2884): No heartbeat from core client for 30 sec - exiting 20:42:17 (2884): No heartbeat from core client for 30 sec - exiting 20:42:18 (2884): No heartbeat from core client for 30 sec - exiting 20:42:19 (2884): No heartbeat from core client for 30 sec - exiting 20:42:20 (2884): No heartbeat from core client for 30 sec - exiting 20:42:21 (2884): No heartbeat from core client for 30 sec - exiting 20:42:22 (2884): No heartbeat from core client for 30 sec - exiting 21:04:20 (2316): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:12:20 (2940): No heartbeat from core client for 30 sec - exiting 22:12:47 (2940): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:24:03 (5712): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x77396E5F read attempt to address 0x00000000 Engaging BOINC Windows Runtime Debugger... </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
25 Aug 2013 18:50:12 | 1169010 | 15802789 | hadcm3n_o6wd_2020_40_008374226_0 | 259,200 | 763,559 | 2.9458 |
23 Jul 2013 17:50:02 | 1169010 | 15802789 | hadcm3n_o6wd_2020_40_008374226_0 | 233,280 | 622,855 | 2.6700 |
11 Jul 2013 18:00:37 | 1169010 | 15802789 | hadcm3n_o6wd_2020_40_008374226_0 | 207,360 | 539,352 | 2.6010 |
06 Jul 2013 10:29:38 | 1169010 | 15802789 | hadcm3n_o6wd_2020_40_008374226_0 | 181,440 | 466,254 | 2.5697 |
02 Jul 2013 09:46:27 | 1169010 | 15802789 | hadcm3n_o6wd_2020_40_008374226_0 | 155,520 | 399,835 | 2.5710 |
19 Jun 2013 21:47:15 | 1169010 | 15802789 | hadcm3n_o6wd_2020_40_008374226_0 | 129,600 | 332,630 | 2.5666 |
13 Jun 2013 20:19:18 | 1169010 | 15802789 | hadcm3n_o6wd_2020_40_008374226_0 | 103,680 | 266,530 | 2.5707 |
08 Jun 2013 20:54:11 | 1169010 | 15802789 | hadcm3n_o6wd_2020_40_008374226_0 | 77,760 | 195,717 | 2.5169 |
04 Jun 2013 20:18:24 | 1169010 | 15802789 | hadcm3n_o6wd_2020_40_008374226_0 | 51,840 | 132,263 | 2.5514 |
02 Jun 2013 00:00:27 | 1169010 | 15802789 | hadcm3n_o6wd_2020_40_008374226_0 | 25,920 | 67,270 | 2.5953 |
©2024 cpdn.org