Name | hadcm3n_3jx9_1940_40_008268207_1 |
Workunit | 8423331 |
Created | 23 Mar 2013, 8:28:37 UTC |
Sent | 23 Mar 2013, 8:28:53 UTC |
Report deadline | 22 Jun 2013, 15:56:04 UTC |
Received | 3 May 2013, 5:27:46 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 193 (0x000000C1) EXIT_SIGNAL |
Computer ID | 1168062 |
Run time | 4 days 13 hours 16 min 36 sec |
CPU time | 4 days 8 hours 15 min 30 sec |
Validate state | Invalid |
Credit | 3,110.40 |
Device peak FLOPS | 2.64 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.64</core_client_version> <![CDATA[ <message> (unknown error) - exit code 193 (0xc1) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2664, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3004, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4736, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3004, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2984, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3004, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3004, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 10:48:38 (2924): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:00:53 (5048): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:02:28 (5964): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:02:44 (5964): No heartbeat from core client for 30 sec - exiting 11:02:45 (5964): No heartbeat from core client for 30 sec - exiting 11:02:46 (5964): No heartbeat from core client for 30 sec - exiting 11:02:47 (5964): No heartbeat from core client for 30 sec - exiting 11:02:48 (5964): No heartbeat from core client for 30 sec - exiting 11:03:57 (2948): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:04:49 (5004): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:05:38 (5168): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:07:20 (5156): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:09:29 (5432): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:11:57 (4464): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:14:23 (5548): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:16:23 (5892): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:16:27 (5892): No heartbeat from core client for 30 sec - exiting 11:16:28 (5892): No heartbeat from core client for 30 sec - exiting 11:16:29 (5892): No heartbeat from core client for 30 sec - exiting 11:16:30 (5892): No heartbeat from core client for 30 sec - exiting 11:16:31 (5892): No heartbeat from core client for 30 sec - exiting 11:16:32 (5892): No heartbeat from core client for 30 sec - exiting 11:16:33 (5892): No heartbeat from core client for 30 sec - exiting 11:16:34 (5892): No heartbeat from core client for 30 sec - exiting 11:16:35 (5892): No heartbeat from core client for 30 sec - exiting 11:16:36 (5892): No heartbeat from core client for 30 sec - exiting 11:21:05 (3608): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:24:43 (4000): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:31:21 (800): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:40:46 (1296): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:40:50 (1296): No heartbeat from core client for 30 sec - exiting 11:40:51 (1296): No heartbeat from core client for 30 sec - exiting 11:40:52 (1296): No heartbeat from core client for 30 sec - exiting 11:40:53 (1296): No heartbeat from core client for 30 sec - exiting 11:40:54 (1296): No heartbeat from core client for 30 sec - exiting 11:40:55 (1296): No heartbeat from core client for 30 sec - exiting 11:40:56 (1296): No heartbeat from core client for 30 sec - exiting 11:40:57 (1296): No heartbeat from core client for 30 sec - exiting 11:40:58 (1296): No heartbeat from core client for 30 sec - exiting 11:41:46 (4944): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:41:52 (4944): No heartbeat from core client for 30 sec - exiting 11:41:53 (4944): No heartbeat from core client for 30 sec - exiting 11:41:54 (4944): No heartbeat from core client for 30 sec - exiting 11:41:55 (4944): No heartbeat from core client for 30 sec - exiting 11:41:56 (4944): No heartbeat from core client for 30 sec - exiting 11:41:57 (4944): No heartbeat from core client for 30 sec - exiting 11:41:58 (4944): No heartbeat from core client for 30 sec - exiting 11:41:59 (4944): No heartbeat from core client for 30 sec - exiting 11:43:13 (3424): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:44:56 (5392): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:07:13 (1236): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:07:20 (1236): No heartbeat from core client for 30 sec - exiting Atmos Hold Restart file rename failed on atmos_restart.hold Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3020, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3040, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2972, iMonCtr=1 Model crash detected, will try to restart... 10:12:43 (2996): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:35:48 (4800): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:36:52 (1724): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:38:32 (3060): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:39:26 (4776): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:42:43 (5768): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:44:40 (1076): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:45:26 (2304): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:45:29 (2304): No heartbeat from core client for 30 sec - exiting 10:45:30 (2304): No heartbeat from core client for 30 sec - exiting 10:46:24 (5896): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:47:06 (2040): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:49:43 (1132): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Atmos Hold Restart file rename failed on atmos_restart.hold 10:57:14 (2040): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:57:15 (2040): No heartbeat from core client for 30 sec - exiting 10:57:16 (2040): No heartbeat from core client for 30 sec - exiting 10:57:17 (2040): No heartbeat from core client for 30 sec - exiting 10:57:18 (2040): No heartbeat from core client for 30 sec - exiting 10:57:19 (2040): No heartbeat from core client for 30 sec - exiting 10:57:20 (2040): No heartbeat from core client for 30 sec - exiting 10:57:21 (2040): No heartbeat from core client for 30 sec - exiting 10:57:22 (2040): No heartbeat from core client for 30 sec - exiting 10:57:23 (2040): No heartbeat from core client for 30 sec - exiting 10:57:24 (2040): No heartbeat from core client for 30 sec - exiting 11:16:52 (4720): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:16:56 (4720): No heartbeat from core client for 30 sec - exiting 11:16:57 (4720): No heartbeat from core client for 30 sec - exiting 11:16:58 (4720): No heartbeat from core client for 30 sec - exiting 11:16:59 (4720): No heartbeat from core client for 30 sec - exiting 11:17:00 (4720): No heartbeat from core client for 30 sec - exiting 11:17:01 (4720): No heartbeat from core client for 30 sec - exiting 11:17:02 (4720): No heartbeat from core client for 30 sec - exiting 11:17:03 (4720): No heartbeat from core client for 30 sec - exiting 11:17:04 (4720): No heartbeat from core client for 30 sec - exiting 11:17:05 (4720): No heartbeat from core client for 30 sec - exiting 11:52:15 (3368): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:52:16 (3368): No heartbeat from core client for 30 sec - exiting 11:52:17 (3368): No heartbeat from core client for 30 sec - exiting 11:52:52 (3676): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:11:46 (1400): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:12:39 (5928): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:12:40 (5928): No heartbeat from core client for 30 sec - exiting 12:12:41 (5928): No heartbeat from core client for 30 sec - exiting 12:12:42 (5928): No heartbeat from core client for 30 sec - exiting 12:12:43 (5928): No heartbeat from core client for 30 sec - exiting 12:12:44 (5928): No heartbeat from core client for 30 sec - exiting 12:20:33 (5156): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:20:34 (5156): No heartbeat from core client for 30 sec - exiting 12:20:35 (5156): No heartbeat from core client for 30 sec - exiting 12:20:36 (5156): No heartbeat from core client for 30 sec - exiting 12:20:37 (5156): No heartbeat from core client for 30 sec - exiting 12:20:38 (5156): No heartbeat from core client for 30 sec - exiting 12:20:39 (5156): No heartbeat from core client for 30 sec - exiting 12:20:40 (5156): No heartbeat from core client for 30 sec - exiting 12:20:41 (5156): No heartbeat from core client for 30 sec - exiting 12:20:42 (5156): No heartbeat from core client for 30 sec - exiting 12:20:43 (5156): No heartbeat from core client for 30 sec - exiting 12:27:19 (1700): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:27:26 (1700): No heartbeat from core client for 30 sec - exiting 12:27:27 (1700): No heartbeat from core client for 30 sec - exiting 12:27:28 (1700): No heartbeat from core client for 30 sec - exiting 12:29:46 (5548): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:08:49 (2256): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:08:55 (2256): No heartbeat from core client for 30 sec - exiting 14:08:56 (2256): No heartbeat from core client for 30 sec - exiting 14:08:57 (2256): No heartbeat from core client for 30 sec - exiting 14:08:58 (2256): No heartbeat from core client for 30 sec - exiting 14:10:29 (5852): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:55:29 (6024): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:55:31 (6024): No heartbeat from core client for 30 sec - exiting 14:55:32 (6024): No heartbeat from core client for 30 sec - exiting 14:55:33 (6024): No heartbeat from core client for 30 sec - exiting 14:55:34 (6024): No heartbeat from core client for 30 sec - exiting 14:55:35 (6024): No heartbeat from core client for 30 sec - exiting 14:55:36 (6024): No heartbeat from core client for 30 sec - exiting 14:55:39 (6024): No heartbeat from core client for 30 sec - exiting 14:55:40 (6024): No heartbeat from core client for 30 sec - exiting 14:56:47 (5348): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:58:11 (2452): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2948, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3028, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3024, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4564, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3052, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3052, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1208, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3016, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3048, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3056, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3056, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3012, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3000, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3316, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3044, iMonCtr=1 Model crash detected, will try to restart... CController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3008, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2212, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2992, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2992, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3052, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3052, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3016, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3076, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3064, iMonCtr=1 Model crash detected, will try to restart... Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x77343AB3 read attempt to address 0x40F3119A Engaging BOINC Windows Runtime Debugger... Cannot serialize file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_3jx9_1940_40_008268207/dataout/shmem_restart.day Signal 11 received, exiting... Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
03 May 2013 05:32:43 | 1168062 | 15679817 | hadcm3n_3jx9_1940_40_008268207_1 | 259,200 | 375,328 | 1.4480 |
30 Apr 2013 10:05:55 | 1168062 | 15679817 | hadcm3n_3jx9_1940_40_008268207_1 | 233,280 | 336,959 | 1.4444 |
22 Apr 2013 17:06:31 | 1168062 | 15679817 | hadcm3n_3jx9_1940_40_008268207_1 | 207,360 | 299,500 | 1.4443 |
18 Apr 2013 18:04:08 | 1168062 | 15679817 | hadcm3n_3jx9_1940_40_008268207_1 | 181,440 | 261,517 | 1.4413 |
10 Apr 2013 17:31:25 | 1168062 | 15679817 | hadcm3n_3jx9_1940_40_008268207_1 | 155,520 | 224,241 | 1.4419 |
06 Apr 2013 15:55:53 | 1168062 | 15679817 | hadcm3n_3jx9_1940_40_008268207_1 | 129,600 | 186,818 | 1.4415 |
01 Apr 2013 17:26:42 | 1168062 | 15679817 | hadcm3n_3jx9_1940_40_008268207_1 | 103,680 | 150,059 | 1.4473 |
31 Mar 2013 11:45:03 | 1168062 | 15679817 | hadcm3n_3jx9_1940_40_008268207_1 | 77,760 | 111,944 | 1.4396 |
27 Mar 2013 16:56:05 | 1168062 | 15679817 | hadcm3n_3jx9_1940_40_008268207_1 | 51,840 | 73,195 | 1.4119 |
24 Mar 2013 11:37:56 | 1168062 | 15679817 | hadcm3n_3jx9_1940_40_008268207_1 | 25,920 | 37,144 | 1.4330 |
©2024 cpdn.org