Name | hadcm3n_3i99_1940_40_008264776_0 |
Workunit | 8419900 |
Created | 21 Dec 2012, 10:07:12 UTC |
Sent | 23 Dec 2012, 0:48:38 UTC |
Report deadline | 24 Mar 2013, 8:15:49 UTC |
Received | 21 Mar 2013, 15:56:44 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 193 (0x000000C1) EXIT_SIGNAL |
Computer ID | 1186987 |
Run time | 10 days 4 hours 59 min 39 sec |
CPU time | 8 days 13 hours 24 min 25 sec |
Validate state | Invalid |
Credit | 6,220.80 |
Device peak FLOPS | 3.13 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <message> - exit code 193 (0xc1) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4812, iMonCtr=1 Model crash detected, will try to restart... 13:37:15 (7348): No heartbeat from core client for 30 sec - exiting 13:37:16 (7348): No heartbeat from core client for 30 sec - exiting 13:37:17 (7348): No heartbeat from core client for 30 sec - exiting 13:37:18 (7348): No heartbeat from core client for 30 sec - exiting 13:37:19 (7348): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:37:20 (7348): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 08:12:57 (2112): No heartbeat from core client for 30 sec - exiting 08:12:59 (2112): No heartbeat from core client for 30 sec - exiting 08:13:00 (2112): No heartbeat from core client for 30 sec - exiting 08:13:01 (2112): No heartbeat from core client for 30 sec - exiting 08:13:02 (2112): No heartbeat from core client for 30 sec - exiting 08:13:03 (2112): No heartbeat from core client for 30 sec - exiting 08:13:04 (2112): No heartbeat from core client for 30 sec - exiting 08:13:06 (2112): No heartbeat from core client for 30 sec - exiting 08:13:07 (2112): No heartbeat from core client for 30 sec - exiting 08:13:08 (2112): No heartbeat from core client for 30 sec - exiting 08:13:09 (2112): No heartbeat from core client for 30 sec - exiting 08:13:10 (2112): No heartbeat from core client for 30 sec - exiting 08:13:11 (2112): No heartbeat from core client for 30 sec - exiting 08:13:12 (2112): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 05:48:26 (3868): No heartbeat from core client for 30 sec - exiting 05:48:27 (3868): No heartbeat from core client for 30 sec - exiting 05:48:28 (3868): No heartbeat from core client for 30 sec - exiting 05:48:29 (3868): No heartbeat from core client for 30 sec - exiting 05:48:30 (3868): No heartbeat from core client for 30 sec - exiting 05:48:31 (3868): No heartbeat from core client for 30 sec - exiting 05:48:32 (3868): No heartbeat from core client for 30 sec - exiting 05:48:34 (3868): No heartbeat from core client for 30 sec - exiting 05:48:35 (3868): No heartbeat from core client for 30 sec - exiting 05:48:36 (3868): No heartbeat from core client for 30 sec - exiting 05:48:37 (3868): No heartbeat from core client for 30 sec - exiting 05:48:38 (3868): No heartbeat from core client for 30 sec - exiting 05:48:39 (3868): No heartbeat from core client for 30 sec - exiting 05:48:40 (3868): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:48:41 (3868): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 23:57:01 (6740): No heartbeat from core client for 30 sec - exiting 23:57:02 (6740): No heartbeat from core client for 30 sec - exiting 23:57:03 (6740): No heartbeat from core client for 30 sec - exiting 23:57:04 (6740): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 00:34:09 (1592): No heartbeat from core client for 30 sec - exiting 00:34:10 (1592): No heartbeat from core client for 30 sec - exiting 00:34:11 (1592): No heartbeat from core client for 30 sec - exiting 00:34:12 (1592): No heartbeat from core client for 30 sec - exiting 00:34:14 (1592): No heartbeat from core client for 30 sec - exiting 00:34:15 (1592): No heartbeat from core client for 30 sec - exiting 00:34:16 (1592): No heartbeat from core client for 30 sec - exiting 00:34:17 (1592): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=40320, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7228, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1316, iMonCtr=1 Model crash detected, will try to restart... 02:13:54 (5376): No heartbeat from core client for 30 sec - exiting 02:13:55 (5376): No heartbeat from core client for 30 sec - exiting 02:13:56 (5376): No heartbeat from core client for 30 sec - exiting 02:13:58 (5376): No heartbeat from core client for 30 sec - exiting 02:13:59 (5376): No heartbeat from core client for 30 sec - exiting 02:14:00 (5376): No heartbeat from core client for 30 sec - exiting 02:14:01 (5376): No heartbeat from core client for 30 sec - exiting 02:14:02 (5376): No heartbeat from core client for 30 sec - exiting 02:14:03 (5376): No heartbeat from core client for 30 sec - exiting 02:14:04 (5376): No heartbeat from core client for 30 sec - exiting 02:14:05 (5376): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 01:47:39 (7532): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CCPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8120, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7368, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CCPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9400, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 06:08:14 (24100): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4940, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7996, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3560, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x77065EAB read attempt to address 0x40F94246 Engaging BOINC Windows Runtime Debugger... No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14848, selfPID=14848, iMonCtr=1 Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x77065EAB read attempt to address 0x40F94246 Engaging BOINC Windows Runtime Debugger... Cannot serialize file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_3i99_1940_40_008264776/dataout/shmem_restart.day Signal 11 received, exiting... Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
20 Mar 2013 03:00:42 | 1186987 | 15493809 | hadcm3n_3i99_1940_40_008264776_0 | 518,400 | 735,561 | 1.4189 |
17 Mar 2013 06:08:26 | 1186987 | 15493809 | hadcm3n_3i99_1940_40_008264776_0 | 492,480 | 701,831 | 1.4251 |
13 Mar 2013 16:00:38 | 1186987 | 15493809 | hadcm3n_3i99_1940_40_008264776_0 | 466,560 | 668,099 | 1.4320 |
04 Mar 2013 15:45:42 | 1186987 | 15493809 | hadcm3n_3i99_1940_40_008264776_0 | 440,640 | 632,755 | 1.4360 |
03 Mar 2013 05:05:03 | 1186987 | 15493809 | hadcm3n_3i99_1940_40_008264776_0 | 414,720 | 597,520 | 1.4408 |
27 Feb 2013 16:30:52 | 1186987 | 15493809 | hadcm3n_3i99_1940_40_008264776_0 | 388,800 | 562,625 | 1.4471 |
24 Feb 2013 07:04:42 | 1186987 | 15493809 | hadcm3n_3i99_1940_40_008264776_0 | 362,880 | 527,195 | 1.4528 |
17 Feb 2013 05:01:13 | 1186987 | 15493809 | hadcm3n_3i99_1940_40_008264776_0 | 336,960 | 491,771 | 1.4594 |
12 Feb 2013 15:01:05 | 1186987 | 15493809 | hadcm3n_3i99_1940_40_008264776_0 | 311,040 | 456,483 | 1.4676 |
07 Feb 2013 16:41:56 | 1186987 | 15493809 | hadcm3n_3i99_1940_40_008264776_0 | 285,120 | 421,961 | 1.4799 |
03 Feb 2013 10:13:08 | 1186987 | 15493809 | hadcm3n_3i99_1940_40_008264776_0 | 259,200 | 387,245 | 1.4940 |
31 Jan 2013 18:26:37 | 1186987 | 15493809 | hadcm3n_3i99_1940_40_008264776_0 | 233,280 | 351,037 | 1.5048 |
28 Jan 2013 16:55:50 | 1186987 | 15493809 | hadcm3n_3i99_1940_40_008264776_0 | 207,360 | 314,228 | 1.5154 |
26 Jan 2013 05:54:27 | 1186987 | 15493809 | hadcm3n_3i99_1940_40_008264776_0 | 181,440 | 276,471 | 1.5238 |
24 Jan 2013 17:31:14 | 1186987 | 15493809 | hadcm3n_3i99_1940_40_008264776_0 | 155,520 | 238,903 | 1.5362 |
20 Jan 2013 18:28:18 | 1186987 | 15493809 | hadcm3n_3i99_1940_40_008264776_0 | 129,600 | 198,849 | 1.5343 |
20 Jan 2013 06:04:10 | 1186987 | 15493809 | hadcm3n_3i99_1940_40_008264776_0 | 103,680 | 160,922 | 1.5521 |
14 Jan 2013 02:47:15 | 1186987 | 15493809 | hadcm3n_3i99_1940_40_008264776_0 | 77,760 | 123,011 | 1.5819 |
12 Jan 2013 07:44:05 | 1186987 | 15493809 | hadcm3n_3i99_1940_40_008264776_0 | 51,840 | 79,702 | 1.5375 |
24 Dec 2012 08:37:49 | 1186987 | 15493809 | hadcm3n_3i99_1940_40_008264776_0 | 25,920 | 40,465 | 1.5611 |
©2024 cpdn.org