Name | hadcm3n_z97r_1920_40_008244800_3 |
Workunit | 8399924 |
Created | 28 Mar 2013, 11:22:36 UTC |
Sent | 28 Mar 2013, 11:22:39 UTC |
Report deadline | 27 Jun 2013, 18:49:50 UTC |
Received | 9 Jun 2013, 9:36:07 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 193 (0x000000C1) EXIT_SIGNAL |
Computer ID | 1229098 |
Run time | 43 days 16 hours 10 min 22 sec |
CPU time | 41 days 23 hours 19 min 15 sec |
Validate state | Invalid |
Credit | 12,441.60 |
Device peak FLOPS | 1.53 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.64</core_client_version> <![CDATA[ <message> (unknown error) - exit code 193 (0xc1) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 09:37:32 (4544): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3752, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4204, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4532, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 14:00:31 (6076): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1300, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1300, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1300, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 09:50:19 (5496): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 04:06:04 (4196): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:11:25 (6560): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 04:25:42 (2496): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 04:36:49 (3040): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 04:51:04 (6208): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 05:10:48 (5936): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 05:42:54 (7092): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 05:44:21 (4680): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 05:45:26 (4080): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 06:08:24 (3044): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 06:15:57 (6924): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
Suspended CPDN Monitor - Suspend request from BOINC... 06:29:55 (392): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 06:43:08 (3204): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 10:36:33 (5296): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 10:36:35 (5296): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1720, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5424, iMonCtr=1 Model crash detected, will try to restart... 
Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x7715FF2B write attempt to address 0xFFFFFFF8 Engaging BOINC Windows Runtime Debugger... Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x76FB7373 read attempt to address 0xFFFFFFF8 Engaging BOINC Windows Runtime Debugger... Cannot serialize file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_z97r_1920_40_008244800/dataout/shmem_restart.day Signal 11 received, exiting... Called boinc_finish </stderr_txt> ]]> |
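The run time, CPU time and exit status above can be cross-checked with a few lines of arithmetic. The following is a minimal sketch, not part of the original page: the helper name `duration_to_seconds` and the unit table are assumptions introduced for illustration, and the duration strings are copied from the "Run time" and "CPU time" rows.

```python
import re

# Seconds per unit, keyed by the singular unit stem used on the result page.
UNIT_SECONDS = {"day": 86400, "hour": 3600, "min": 60, "sec": 1}


def duration_to_seconds(text):
    """Convert a string such as '43 days 16 hours 10 min 22 sec' to seconds."""
    total = 0
    for value, unit in re.findall(r"(\d+)\s*(day|hour|min|sec)", text):
        total += int(value) * UNIT_SECONDS[unit]
    return total


run_time = duration_to_seconds("43 days 16 hours 10 min 22 sec")
cpu_time = duration_to_seconds("41 days 23 hours 19 min 15 sec")

print(run_time, cpu_time)
# Fraction of wall-clock run time during which the task actually used the CPU.
print(f"CPU utilisation: {cpu_time / run_time:.1%}")
# The decimal exit status and its hexadecimal form are the same number.
print(hex(193))
```

The CPU time in seconds lands close to the CPU time reported by the most recent trickle, which suggests the task failed shortly after that trickle was sent.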
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
08 Jun 2013 06:47:48 | 1229098 | 15688590 | hadcm3n_z97r_1920_40_008244800_3 | 1,036,800 | 3,623,329 | 3.4947 |
05 Jun 2013 16:49:17 | 1229098 | 15688590 | hadcm3n_z97r_1920_40_008244800_3 | 1,010,880 | 3,504,081 | 3.4664 |
03 Jun 2013 07:07:16 | 1229098 | 15688590 | hadcm3n_z97r_1920_40_008244800_3 | 984,960 | 3,403,008 | 3.4550 |
28 May 2013 08:53:47 | 1229098 | 15688590 | hadcm3n_z97r_1920_40_008244800_3 | 959,040 | 3,340,950 | 3.4836 |
26 May 2013 08:35:00 | 1229098 | 15688590 | hadcm3n_z97r_1920_40_008244800_3 | 933,120 | 3,249,606 | 3.4825 |
23 May 2013 11:18:56 | 1229098 | 15688590 | hadcm3n_z97r_1920_40_008244800_3 | 907,200 | 3,068,654 | 3.3826 |
21 May 2013 16:16:42 | 1229098 | 15688590 | hadcm3n_z97r_1920_40_008244800_3 | 881,280 | 2,999,478 | 3.4035 |
20 May 2013 18:02:31 | 1229098 | 15688590 | hadcm3n_z97r_1920_40_008244800_3 | 855,360 | 2,864,568 | 3.3490 |
17 May 2013 07:09:42 | 1229098 | 15688590 | hadcm3n_z97r_1920_40_008244800_3 | 829,440 | 2,713,014 | 3.2709 |
16 May 2013 07:57:01 | 1229098 | 15688590 | hadcm3n_z97r_1920_40_008244800_3 | 803,520 | 2,652,145 | 3.3007 |
15 May 2013 09:40:50 | 1229098 | 15688590 | hadcm3n_z97r_1920_40_008244800_3 | 777,600 | 2,581,519 | 3.3199 |
14 May 2013 06:27:53 | 1229098 | 15688590 | hadcm3n_z97r_1920_40_008244800_3 | 751,680 | 2,500,570 | 3.3266 |
13 May 2013 05:57:05 | 1229098 | 15688590 | hadcm3n_z97r_1920_40_008244800_3 | 725,760 | 2,418,247 | 3.3320 |
12 May 2013 07:30:37 | 1229098 | 15688590 | hadcm3n_z97r_1920_40_008244800_3 | 699,840 | 2,337,220 | 3.3396 |
11 May 2013 07:07:34 | 1229098 | 15688590 | hadcm3n_z97r_1920_40_008244800_3 | 673,920 | 2,252,542 | 3.3424 |
10 May 2013 06:05:24 | 1229098 | 15688590 | hadcm3n_z97r_1920_40_008244800_3 | 648,000 | 2,175,918 | 3.3579 |
30 Apr 2013 06:04:34 | 1229098 | 15688590 | hadcm3n_z97r_1920_40_008244800_3 | 622,080 | 2,091,267 | 3.3617 |
29 Apr 2013 07:02:10 | 1229098 | 15688590 | hadcm3n_z97r_1920_40_008244800_3 | 596,160 | 2,010,829 | 3.3730 |
28 Apr 2013 16:16:13 | 1229098 | 15688590 | hadcm3n_z97r_1920_40_008244800_3 | 570,240 | 1,931,262 | 3.3868 |
27 Apr 2013 06:57:25 | 1229098 | 15688590 | hadcm3n_z97r_1920_40_008244800_3 | 544,320 | 1,850,231 | 3.3992 |
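The "Average (sec/TS)" column appears to be the cumulative CPU time divided by the cumulative timestep count. The sketch below checks that against the two most recent rows and also computes the rate over the interval between them. The function names `parse_row` and `interval_rate` are hypothetical, and the assumption that the column is a simple quotient is inferred from the numbers, not stated by the page.

```python
# Two most recent trickle rows, copied from the table above.
rows = [
    "08 Jun 2013 06:47:48 | 1229098 | 15688590 | hadcm3n_z97r_1920_40_008244800_3 | 1,036,800 | 3,623,329 | 3.4947 |",
    "05 Jun 2013 16:49:17 | 1229098 | 15688590 | hadcm3n_z97r_1920_40_008244800_3 | 1,010,880 | 3,504,081 | 3.4664 |",
]


def parse_row(line):
    """Return (timestep, cpu_seconds, reported_average) from one table row."""
    cells = [c.strip() for c in line.strip().strip("|").split("|")]
    timestep = int(cells[4].replace(",", ""))
    cpu_sec = int(cells[5].replace(",", ""))
    reported = float(cells[6])
    return timestep, cpu_sec, reported


def interval_rate(newer, older):
    """CPU seconds per timestep over the interval between two trickles."""
    return (newer[1] - older[1]) / (newer[0] - older[0])


parsed = [parse_row(r) for r in rows]

for timestep, cpu_sec, reported in parsed:
    # Cumulative average: total CPU seconds divided by total timesteps.
    print(timestep, round(cpu_sec / timestep, 4), reported)

# Rate over the last interval only, which can differ from the cumulative average.
print(round(interval_rate(parsed[0], parsed[1]), 4))
```

Comparing the interval rate with the cumulative average is one way to spot a host that slowed down late in the run, which the rising averages in the final rows hint at here.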