Name | hadcm3n_o31d_1940_40_007301659_1 |
Workunit | 7499083 |
Created | 22 Jun 2011, 14:23:49 UTC |
Sent | 22 Jun 2011, 14:23:53 UTC |
Report deadline | 21 Sep 2011, 21:51:04 UTC |
Received | 5 Aug 2011, 12:18:51 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | -1073741819 (0xC0000005) STATUS_ACCESS_VIOLATION |
Computer ID | 1140451 |
Run time | 8 days 20 hours 45 min 52 sec |
CPU time | 8 days 13 hours 56 min 39 sec |
Validate state | Invalid |
Credit | 12,441.60 |
Device peak FLOPS | 4.40 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <message> - exit code -1073741819 (0xc0000005) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3532, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 20:59:02 (4608): No heartbeat from core client for 30 sec - exiting 20:59:03 (4608): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 02:14:31 (4304): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 16:49:56 (4148): No heartbeat from core client for 30 sec - exiting 16:49:57 (4148): No heartbeat from core client for 30 sec - exiting 16:49:58 (4148): No heartbeat from core client for 30 sec - exiting 16:49:59 (4148): No heartbeat from core client for 30 sec - exiting 16:50:00 (4148): No heartbeat from core client for 30 sec - exiting 16:50:01 (4148): No heartbeat from core client for 30 sec - exiting 16:50:02 (4148): No heartbeat from core client for 30 sec - exiting 16:50:03 (4148): No heartbeat from core client for 30 sec - exiting 16:50:04 (4148): No heartbeat from core client for 30 sec - exiting 16:50:05 (4148): No heartbeat from core client for 30 sec - exiting 16:50:06 (4148): No heartbeat from core client for 30 sec - exiting 16:50:07 (4148): No heartbeat from core client for 30 sec - exiting 16:50:08 (4148): No heartbeat from core client for 30 sec - exiting 16:50:09 (4148): No heartbeat from core client for 30 sec - exiting 16:50:10 (4148): No heartbeat from core client for 30 sec - exiting 16:50:11 (4148): No heartbeat from core client for 30 sec - exiting 16:50:12 (4148): No heartbeat from core client for 30 sec - exiting 16:50:13 (4148): No heartbeat from core client for 30 sec - exiting 16:50:14 (4148): No heartbeat from core client for 30 sec - exiting 16:50:15 (4148): No heartbeat from core client for 30 sec - exiting 16:50:16 (4148): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=0, iMonCtr=1 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4644, iMonCtr=1 Model crash detected, will try to restart... BUFFIN: C I/O Error feof - Unit 63 - Return code = 16 BUFFIN: C I/O Error feof - Unit 64 - Return code = 16 BUFFIN: C I/O Error feof - Unit 65 - Return code = 16 BUFFIN: C I/O Error feof - Unit 66 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 Error converting file to netcdf: dataout/o31dko.pje5c10 Error converting file to netcdf: dataout/o31dko.pie5c10 Error converting file to netcdf: dataout/o31dko.pfe5c10 Error converting file to netcdf: dataout/o31dka.phe5c10 Error converting file to netcdf: dataout/o31dka.pge5c10 Error converting file to netcdf: dataout/o31dka.pee5c10 Error converting file to netcdf: dataout/o31dka.pde5c10 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4328, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2504, selfPID=2504, iMonCtr=1 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 02:03:01 (4832): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1192, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4268, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1632, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x777ECD8B write attempt to address 0x4340CE54 Engaging BOINC Windows Runtime Debugger... Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4788, selfPID=4788, iMonCtr=1 </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
05 Aug 2011 12:23:36 | 1140451 | 12995351 | hadcm3n_o31d_1940_40_007301659_1 | 1,036,800 | 741,391 | 0.7151 |
04 Aug 2011 20:32:47 | 1140451 | 12995351 | hadcm3n_o31d_1940_40_007301659_1 | 1,010,880 | 722,938 | 0.7152 |
03 Aug 2011 19:06:17 | 1140451 | 12995351 | hadcm3n_o31d_1940_40_007301659_1 | 984,960 | 704,674 | 0.7154 |
31 Jul 2011 00:30:53 | 1140451 | 12995351 | hadcm3n_o31d_1940_40_007301659_1 | 959,040 | 686,004 | 0.7153 |
30 Jul 2011 19:33:23 | 1140451 | 12995351 | hadcm3n_o31d_1940_40_007301659_1 | 933,120 | 666,611 | 0.7144 |
29 Jul 2011 22:55:58 | 1140451 | 12995351 | hadcm3n_o31d_1940_40_007301659_1 | 907,200 | 647,238 | 0.7134 |
28 Jul 2011 22:14:18 | 1140451 | 12995351 | hadcm3n_o31d_1940_40_007301659_1 | 881,280 | 628,742 | 0.7134 |
28 Jul 2011 10:00:41 | 1140451 | 12995351 | hadcm3n_o31d_1940_40_007301659_1 | 855,360 | 610,218 | 0.7134 |
27 Jul 2011 18:05:06 | 1140451 | 12995351 | hadcm3n_o31d_1940_40_007301659_1 | 829,440 | 591,854 | 0.7136 |
26 Jul 2011 20:01:57 | 1140451 | 12995351 | hadcm3n_o31d_1940_40_007301659_1 | 803,520 | 573,413 | 0.7136 |
26 Jul 2011 09:43:30 | 1140451 | 12995351 | hadcm3n_o31d_1940_40_007301659_1 | 777,600 | 554,948 | 0.7137 |
26 Jul 2011 09:43:30 | 1140451 | 12995351 | hadcm3n_o31d_1940_40_007301659_1 | 751,680 | 536,700 | 0.7140 |
26 Jul 2011 09:43:30 | 1140451 | 12995351 | hadcm3n_o31d_1940_40_007301659_1 | 725,760 | 518,172 | 0.7140 |
26 Jul 2011 09:43:30 | 1140451 | 12995351 | hadcm3n_o31d_1940_40_007301659_1 | 699,840 | 499,559 | 0.7138 |
26 Jul 2011 09:43:30 | 1140451 | 12995351 | hadcm3n_o31d_1940_40_007301659_1 | 673,920 | 480,955 | 0.7137 |
26 Jul 2011 09:43:30 | 1140451 | 12995351 | hadcm3n_o31d_1940_40_007301659_1 | 648,000 | 462,053 | 0.7130 |
25 Jul 2011 18:55:12 | 1140451 | 12995351 | hadcm3n_o31d_1940_40_007301659_1 | 622,080 | 443,370 | 0.7127 |
25 Jul 2011 18:48:35 | 1140451 | 12995351 | hadcm3n_o31d_1940_40_007301659_1 | 596,160 | 424,787 | 0.7125 |
25 Jul 2011 18:18:46 | 1140451 | 12995351 | hadcm3n_o31d_1940_40_007301659_1 | 570,240 | 406,286 | 0.7125 |
25 Jul 2011 17:20:31 | 1140451 | 12995351 | hadcm3n_o31d_1940_40_007301659_1 | 544,320 | 387,886 | 0.7126 |
©2024 climateprediction.net