Name | hadcm3n_y8q8_1900_40_007344170_1 |
Workunit | 7541600 |
Created | 6 Jul 2011, 13:22:50 UTC |
Sent | 22 Jul 2011, 17:56:46 UTC |
Report deadline | 22 Oct 2011, 1:23:57 UTC |
Received | 25 Oct 2011, 16:38:18 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | -1073741819 (0xC0000005) STATUS_ACCESS_VIOLATION |
Computer ID | 1097461 |
Run time | 9 days 12 hours 15 min 11 sec |
CPU time | 9 days 5 hours 54 min 56 sec |
Validate state | Invalid |
Credit | 6,220.80 |
Device peak FLOPS | 3.25 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>6.12.34</core_client_version> <![CDATA[ <message> - exit code -1073741819 (0xc0000005) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2672, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6044, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 13:37:11 (3560): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:42:54 (3560): No heartbeat from core client for 30 sec - exiting 13:42:55 (3560): No heartbeat from core client for 30 sec - exiting 13:42:56 (3560): No heartbeat from core client for 30 sec - exiting 13:42:57 (3560): No heartbeat from core client for 30 sec - exiting 13:42:58 (3560): No heartbeat from core client for 30 sec - exiting 13:42:59 (3560): No heartbeat from core client for 30 sec - exiting 13:43:00 (3560): No heartbeat from core client for 30 sec - exiting 13:43:01 (3560): No heartbeat from core client for 30 sec - exiting 13:43:02 (3560): No heartbeat from core client for 30 sec - exiting 13:43:03 (3560): No heartbeat from core client for 30 sec - exiting 13:43:56 (4244): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:53:20 (1636): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:54:02 (1636): No heartbeat from core client for 30 sec - exiting 13:54:03 (1636): No heartbeat from core client for 30 sec - exiting 13:54:04 (1636): No heartbeat from core client for 30 sec - exiting 13:54:05 (1636): No heartbeat from core client for 30 sec - exiting 13:54:06 (1636): No heartbeat from core client for 30 sec - exiting 13:54:07 (1636): No heartbeat from core client for 30 sec - exiting 13:54:08 (1636): No heartbeat from core client for 30 sec - exiting 13:54:09 (1636): No heartbeat from core client for 30 sec - exiting 13:54:10 (1636): No heartbeat from core client for 30 sec - exiting 13:54:11 (1636): No heartbeat from core client for 30 sec - exiting 14:01:37 (3760): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:07:51 (1616): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6076, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6076, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6076, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6076, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6076, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6076, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6076, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6076, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6076, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6076, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6076, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6076, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6076, iMonCtr=1 Model crash detected, will try to restart... BUFFIN: C I/O Error feof - Unit 63 - Return code = 16 BUFFIN: C I/O Error feof - Unit 64 - Return code = 16 BUFFIN: C I/O Error feof - Unit 65 - Return code = 16 BUFFIN: C I/O Error feof - Unit 66 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 Error converting file to netcdf: dataout/y8q8ko.pja7c10 Error converting file to netcdf: dataout/y8q8ko.pia7c10 Error converting file to netcdf: dataout/y8q8ko.pfa7c10 Error converting file to netcdf: dataout/y8q8ka.pha7c10 Error converting file to netcdf: dataout/y8q8ka.pga7c10 Error converting file to netcdf: dataout/y8q8ka.pea7c10 Error converting file to netcdf: dataout/y8q8ka.pda7c10 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4352, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5228, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x77E33072 read attempt to address 0x00000000 Engaging BOINC Windows Runtime Debugger... Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x771B3072 read attempt to address 0x00000000 Engaging BOINC Windows Runtime Debugger... </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
31 Oct 2011 14:18:16 | 1097461 | 13092206 | hadcm3n_y8q8_1900_40_007344170_1 | 518,400 | 793,549 | 1.5308 |
18 Oct 2011 08:16:00 | 1097461 | 13092206 | hadcm3n_y8q8_1900_40_007344170_1 | 492,480 | 752,917 | 1.5288 |
16 Oct 2011 05:49:45 | 1097461 | 13092206 | hadcm3n_y8q8_1900_40_007344170_1 | 466,560 | 712,447 | 1.5270 |
14 Oct 2011 22:23:23 | 1097461 | 13092206 | hadcm3n_y8q8_1900_40_007344170_1 | 440,640 | 672,180 | 1.5255 |
08 Oct 2011 16:32:49 | 1097461 | 13092206 | hadcm3n_y8q8_1900_40_007344170_1 | 414,720 | 632,205 | 1.5244 |
03 Oct 2011 17:51:06 | 1097461 | 13092206 | hadcm3n_y8q8_1900_40_007344170_1 | 388,800 | 592,498 | 1.5239 |
01 Oct 2011 12:01:18 | 1097461 | 13092206 | hadcm3n_y8q8_1900_40_007344170_1 | 362,880 | 552,101 | 1.5214 |
24 Sep 2011 05:24:36 | 1097461 | 13092206 | hadcm3n_y8q8_1900_40_007344170_1 | 336,960 | 510,492 | 1.5150 |
17 Sep 2011 14:43:18 | 1097461 | 13092206 | hadcm3n_y8q8_1900_40_007344170_1 | 311,040 | 469,660 | 1.5100 |
13 Sep 2011 08:44:35 | 1097461 | 13092206 | hadcm3n_y8q8_1900_40_007344170_1 | 285,120 | 429,274 | 1.5056 |
11 Sep 2011 09:52:28 | 1097461 | 13092206 | hadcm3n_y8q8_1900_40_007344170_1 | 259,200 | 389,668 | 1.5033 |
07 Sep 2011 00:36:55 | 1097461 | 13092206 | hadcm3n_y8q8_1900_40_007344170_1 | 233,280 | 349,724 | 1.4992 |
04 Sep 2011 18:34:39 | 1097461 | 13092206 | hadcm3n_y8q8_1900_40_007344170_1 | 207,360 | 309,689 | 1.4935 |
26 Aug 2011 05:50:06 | 1097461 | 13092206 | hadcm3n_y8q8_1900_40_007344170_1 | 181,440 | 270,611 | 1.4915 |
23 Aug 2011 11:19:27 | 1097461 | 13092206 | hadcm3n_y8q8_1900_40_007344170_1 | 155,520 | 230,663 | 1.4832 |
19 Aug 2011 10:21:26 | 1097461 | 13092206 | hadcm3n_y8q8_1900_40_007344170_1 | 129,600 | 192,397 | 1.4845 |
13 Aug 2011 10:25:34 | 1097461 | 13092206 | hadcm3n_y8q8_1900_40_007344170_1 | 103,680 | 154,216 | 1.4874 |
09 Aug 2011 09:18:16 | 1097461 | 13092206 | hadcm3n_y8q8_1900_40_007344170_1 | 77,760 | 116,276 | 1.4953 |
30 Jul 2011 12:43:09 | 1097461 | 13092206 | hadcm3n_y8q8_1900_40_007344170_1 | 51,840 | 76,860 | 1.4826 |
29 Jul 2011 10:05:38 | 1097461 | 13092206 | hadcm3n_y8q8_1900_40_007344170_1 | 25,920 | 38,549 | 1.4872 |
©2024 cpdn.org