Name | hadcm3n_yaf1_1900_40_007520849_1 |
Workunit | 7718324 |
Created | 28 Oct 2011, 13:09:57 UTC |
Sent | 3 Nov 2011, 8:20:22 UTC |
Report deadline | 2 Feb 2012, 15:47:33 UTC |
Received | 29 Jan 2012, 2:22:46 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | -1073741819 (0xC0000005) STATUS_ACCESS_VIOLATION |
Computer ID | 1033258 |
Run time | 12 days 10 hours 23 min 15 sec |
CPU time | 12 days 10 hours 23 min 15 sec |
Validate state | Invalid |
Credit | 6,220.80 |
Device peak FLOPS | 1.92 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>6.4.7</core_client_version> <![CDATA[ <message> - exit code -1073741819 (0xc0000005) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2220, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... BUFFIN: C I/O Error feof - Unit 63 - Return code = 16 BUFFIN: C I/O Error feof - Unit 64 - Return code = 16 BUFFIN: C I/O Error feof - Unit 65 - Return code = 16 BUFFIN: C I/O Error feof - Unit 66 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 Error converting file to netcdf: dataout/yaf1ko.pja3c10 Error converting file to netcdf: dataout/yaf1ko.pia3c10 Error converting file to netcdf: dataout/yaf1ko.pfa3c10 Error converting file to netcdf: dataout/yaf1ka.pha3c10 Error converting file to netcdf: dataout/yaf1ka.pga3c10 Error converting file to netcdf: dataout/yaf1ka.pea3c10 Error converting file to netcdf: dataout/yaf1ka.pda3c10 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=876, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2864, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2184, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2812, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2724, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2792, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2636, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3188, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1980, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3000, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2896, iMonCtr=1 Model crash detected, will try to restart... 18:39:43 (2876): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2768, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3260, iMonCtr=1 Model crash detected, will try to restart... CController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2672, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2652, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... BUFFIN: C I/O Error feof - Unit 63 - Return code = 16 BUFFIN: C I/O Error feof - Unit 64 - Return code = 16 BUFFIN: C I/O Error feof - Unit 65 - Return code = 16 BUFFIN: C I/O Error feof - Unit 66 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2712, iMonCtr=1 Model crash detected, will try to restart... CCPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3036, iMonCtr=1 Model crash detected, will try to restart... BUFFIN: C I/O Error feof - Unit 63 - Return code = 16 BUFFIN: C I/O Error feof - Unit 64 - Return code = 16 BUFFIN: C I/O Error feof - Unit 65 - Return code = 16 BUFFIN: C I/O Error feof - Unit 66 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2756, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2904, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3020, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2980, iMonCtr=1 Model crash detected, will try to restart... Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x00417B59 read attempt to address 0x46412058 Engaging BOINC Windows Runtime Debugger... Signal 11 received, exiting... Called boinc_finish ERROR: Invalid parameter detected in function (null). File: (null) Line: 0 ERROR: Expression: (null) </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
28 Jan 2012 07:26:34 | 1033258 | 13547386 | hadcm3n_yaf1_1900_40_007520849_1 | 518,400 | 1,058,499 | 2.0419 |
23 Jan 2012 11:45:56 | 1033258 | 13547386 | hadcm3n_yaf1_1900_40_007520849_1 | 492,480 | 1,005,246 | 2.0412 |
21 Jan 2012 11:21:18 | 1033258 | 13547386 | hadcm3n_yaf1_1900_40_007520849_1 | 466,560 | 949,476 | 2.0351 |
14 Jan 2012 06:01:37 | 1033258 | 13547386 | hadcm3n_yaf1_1900_40_007520849_1 | 440,640 | 897,470 | 2.0367 |
11 Jan 2012 03:57:51 | 1033258 | 13547386 | hadcm3n_yaf1_1900_40_007520849_1 | 414,720 | 841,586 | 2.0293 |
10 Jan 2012 01:46:46 | 1033258 | 13547386 | hadcm3n_yaf1_1900_40_007520849_1 | 388,800 | 789,667 | 2.0310 |
22 Dec 2011 02:06:26 | 1033258 | 13547386 | hadcm3n_yaf1_1900_40_007520849_1 | 362,880 | 735,993 | 2.0282 |
10 Dec 2011 01:14:15 | 1033258 | 13547386 | hadcm3n_yaf1_1900_40_007520849_1 | 336,960 | 682,833 | 2.0265 |
08 Dec 2011 03:21:28 | 1033258 | 13547386 | hadcm3n_yaf1_1900_40_007520849_1 | 311,040 | 630,214 | 2.0262 |
06 Dec 2011 23:52:41 | 1033258 | 13547386 | hadcm3n_yaf1_1900_40_007520849_1 | 285,120 | 577,317 | 2.0248 |
03 Dec 2011 02:19:46 | 1033258 | 13547386 | hadcm3n_yaf1_1900_40_007520849_1 | 259,200 | 525,944 | 2.0291 |
30 Nov 2011 23:32:25 | 1033258 | 13547386 | hadcm3n_yaf1_1900_40_007520849_1 | 233,280 | 472,962 | 2.0274 |
27 Nov 2011 22:32:17 | 1033258 | 13547386 | hadcm3n_yaf1_1900_40_007520849_1 | 207,360 | 420,672 | 2.0287 |
22 Nov 2011 06:18:09 | 1033258 | 13547386 | hadcm3n_yaf1_1900_40_007520849_1 | 181,440 | 366,185 | 2.0182 |
16 Nov 2011 01:11:32 | 1033258 | 13547386 | hadcm3n_yaf1_1900_40_007520849_1 | 155,520 | 312,525 | 2.0095 |
15 Nov 2011 21:49:26 | 1033258 | 13547386 | hadcm3n_yaf1_1900_40_007520849_1 | 129,600 | 260,239 | 2.0080 |
15 Nov 2011 21:49:26 | 1033258 | 13547386 | hadcm3n_yaf1_1900_40_007520849_1 | 103,680 | 208,581 | 2.0118 |
09 Nov 2011 02:50:09 | 1033258 | 13547386 | hadcm3n_yaf1_1900_40_007520849_1 | 77,760 | 156,980 | 2.0188 |
07 Nov 2011 10:14:49 | 1033258 | 13547386 | hadcm3n_yaf1_1900_40_007520849_1 | 51,840 | 105,767 | 2.0403 |
05 Nov 2011 00:49:14 | 1033258 | 13547386 | hadcm3n_yaf1_1900_40_007520849_1 | 25,920 | 54,507 | 2.1029 |
©2024 cpdn.org