Name | hadcm3n_t12w_1940_40_007546336_0 |
Workunit | 7743568 |
Created | 29 Nov 2011, 4:50:42 UTC |
Sent | 30 Nov 2011, 19:20:16 UTC |
Report deadline | 1 Mar 2012, 2:47:27 UTC |
Received | 26 Mar 2012, 0:40:48 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | -1073741819 (0xC0000005) STATUS_ACCESS_VIOLATION |
Computer ID | 1177889 |
Run time | 6 days 16 hours 52 min 13 sec |
CPU time | 5 days 23 hours 26 min 2 sec |
Validate state | Invalid |
Credit | 3,110.40 |
Device peak FLOPS | 2.30 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>6.12.33</core_client_version> <![CDATA[ <message> - exit code -1073741819 (0xc0000005) </message> <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3012, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4700, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4368, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4380, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4796, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 22:07:22 (2432): Can't acquire lockfile (32) - waiting 35s CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5716, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3432, iMonCtr=1 Model crash detected, will try to restart... 
21:28:40 (4708): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:52:43 (5100): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:47:13 (3032): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:37:08 (4816): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2760, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2448, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 20:55:10 (5352): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 09:59:28 (1716): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5848, iMonCtr=1 Model crash detected, will try to restart... 19:39:56 (4652): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:30:43 (5920): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:09:08 (3780): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
CController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4396, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4776, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4412, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 21:06:37 (4744): No heartbeat from core client for 30 sec - exiting 21:06:39 (4744): No heartbeat from core client for 30 sec - exiting 21:06:40 (4744): No heartbeat from core client for 30 sec - exiting 21:06:41 (4744): No heartbeat from core client for 30 sec - exiting 21:06:42 (4744): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:06:43 (4744): No heartbeat from core client for 30 sec - exiting 22:51:26 (4528): No heartbeat from core client for 30 sec - exiting 22:51:28 (4528): No heartbeat from core client for 30 sec - exiting 22:51:29 (4528): No heartbeat from core client for 30 sec - exiting 22:51:30 (4528): No heartbeat from core client for 30 sec - exiting 22:51:31 (4528): No heartbeat from core client for 30 sec - exiting 22:51:32 (4528): No heartbeat from core client for 30 sec - exiting 22:51:33 (4528): No heartbeat from core client for 30 sec - exiting 22:51:34 (4528): No heartbeat from core client for 30 sec - exiting 22:51:35 (4528): No heartbeat from core client for 30 sec - exiting 22:51:36 (4528): No heartbeat from core client for 30 sec - exiting 22:51:38 (4528): No heartbeat from core client for 30 sec - exiting 22:51:39 (4528): No heartbeat from core client for 30 sec - exiting 22:51:40 (4528): No heartbeat from core client for 30 sec - exiting 22:51:41 (4528): No heartbeat from core client for 30 sec - 
exiting 22:51:42 (4528): No heartbeat from core client for 30 sec - exiting 22:51:43 (4528): No heartbeat from core client for 30 sec - exiting 22:51:44 (4528): No heartbeat from core client for 30 sec - exiting 22:51:45 (4528): No heartbeat from core client for 30 sec - exiting 22:51:46 (4528): No heartbeat from core client for 30 sec - exiting 22:51:47 (4528): No heartbeat from core client for 30 sec - exiting 22:51:49 (4528): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:39:29 (4868): No heartbeat from core client for 30 sec - exiting 17:39:30 (4868): No heartbeat from core client for 30 sec - exiting 17:39:31 (4868): No heartbeat from core client for 30 sec - exiting 17:39:32 (4868): No heartbeat from core client for 30 sec - exiting 17:39:33 (4868): No heartbeat from core client for 30 sec - exiting 17:39:34 (4868): No heartbeat from core client for 30 sec - exiting 17:39:35 (4868): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1504, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4476, iMonCCPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3676, iMonCtr=1 Model crash detected, will try to restart... 09:52:28 (4328): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 09:24:42 (4804): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
21:34:02 (4572): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5116, iMonCtr=1 Model crash detected, will try to restart... 07:47:38 (772): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:26:28 (768): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4056, iMonCtr=1 Model crash detected, will try to restart... 21:01:07 (4276): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:00:21 (4596): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=244, iMonCtr=1 Model crash detected, will try to restart... 23:39:46 (4568): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:06:51 (5028): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:14:50 (3576): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:43:57 (2512): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4236, iMonCtr=1 Model crash detected, will try to restart... 21:18:24 (3992): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16 BUFFIN: C I/O Error feof - Unit 64 - Return code = 16 BUFFIN: C I/O Error feof - Unit 65 - Return code = 16 BUFFIN: C I/O Error feof - Unit 66 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 Error converting file to netcdf: dataout/t12wko.pje8c10 Error converting file to netcdf: dataout/t12wko.pie8c10 Error converting file to netcdf: dataout/t12wko.pfe8c10 Error converting file to netcdf: dataout/t12wka.phe8c10 Error converting file to netcdf: dataout/t12wka.pge8c10 Error converting file to netcdf: dataout/t12wka.pee8c10 Error converting file to netcdf: dataout/t12wka.pde8c10 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4680, iMonCtr=1 Model crash detected, will try to restart... 09:10:40 (1788): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:53:16 (4520): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4368, iMonCtr=1 Model crash detected, will try to restart... 15:57:36 (2072): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 09:07:50 (3216): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4464, iMonCtr=1 Model crash detected, will try to restart... 16:43:45 (5388): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
08:36:01 (4800): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:51:50 (3504): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 09:17:51 (4980): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:09:34 (4844): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5276, iMonCtr=1 Model crash detected, will try to restart... 21:51:28 (240): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5284, iMonCtr=1 Model crash detected, will try to restart... 08:32:02 (3964): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:17:19 (4908): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4264, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... 15:49:44 (4908): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x77093AB3 read attempt to address 0x3F40E3C5 Engaging BOINC Windows Runtime Debugger... Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x00417B59 read attempt to address 0xC2898004 Engaging BOINC Windows Runtime Debugger... </stderr_txt> ]]> |

Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
25 Mar 2012 23:00:15 | 1177889 | 13669496 | hadcm3n_t12w_1940_40_007546336_0 | 259,200 | 512,160 | 1.9759 |
08 Mar 2012 04:45:52 | 1177889 | 13669496 | hadcm3n_t12w_1940_40_007546336_0 | 233,280 | 471,772 | 2.0223 |
28 Feb 2012 16:02:11 | 1177889 | 13669496 | hadcm3n_t12w_1940_40_007546336_0 | 207,360 | 425,802 | 2.0534 |
19 Feb 2012 20:05:39 | 1177889 | 13669496 | hadcm3n_t12w_1940_40_007546336_0 | 181,440 | 377,481 | 2.0805 |
13 Feb 2012 19:53:27 | 1177889 | 13669496 | hadcm3n_t12w_1940_40_007546336_0 | 155,520 | 324,831 | 2.0887 |
08 Feb 2012 18:14:18 | 1177889 | 13669496 | hadcm3n_t12w_1940_40_007546336_0 | 129,600 | 271,730 | 2.0967 |
03 Feb 2012 16:23:39 | 1177889 | 13669496 | hadcm3n_t12w_1940_40_007546336_0 | 103,680 | 217,968 | 2.1023 |
19 Jan 2012 06:41:07 | 1177889 | 13669496 | hadcm3n_t12w_1940_40_007546336_0 | 77,760 | 159,350 | 2.0493 |
05 Jan 2012 19:44:05 | 1177889 | 13669496 | hadcm3n_t12w_1940_40_007546336_0 | 51,840 | 107,981 | 2.0830 |
17 Dec 2011 06:28:22 | 1177889 | 13669496 | hadcm3n_t12w_1940_40_007546336_0 | 25,920 | 53,497 | 2.0639 |
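
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the cumulative timestep count. The page does not state this; it is inferred from the figures. A minimal sketch that recomputes the column from the rows above, where `TRICKLES` and `average_sec_per_timestep` are hypothetical names introduced only for illustration:

```python
# Rows copied from the trickle table above:
# (timestep, cumulative CPU time in seconds, reported average in sec/TS)
TRICKLES = [
    (259_200, 512_160, 1.9759),
    (233_280, 471_772, 2.0223),
    (207_360, 425_802, 2.0534),
    (181_440, 377_481, 2.0805),
    (155_520, 324_831, 2.0887),
    (129_600, 271_730, 2.0967),
    (103_680, 217_968, 2.1023),
    (77_760, 159_350, 2.0493),
    (51_840, 107_981, 2.0830),
    (25_920, 53_497, 2.0639),
]


def average_sec_per_timestep(cpu_time_sec: float, timestep: int) -> float:
    """Assumed definition: cumulative CPU seconds per model timestep."""
    return cpu_time_sec / timestep


if __name__ == "__main__":
    for timestep, cpu_time_sec, reported in TRICKLES:
        computed = average_sec_per_timestep(cpu_time_sec, timestep)
        # Print the recomputed value next to the one shown on the page.
        print(f"{timestep:>8,d}  computed={computed:.4f}  reported={reported:.4f}")
```

Under this assumption every recomputed value agrees with the reported column to four decimal places, which suggests the average is cumulative over the whole run rather than measured per trickle interval.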
©2024 cpdn.org