Name | hadcm3n_o0tr_1900_40_007196402_2 |
Workunit | 7394682 |
Created | 4 Jul 2011, 11:12:16 UTC |
Sent | 4 Jul 2011, 11:37:47 UTC |
Report deadline | 3 Oct 2011, 19:04:58 UTC |
Received | 27 Aug 2011, 21:10:44 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | -1073741819 (0xC0000005) STATUS_ACCESS_VIOLATION |
Computer ID | 1145601 |
Run time | 22 days 13 hours 10 min 45 sec |
CPU time | 22 days 13 hours 10 min 45 sec |
Validate state | Invalid |
Credit | 9,331.20 |
Device peak FLOPS | 2.24 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <message> - exit code -1073741819 (0xc0000005) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... 22:01:53 (1948): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:02:31 (5364): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:06:47 (5716): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:40:43 (4616): No heartbeat from core client for 30 sec - exiting 03:40:44 (4616): No heartbeat from core client for 30 sec - exiting 03:40:45 (4616): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5116, iMonCtr=1 Model crash detected, will try to restart... 12:02:02 (5492): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 16:06:02 (296): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16 BUFFIN: C I/O Error feof - Unit 64 - Return code = 16 BUFFIN: C I/O Error feof - Unit 65 - Return code = 16 BUFFIN: C I/O Error feof - Unit 66 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 Error converting file to netcdf: dataout/o0trko.pjb4c10 Error converting file to netcdf: dataout/o0trko.pib4c10 Error converting file to netcdf: dataout/o0trko.pfb4c10 Error converting file to netcdf: dataout/o0trka.phb4c10 Error converting file to netcdf: dataout/o0trka.pgb4c10 Error converting file to netcdf: dataout/o0trka.peb4c10 Error converting file to netcdf: dataout/o0trka.pdb4c10 Suspended CPDN Monitor - Suspend request from BOINC... 09:45:59 (5568): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:24:43 (5212): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:25:21 (1608): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 19:35:25 (4728): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 15:38:41 (1368): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:04:57 (5460): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:29:31 (6860): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:00:45 (2348): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
10:42:15 (7084): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:57:16 (5812): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:15:05 (7620): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:41:49 (6860): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:29:46 (6844): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:25:57 (1116): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:51:06 (6920): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:02:30 (6200): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:36:48 (3848): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:29:38 (2028): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 18:16:42 (3108): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x77AA6E0F read attempt to address 0x434800FD Engaging BOINC Windows Runtime Debugger... Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3996, selfPID=3996, iMonCtr=1 </stderr_txt> ]]> |
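The exit status above is shown both as a signed decimal (-1073741819) and as the hex code 0xC0000005 (STATUS_ACCESS_VIOLATION). As a minimal sketch of how the two forms relate, the signed value can be reinterpreted as an unsigned 32-bit integer. The helper name `to_ntstatus_hex` is hypothetical and is not part of BOINC or CPDN.

```python
def to_ntstatus_hex(exit_code: int) -> str:
    """Reinterpret a signed 32-bit exit code as an unsigned hex string.

    Hypothetical helper for illustration only; it is not a BOINC API.
    """
    # Masking with 0xFFFFFFFF maps a negative value onto its
    # two's-complement unsigned 32-bit representation.
    return f"0x{exit_code & 0xFFFFFFFF:08X}"


# Exit status reported for this task.
print(to_ntstatus_hex(-1073741819))  # prints 0xC0000005
```

The same masking works for any negative exit status that a Windows host reports in signed decimal form.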
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
27 Aug 2011 21:10:32 | 1145601 | 13070442 | hadcm3n_o0tr_1900_40_007196402_2 | 777,600 | 1,936,382 | 2.4902 |
27 Aug 2011 21:10:32 | 1145601 | 13070442 | hadcm3n_o0tr_1900_40_007196402_2 | 751,680 | 1,883,122 | 2.5052 |
27 Aug 2011 21:10:32 | 1145601 | 13070442 | hadcm3n_o0tr_1900_40_007196402_2 | 725,760 | 1,829,620 | 2.5210 |
27 Aug 2011 21:10:32 | 1145601 | 13070442 | hadcm3n_o0tr_1900_40_007196402_2 | 699,840 | 1,775,913 | 2.5376 |
27 Aug 2011 21:10:32 | 1145601 | 13070442 | hadcm3n_o0tr_1900_40_007196402_2 | 673,920 | 1,722,764 | 2.5563 |
27 Aug 2011 21:10:31 | 1145601 | 13070442 | hadcm3n_o0tr_1900_40_007196402_2 | 648,000 | 1,659,174 | 2.5605 |
27 Aug 2011 21:10:31 | 1145601 | 13070442 | hadcm3n_o0tr_1900_40_007196402_2 | 622,080 | 1,595,072 | 2.5641 |
20 Aug 2011 18:34:11 | 1145601 | 13070442 | hadcm3n_o0tr_1900_40_007196402_2 | 596,160 | 1,531,326 | 2.5686 |
19 Aug 2011 21:37:12 | 1145601 | 13070442 | hadcm3n_o0tr_1900_40_007196402_2 | 570,240 | 1,468,142 | 2.5746 |
19 Aug 2011 02:13:47 | 1145601 | 13070442 | hadcm3n_o0tr_1900_40_007196402_2 | 544,320 | 1,403,966 | 2.5793 |
18 Aug 2011 06:39:37 | 1145601 | 13070442 | hadcm3n_o0tr_1900_40_007196402_2 | 518,400 | 1,338,368 | 2.5817 |
17 Aug 2011 10:16:21 | 1145601 | 13070442 | hadcm3n_o0tr_1900_40_007196402_2 | 492,480 | 1,270,205 | 2.5792 |
16 Aug 2011 13:57:47 | 1145601 | 13070442 | hadcm3n_o0tr_1900_40_007196402_2 | 466,560 | 1,205,954 | 2.5848 |
15 Aug 2011 18:18:39 | 1145601 | 13070442 | hadcm3n_o0tr_1900_40_007196402_2 | 440,640 | 1,141,490 | 2.5905 |
14 Aug 2011 22:45:51 | 1145601 | 13070442 | hadcm3n_o0tr_1900_40_007196402_2 | 414,720 | 1,075,118 | 2.5924 |
14 Aug 2011 01:42:03 | 1145601 | 13070442 | hadcm3n_o0tr_1900_40_007196402_2 | 388,800 | 1,005,942 | 2.5873 |
13 Aug 2011 06:36:46 | 1145601 | 13070442 | hadcm3n_o0tr_1900_40_007196402_2 | 362,880 | 938,974 | 2.5876 |
30 Jul 2011 12:48:18 | 1145601 | 13070442 | hadcm3n_o0tr_1900_40_007196402_2 | 336,960 | 871,774 | 2.5872 |
29 Jul 2011 18:14:49 | 1145601 | 13070442 | hadcm3n_o0tr_1900_40_007196402_2 | 311,040 | 808,822 | 2.6004 |
29 Jul 2011 00:28:18 | 1145601 | 13070442 | hadcm3n_o0tr_1900_40_007196402_2 | 285,120 | 747,385 | 2.6213 |
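The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count. The sketch below checks that assumption against the three most recent trickles from the table; the function name `average_sec_per_timestep` is hypothetical, and the rounding to four decimals is assumed from how the table displays the values.

```python
def average_sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Cumulative CPU seconds per model timestep (assumed definition)."""
    return cpu_time_sec / timestep


# (timestep, CPU time in seconds, reported average) copied from the table.
trickles = [
    (777_600, 1_936_382, 2.4902),
    (751_680, 1_883_122, 2.5052),
    (725_760, 1_829_620, 2.5210),
]

for timestep, cpu_time, reported in trickles:
    computed = average_sec_per_timestep(cpu_time, timestep)
    # Allow for the four-decimal rounding used in the table.
    matches = abs(computed - reported) < 1e-4
    print(f"{timestep:>9,d}  computed={computed:.4f}  reported={reported:.4f}  match={matches}")
```

Consecutive trickles in the table are 25,920 timesteps apart, so the same check can be applied row by row to the remaining entries.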