Name | hadcm3n_n1rs_1880_40_008286005_3 |
Workunit | 8437140 |
Created | 12 Jun 2013, 20:20:43 UTC |
Sent | 12 Jun 2013, 21:15:20 UTC |
Report deadline | 12 Sep 2013, 4:42:31 UTC |
Received | 14 Nov 2013, 8:13:38 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 193 (0x000000C1) EXIT_SIGNAL |
Computer ID | 1259931 |
Run time | 21 days 7 hours 17 min 33 sec |
CPU time | 20 days 6 hours 10 min 15 sec |
Validate state | Invalid |
Credit | 12,441.60 |
Device peak FLOPS | 2.41 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.28</core_client_version> <![CDATA[ <message> - exit code 193 (0xc1) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... 14:23:32 (4056): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 10:38:00 (4772): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 06:49:46 (5212): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: C I/O Error feof - Unit 63 - Return code = 16 BUFFIN: C I/O Error feof - Unit 64 - Return code = 16 BUFFIN: C I/O Error feof - Unit 65 - Return code = 16 BUFFIN: C I/O Error feof - Unit 66 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 Error converting file to netcdf: dataout/n1rsko.pja1c10 Error converting file to netcdf: dataout/n1rsko.pia1c10 Error converting file to netcdf: dataout/n1rsko.pfa1c10 Error converting file to netcdf: dataout/n1rska.pha1c10 Error converting file to netcdf: dataout/n1rska.pga1c10 Error converting file to netcdf: dataout/n1rska.pea1c10 Error converting file to netcdf: dataout/n1rska.pda1c10 15:22:57 (4224): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:32:15 (6032): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:27:50 (3932): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:30:34 (4264): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:57:49 (7552): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Atmos Hold Restart file rename failed on atmos_restart.hold 02:03:41 (5400): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:03:42 (5400): No heartbeat from core client for 30 sec - exiting 02:03:43 (5400): No heartbeat from core client for 30 sec - exiting 02:03:44 (5400): No heartbeat from core client for 30 sec - exiting 02:03:45 (5400): No heartbeat from core client for 30 sec - exiting 02:03:46 (5400): No heartbeat from core client for 30 sec - exiting 02:03:47 (5400): No heartbeat from core client for 30 sec - exiting 02:03:48 (5400): No heartbeat from core client for 30 sec - exiting 02:03:49 (5400): No heartbeat from core client for 30 sec - exiting 02:03:50 (5400): No heartbeat from core client for 30 sec - exiting 02:03:51 (5400): No heartbeat from core client for 30 sec - exiting 11:05:09 (3544): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... 08:17:50 (4600): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x77653AC3 read attempt to address 0x4046C382 Engaging BOINC Windows Runtime Debugger... Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x779D3AC3 read attempt to address 0x4046C382 Engaging BOINC Windows Runtime Debugger... Cannot serialize file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_n1rs_1880_40_008286005/dataout/shmem_restart.day Signal 11 received, exiting... Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
07 Nov 2013 17:37:29 | 1259931 | 15840606 | hadcm3n_n1rs_1880_40_008286005_3 | 1,036,800 | 1,749,619 | 1.6875 |
07 Nov 2013 06:20:11 | 1259931 | 15840606 | hadcm3n_n1rs_1880_40_008286005_3 | 1,010,880 | 1,710,767 | 1.6924 |
06 Nov 2013 19:01:31 | 1259931 | 15840606 | hadcm3n_n1rs_1880_40_008286005_3 | 984,960 | 1,671,730 | 1.6973 |
06 Nov 2013 06:26:48 | 1259931 | 15840606 | hadcm3n_n1rs_1880_40_008286005_3 | 959,040 | 1,629,569 | 1.6992 |
05 Nov 2013 17:36:57 | 1259931 | 15840606 | hadcm3n_n1rs_1880_40_008286005_3 | 933,120 | 1,586,396 | 1.7001 |
05 Nov 2013 04:36:58 | 1259931 | 15840606 | hadcm3n_n1rs_1880_40_008286005_3 | 907,200 | 1,543,150 | 1.7010 |
04 Nov 2013 16:14:42 | 1259931 | 15840606 | hadcm3n_n1rs_1880_40_008286005_3 | 881,280 | 1,500,600 | 1.7028 |
04 Nov 2013 03:32:35 | 1259931 | 15840606 | hadcm3n_n1rs_1880_40_008286005_3 | 855,360 | 1,457,970 | 1.7045 |
03 Nov 2013 16:26:31 | 1259931 | 15840606 | hadcm3n_n1rs_1880_40_008286005_3 | 829,440 | 1,419,073 | 1.7109 |
03 Nov 2013 05:42:21 | 1259931 | 15840606 | hadcm3n_n1rs_1880_40_008286005_3 | 803,520 | 1,380,934 | 1.7186 |
02 Nov 2013 18:52:38 | 1259931 | 15840606 | hadcm3n_n1rs_1880_40_008286005_3 | 777,600 | 1,342,290 | 1.7262 |
02 Nov 2013 08:08:00 | 1259931 | 15840606 | hadcm3n_n1rs_1880_40_008286005_3 | 751,680 | 1,304,318 | 1.7352 |
01 Nov 2013 21:13:37 | 1259931 | 15840606 | hadcm3n_n1rs_1880_40_008286005_3 | 725,760 | 1,266,188 | 1.7446 |
01 Nov 2013 09:07:19 | 1259931 | 15840606 | hadcm3n_n1rs_1880_40_008286005_3 | 699,840 | 1,224,031 | 1.7490 |
31 Oct 2013 17:21:01 | 1259931 | 15840606 | hadcm3n_n1rs_1880_40_008286005_3 | 673,920 | 1,171,761 | 1.7387 |
30 Oct 2013 19:13:34 | 1259931 | 15840606 | hadcm3n_n1rs_1880_40_008286005_3 | 648,000 | 1,107,070 | 1.7084 |
29 Oct 2013 21:54:43 | 1259931 | 15840606 | hadcm3n_n1rs_1880_40_008286005_3 | 622,080 | 1,043,219 | 1.6770 |
29 Oct 2013 02:45:16 | 1259931 | 15840606 | hadcm3n_n1rs_1880_40_008286005_3 | 596,160 | 980,499 | 1.6447 |
28 Oct 2013 08:45:49 | 1259931 | 15840606 | hadcm3n_n1rs_1880_40_008286005_3 | 570,240 | 918,685 | 1.6110 |
27 Oct 2013 13:50:53 | 1259931 | 15840606 | hadcm3n_n1rs_1880_40_008286005_3 | 544,320 | 853,034 | 1.5672 |
©2024 climateprediction.net