Name | hadcm3n_n2n0_1920_40_008377457_0 |
Workunit | 8528316 |
Created | 30 May 2013, 11:54:44 UTC |
Sent | 30 May 2013, 16:33:48 UTC |
Report deadline | 30 Aug 2013, 0:00:59 UTC |
Received | 18 Jul 2013, 11:48:43 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | -1073741819 (0xC0000005) STATUS_ACCESS_VIOLATION |
Computer ID | 963814 |
Run time | 12 days 21 hours 0 min 27 sec |
CPU time | 12 days 4 hours 54 min 9 sec |
Validate state | Invalid |
Credit | 9,331.20 |
Device peak FLOPS | 2.98 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.64</core_client_version> <![CDATA[ <message> (unknown error) - exit code -1073741819 (0xc0000005) </message> <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9816, iMonCtr=1 Model crash detected, will try to restart... 15:47:41 (3652): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4808, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4808, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... 12:26:03 (6036): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 12:45:29 (5676): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 16:07:10 (5892): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 14:11:17 (4776): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 21:40:08 (7960): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 12:35:31 (1384): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 22:40:11 (7560): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:32:08 (2912): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5904, iMonCtr=1 Model crash detected, will try to restart... 12:27:06 (5380): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 07:27:08 (3056): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 16:35:44 (7812): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 23:13:06 (3164): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 07:15:16 (8880): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 23:42:19 (6772): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:50:14 (7228): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 20:25:43 (4708): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 04:58:00 (4916): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 10:52:15 (8452): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 15:27:50 (4600): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:40:45 (3376): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 07:35:50 (1776): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 16:09:07 (4928): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:56:42 (2732): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:37:13 (8324): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:51:19 (1216): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:42:39 (8528): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 23:12:35 (8068): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3572, iMonCtr=1 Model crash detected, will try to restart... 10:57:49 (5564): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:23:17 (5376): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 10:04:43 (4632): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7848, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5192, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5192, iMonCtr=1 Model crash detected, will try to restart... 12:08:32 (5908): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3532, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2148, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2148, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... 11:08:19 (2532): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:14:46 (3012): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 12:26:38 (5208): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 12:49:52 (4872): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 13:14:57 (4852): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x778C432D read attempt to address 0xFFFFFFF8 Engaging BOINC Windows Runtime Debugger... Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3892, selfPID=3892, iMonCtr=1 </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
23 Jul 2013 15:18:10 | 963814 | 15806891 | hadcm3n_n2n0_1920_40_008377457_0 | 777,600 | 1,054,423 | 1.3560 |
23 Jul 2013 15:18:10 | 963814 | 15806891 | hadcm3n_n2n0_1920_40_008377457_0 | 751,680 | 1,017,325 | 1.3534 |
11 Jul 2013 12:32:17 | 963814 | 15806891 | hadcm3n_n2n0_1920_40_008377457_0 | 725,760 | 979,807 | 1.3500 |
08 Jul 2013 09:17:26 | 963814 | 15806891 | hadcm3n_n2n0_1920_40_008377457_0 | 699,840 | 942,301 | 1.3465 |
04 Jul 2013 14:28:30 | 963814 | 15806891 | hadcm3n_n2n0_1920_40_008377457_0 | 673,920 | 904,586 | 1.3423 |
03 Jul 2013 09:58:08 | 963814 | 15806891 | hadcm3n_n2n0_1920_40_008377457_0 | 648,000 | 867,673 | 1.3390 |
03 Jul 2013 00:23:28 | 963814 | 15806891 | hadcm3n_n2n0_1920_40_008377457_0 | 622,080 | 831,033 | 1.3359 |
02 Jul 2013 13:14:45 | 963814 | 15806891 | hadcm3n_n2n0_1920_40_008377457_0 | 596,160 | 794,398 | 1.3325 |
28 Jun 2013 10:24:13 | 963814 | 15806891 | hadcm3n_n2n0_1920_40_008377457_0 | 570,240 | 757,531 | 1.3284 |
25 Jun 2013 12:02:39 | 963814 | 15806891 | hadcm3n_n2n0_1920_40_008377457_0 | 544,320 | 723,693 | 1.3295 |
25 Jun 2013 02:33:26 | 963814 | 15806891 | hadcm3n_n2n0_1920_40_008377457_0 | 518,400 | 690,494 | 1.3320 |
24 Jun 2013 17:19:32 | 963814 | 15806891 | hadcm3n_n2n0_1920_40_008377457_0 | 492,480 | 656,928 | 1.3339 |
20 Jun 2013 06:31:59 | 963814 | 15806891 | hadcm3n_n2n0_1920_40_008377457_0 | 466,560 | 622,333 | 1.3339 |
19 Jun 2013 20:10:09 | 963814 | 15806891 | hadcm3n_n2n0_1920_40_008377457_0 | 440,640 | 587,867 | 1.3341 |
19 Jun 2013 08:51:29 | 963814 | 15806891 | hadcm3n_n2n0_1920_40_008377457_0 | 414,720 | 552,709 | 1.3327 |
18 Jun 2013 21:20:14 | 963814 | 15806891 | hadcm3n_n2n0_1920_40_008377457_0 | 388,800 | 518,113 | 1.3326 |
17 Jun 2013 20:20:44 | 963814 | 15806891 | hadcm3n_n2n0_1920_40_008377457_0 | 362,880 | 483,295 | 1.3318 |
14 Jun 2013 03:55:22 | 963814 | 15806891 | hadcm3n_n2n0_1920_40_008377457_0 | 336,960 | 448,971 | 1.3324 |
13 Jun 2013 00:20:36 | 963814 | 15806891 | hadcm3n_n2n0_1920_40_008377457_0 | 311,040 | 414,172 | 1.3316 |
12 Jun 2013 14:36:18 | 963814 | 15806891 | hadcm3n_n2n0_1920_40_008377457_0 | 285,120 | 380,013 | 1.3328 |
©2024 cpdn.org