climateprediction.net home page
Task 13552027

Task 13552027

Name hadcm3n_ygs5_1900_40_007523166_1
Workunit 7720641
Created 28 Oct 2011, 13:21:27 UTC
Sent 31 Oct 2011, 17:37:59 UTC
Report deadline 31 Jan 2012, 1:05:10 UTC
Received 4 Jan 2012, 20:44:06 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 193 (0x000000C1) EXIT_SIGNAL
Computer ID 1047483
Run time 8 days 8 hours 56 min 57 sec
CPU time 6 days 10 hours 52 min 51 sec
Validate state Invalid
Credit 3,110.40
Device peak FLOPS 2.16 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.10.56</core_client_version>
<![CDATA[
<message>
 - exit code 193 (0xc1)
</message>
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4300, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1468, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=752, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1112, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3584, iMonCtr=1
Model crash detected, will try to restart...
12:36:33 (472): No heartbeat from core client for 30 sec - exiting
12:36:34 (472): No heartbeat from core client for 30 sec - exiting
12:36:35 (472): No heartbeat from core client for 30 sec - exiting
12:36:38 (472): No heartbeat from core client for 30 sec - exiting
12:36:39 (472): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3000, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3680, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3680, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3680, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3680, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=416, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4996, iMonCtr=1
Model crash detected, will try to restart...
11:44:51 (5432): No heartbeat from core client for 30 sec - exiting
11:44:52 (5432): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5332, iMonCtr=1
Model crash detected, will try to restart...
13:20:03 (800): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5716, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8052, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
20:35:38 (7288): No heartbeat from core client for 30 sec - exiting
20:35:39 (7288): No heartbeat from core client for 30 sec - exiting
20:35:40 (7288): No heartbeat from core client for 30 sec - exiting
20:35:41 (7288): No heartbeat from core client for 30 sec - exiting
20:35:42 (7288): No heartbeat from core client for 30 sec - exiting
20:35:44 (7288): No heartbeat from core client for 30 sec - exiting
20:35:45 (7288): No heartbeat from core client for 30 sec - exiting
20:35:46 (7288): No heartbeat from core client for 30 sec - exiting
20:35:47 (7288): No heartbeat from core client for 30 sec - exiting
20:35:48 (7288): No heartbeat from core client for 30 sec - exiting
20:35:49 (7288): No heartbeat from core client for 30 sec - exiting
20:35:50 (7288): No heartbeat from core client for 30 sec - exiting
20:35:51 (7288): No heartbeat from core client for 30 sec - exiting
20:35:53 (7288): No heartbeat from core client for 30 sec - exiting
20:35:54 (7288): No heartbeat from core client for 30 sec - exiting
20:35:55 (7288): No heartbeat from core client for 30 sec - exiting
20:35:56 (7288): No heartbeat from core client for 30 sec - exiting
20:35:57 (7288): No heartbeat from core client for 30 sec - exiting
20:35:58 (7288): No heartbeat from core client for 30 sec - exiting
20:35:59 (7288): No heartbeat from core client for 30 sec - exiting
20:36:00 (7288): No heartbeat from core client for 30 sec - exiting
20:36:01 (7288): No heartbeat from core client for 30 sec - exiting
20:36:02 (7288): No heartbeat from core client for 30 sec - exiting
20:36:04 (7288): No heartbeat from core client for 30 sec - exiting
20:36:05 (7288): No heartbeat from core client for 30 sec - exiting
20:36:06 (7288): No heartbeat from core client for 30 sec - exiting
20:36:07 (7288): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
14:44:50 (5716): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:48:18 (6776): Can't acquire lockfile (32) - waiting 35s
14:48:53 (6776): Can't acquire lockfile (32) - exiting
14:48:53 (6776): Error: The process cannot access the file because it is being used by another process. (0x20)
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
23:23:32 (6352): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:12:42 (3124): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:44:13 (416): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:07:47 (2160): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
23:03:14 (3964): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...


Unhandled Exception Detected...

- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x00417B59 read attempt to address 0x73003737

Engaging BOINC Windows Runtime Debugger...

Cannot serialize file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_ygs5_1900_40_007523166/dataout/shmem_restart.day
Signal 11 received, exiting...
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
04 Jan 2012 20:45:54 1047483 13552027 hadcm3n_ygs5_1900_40_007523166_1 259,200 557,567 2.1511
03 Jan 2012 23:58:52 1047483 13552027 hadcm3n_ygs5_1900_40_007523166_1 233,280 502,262 2.1530
31 Dec 2011 15:53:03 1047483 13552027 hadcm3n_ygs5_1900_40_007523166_1 207,360 443,327 2.1380
23 Dec 2011 16:37:53 1047483 13552027 hadcm3n_ygs5_1900_40_007523166_1 181,440 388,481 2.1411
04 Dec 2011 18:02:23 1047483 13552027 hadcm3n_ygs5_1900_40_007523166_1 155,520 334,310 2.1496
25 Nov 2011 17:13:07 1047483 13552027 hadcm3n_ygs5_1900_40_007523166_1 129,600 280,822 2.1668
20 Nov 2011 22:05:43 1047483 13552027 hadcm3n_ygs5_1900_40_007523166_1 103,680 226,471 2.1843
19 Nov 2011 15:55:38 1047483 13552027 hadcm3n_ygs5_1900_40_007523166_1 77,760 169,330 2.1776
16 Nov 2011 22:35:38 1047483 13552027 hadcm3n_ygs5_1900_40_007523166_1 51,840 113,797 2.1952
15 Nov 2011 17:35:15 1047483 13552027 hadcm3n_ygs5_1900_40_007523166_1 25,920 57,078 2.2021


©2024 climateprediction.net