Task 15417074

Name hadcm3n_zjwq_1880_40_008241903_2
Workunit 8397027
Created 29 Oct 2012, 17:32:25 UTC
Sent 29 Oct 2012, 17:32:32 UTC
Report deadline 29 Jan 2013, 0:59:43 UTC
Received 15 Jan 2013, 13:50:37 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1185186
Run time 11 days 17 hours 4 min 30 sec
CPU time 9 days 12 hours 34 min 31 sec
Validate state Invalid
Credit 4,976.64
Device peak FLOPS 2.16 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5452, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4544, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5184, iMonCtr=1
Model crash detected, will try to restart...
Atmos Hold Restart file rename failed on atmos_restart.hold
Atmos Restart file copy failed on zjwqka.da81bp0
Atmos Hold Restart file rename failed on atmos_restart.hold
Atmos Restart file copy failed on zjwqka.da81bq0
Atmos Hold Restart file rename failed on atmos_restart.hold
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5332, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2472, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4212, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Atmos Hold Restart file rename failed on atmos_restart.hold
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2060, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4872, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5420, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5572, iMonCtr=1
Model crash detected, will try to restart...
15:17:27 (5304): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5328, iMonCtr=1
Model crash detected, will try to restart...
01:48:29 (5488): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
12:48:13 (5968): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
14:28:33 (3552): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5468, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5432, iMonCtr=1
Model crash detected, will try to restart...
Atmos Hold Restart file rename failed on atmos_restart.hold
20:36:10 (5544): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5048, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5620, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5620, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5620, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5620, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5620, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5620, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
07 Jan 2013 21:13:45  1185186  15417074   hadcm3n_zjwq_1880_40_008241903_2  414,720   776,031         1.8712
04 Jan 2013 18:53:31  1185186  15417074   hadcm3n_zjwq_1880_40_008241903_2  388,800   727,838         1.8720
02 Jan 2013 01:08:26  1185186  15417074   hadcm3n_zjwq_1880_40_008241903_2  362,880   679,690         1.8730
28 Dec 2012 16:46:15  1185186  15417074   hadcm3n_zjwq_1880_40_008241903_2  336,960   631,840         1.8751
18 Dec 2012 14:15:28  1185186  15417074   hadcm3n_zjwq_1880_40_008241903_2  311,040   581,391         1.8692
13 Dec 2012 22:17:30  1185186  15417074   hadcm3n_zjwq_1880_40_008241903_2  285,120   533,128         1.8698
13 Dec 2012 22:17:30  1185186  15417074   hadcm3n_zjwq_1880_40_008241903_2  259,200   485,121         1.8716
06 Dec 2012 16:50:36  1185186  15417074   hadcm3n_zjwq_1880_40_008241903_2  233,280   438,148         1.8782
04 Dec 2012 20:29:39  1185186  15417074   hadcm3n_zjwq_1880_40_008241903_2  207,360   392,012         1.8905
03 Dec 2012 19:16:39  1185186  15417074   hadcm3n_zjwq_1880_40_008241903_2  181,440   345,341         1.9033
29 Nov 2012 11:58:11  1185186  15417074   hadcm3n_zjwq_1880_40_008241903_2  155,520   297,341         1.9119
26 Nov 2012 20:29:47  1185186  15417074   hadcm3n_zjwq_1880_40_008241903_2  129,600   248,564         1.9179
20 Nov 2012 14:11:51  1185186  15417074   hadcm3n_zjwq_1880_40_008241903_2  103,680   200,303         1.9319
19 Nov 2012 15:14:32  1185186  15417074   hadcm3n_zjwq_1880_40_008241903_2  77,760    152,316         1.9588
15 Nov 2012 20:07:46  1185186  15417074   hadcm3n_zjwq_1880_40_008241903_2  51,840    102,372         1.9748
12 Nov 2012 18:03:37  1185186  15417074   hadcm3n_zjwq_1880_40_008241903_2  25,920    50,970          1.9664
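
The Average (sec/TS) column in the trickle table above appears to be the cumulative CPU time divided by the cumulative timestep count, rounded to four decimal places. The sketch below checks that reading against three rows copied from the table. The function name avg_sec_per_timestep is made up for illustration and is not part of any CPDN or BOINC tool, and the rounding rule is an assumption inferred from the displayed values.

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_average).
ROWS = [
    (414_720, 776_031, 1.8712),  # 07 Jan 2013 (latest trickle)
    (388_800, 727_838, 1.8720),  # 04 Jan 2013
    (25_920, 50_970, 1.9664),    # 12 Nov 2012 (first trickle)
]


def avg_sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Cumulative CPU seconds per model timestep, rounded to 4 places.

    Assumption: this is how the page derives its Average column.
    """
    return round(cpu_time_sec / timestep, 4)


if __name__ == "__main__":
    for timestep, cpu_time_sec, reported in ROWS:
        computed = avg_sec_per_timestep(cpu_time_sec, timestep)
        print(f"timestep={timestep:>7}  computed={computed:.4f}  reported={reported:.4f}")
```

If the assumption holds, the computed and reported values agree for each row. The timestep column also advances by a constant 25,920 steps between consecutive trickles.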

