Name | hadcm3n_yeno_1940_40_007547754_3 |
Workunit | 7744986 |
Created | 2 Jan 2012, 22:45:04 UTC |
Sent | 2 Jan 2012, 22:45:11 UTC |
Report deadline | 3 Apr 2012, 6:12:22 UTC |
Received | 25 Jan 2012, 14:44:27 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1107394 |
Run time | 3 days 20 hours 30 min 46 sec |
CPU time | 3 days 7 hours 45 min 19 sec |
Validate state | Invalid |
Credit | 2,488.32 |
Device peak FLOPS | 2.95 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>6.12.34</core_client_version> <![CDATA[ <message> El dispositivo no reconoce el comando. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3528, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3372, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=164, iMonCtr=1 Model crash detected, will try to restart... 10:10:02 (4672): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:02:32 (1772): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:47:22 (580): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3204, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3204, iMonCtr=1 Model crash detected, will try to restart... 09:55:59 (4708): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5184, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2024, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2024, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2024, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2024, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2024, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2024, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish 10:01:57 (816): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:03:05 (4176): No heartbeat from core client for 30 sec - exiting 10:03:06 (4176): No heartbeat from core client for 30 sec - exiting 10:03:07 (4176): No heartbeat from core client for 30 sec - exiting 10:03:08 (4176): No heartbeat from core client for 30 sec - exiting 10:03:09 (4176): No heartbeat from core client for 30 sec - exiting 10:03:10 (4176): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:03:11 (4176): No heartbeat from core client for 30 sec - exiting BUFFIN: C I/O Error feof - Unit 21 - Return code = 16 Model crashed: READDUMP: BAD BUFFIN OF DATA tmp/pipe_dummy 2048 BUFFIN: C I/O Error feof - Unit 21 - Return code = 16 Model crashed: READDUMP: BAD BUFFIN OF DATA tmp/pipe_dummy 2048 BUFFIN: C I/O Error feof - Unit 21 - Return code = 16 Model crashed: READDUMP: BAD BUFFIN OF DATA tmp/pipe_dummy 2048 BUFFIN: C I/O Error feof - Unit 21 - Return code = 16 Model crashed: READDUMP: BAD BUFFIN OF DATA tmp/pipe_dummy 2048 BUFFIN: C I/O Error feof - Unit 21 - Return code = 16 Model crashed: READDUMP: BAD BUFFIN OF DATA tmp/pipe_dummy 2048 BUFFIN: C I/O Error feof - Unit 21 - Return code = 16 Model crashed: READDUMP: BAD BUFFIN OF DATA tmp/pipe_dummy 2048 Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
24 Jan 2012 19:29:28 | 1107394 | 13852636 | hadcm3n_yeno_1940_40_007547754_3 | 207,360 | 276,210 | 1.3320 |
23 Jan 2012 22:28:27 | 1107394 | 13852636 | hadcm3n_yeno_1940_40_007547754_3 | 181,440 | 241,132 | 1.3290 |
22 Jan 2012 10:31:31 | 1107394 | 13852636 | hadcm3n_yeno_1940_40_007547754_3 | 155,520 | 206,276 | 1.3264 |
20 Jan 2012 20:01:53 | 1107394 | 13852636 | hadcm3n_yeno_1940_40_007547754_3 | 129,600 | 171,810 | 1.3257 |
17 Jan 2012 21:41:26 | 1107394 | 13852636 | hadcm3n_yeno_1940_40_007547754_3 | 103,680 | 137,239 | 1.3237 |
16 Jan 2012 11:45:22 | 1107394 | 13852636 | hadcm3n_yeno_1940_40_007547754_3 | 77,760 | 102,883 | 1.3231 |
14 Jan 2012 17:08:57 | 1107394 | 13852636 | hadcm3n_yeno_1940_40_007547754_3 | 51,840 | 68,785 | 1.3269 |
04 Jan 2012 19:17:48 | 1107394 | 13852636 | hadcm3n_yeno_1940_40_007547754_3 | 25,920 | 34,049 | 1.3136 |
©2024 cpdn.org