Name | famous_v3fq_999_200_006731750_6 |
Workunit | 6935091 |
Created | 25 Jan 2011, 20:26:56 UTC |
Sent | 25 Jan 2011, 20:26:58 UTC |
Report deadline | 27 Apr 2011, 3:54:09 UTC |
Received | 9 Jun 2011, 16:08:45 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 124409 |
Run time | |
CPU time | 12 days 12 hours 7 min 30 sec |
Validate state | Invalid |
Credit | 4,138.23 |
Device peak FLOPS | 1.27 GFLOPS |
Application version | UK Met Office FAMOUS v6.11 windows_intelx86 |
Stderr | <core_client_version>5.10.20</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 22:40:37 (2812): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 22:41:14 (2812): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 15:38:07 (2732): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 01:47:37 (2520): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 03:18:37 (1660): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 09:31:29 (1188): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CSuspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 03:14:25 (2788): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CSignal 11 received, exiting... 22:11:18 (1616): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2484, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... No Process Handle Worker:: CPDN process is09:39:22 (332): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:40:03 (332): No heartbeat from core client for 30 sec - exiting 10:40:05 (1076): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 17:14:06 (3912): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:14:48 (3912): No heartbeat from core client for 30 sec - exiting 10:48:36 (312): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:49:13 (312): No heartbeat from core client for 30 sec - exiting 13:48:32 (2160): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:48:34 (2820): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:48:37 (2468): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:49:06 (2336): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:49:42 (2336): No heartbeat from core client for 30 sec - exiting 21:48:58 (472): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:49:30 (472): No heartbeat from core client for 30 sec - exiting 22:51:47 (3300): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:49:00 (3320): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:49:31 (3320): No heartbeat from core client for 30 sec - exiting 01:58:13 (3480): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:53:32 (2644): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:29:29 (3200): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:30:05 (3200): No heartbeat from core client for 30 sec - exiting 22:32:29 (1684): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:33:06 (1684): No heartbeat from core client for 30 sec - exiting 23:32:22 (132): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:32:58 (132): No heartbeat from core client for 30 sec - exiting 16:28:56 (3660): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:32:21 (1656): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:32:57 (1656): No heartbeat from core client for 30 sec - exiting BUFFOUT: Write Failed: No space left on device BUFFOUT: C I/O Error - Return code = 32 Model crashed: WRITDUMP: BAD BUFFOUT OF DATA tmp/pipe_dummy BUFFOUT: Write Failed: No space left on device BUFFOUT: C I/O Error - Return code = 32 Model crashed: WRITDUMP: BAD BUFFOUT OF DATA tmp/pipe_dummy BUFFOUT: Write Failed: No space left on device BUFFOUT: C I/O Error - Return code = 32 Model crashed: WRITDUMP: BAD BUFFOUT OF DATA tmp/pipe_dummy no start tag in app init data 10:41:29 (544): Can't parse init data file - running in standalone mode BUFFOUT: Write Failed: No space left on device BUFFOUT: C I/O Error - Return code = 32 Model crashed: WRITDUMP: BAD BUFFOUT OF DATA tmp/pipe_dummy no start tag in app init data 12:37:38 (2620): Can't parse init data file - running in standalone mode no start tag in app init data 12:37:38 (2424): Can't parse init data file - running in standalone mode BUFFOUT: Write Failed: No space left on device BUFFOUT: C I/O Error - Return code = 32 Model crashed: WRITDUMP: BAD BUFFOUT OF DATA tmp/pipe_dummy no start tag in app init data 16:53:25 (2620): Can't parse init data file - running in standalone mode no start tag in app init data 16:53:25 (3456): Can't parse init data file - running in standalone mode BUFFOUT: Write Failed: No space left on device BUFFOUT: C I/O Error - Return code = 32 Model crashed: WRITDUMP: BAD BUFFOUT OF DATA 19:05:41 (2620): called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
08 Jun 2011 18:57:34 | 124409 | 12520074 | famous_v3fq_999_200_006731750_6 | 1,254,266 | 1,078,458 | 0.8598 |
30 May 2011 16:01:09 | 124409 | 12520074 | famous_v3fq_999_200_006731750_6 | 1,244,906 | 1,069,924 | 0.8594 |
28 May 2011 21:08:47 | 124409 | 12520074 | famous_v3fq_999_200_006731750_6 | 1,235,546 | 1,061,828 | 0.8594 |
18 May 2011 21:18:24 | 124409 | 12520074 | famous_v3fq_999_200_006731750_6 | 1,226,186 | 1,049,649 | 0.8560 |
27 Apr 2011 02:30:43 | 124409 | 12520074 | famous_v3fq_999_200_006731750_6 | 1,216,826 | 1,029,968 | 0.8464 |
11 Mar 2011 07:04:56 | 124409 | 12520074 | famous_v3fq_999_200_006731750_6 | 1,207,466 | 1,014,953 | 0.8406 |
11 Mar 2011 04:37:32 | 124409 | 12520074 | famous_v3fq_999_200_006731750_6 | 1,198,106 | 1,007,144 | 0.8406 |
11 Mar 2011 02:20:36 | 124409 | 12520074 | famous_v3fq_999_200_006731750_6 | 1,188,746 | 999,296 | 0.8406 |
10 Mar 2011 23:57:48 | 124409 | 12520074 | famous_v3fq_999_200_006731750_6 | 1,179,386 | 991,421 | 0.8406 |
10 Mar 2011 18:42:51 | 124409 | 12520074 | famous_v3fq_999_200_006731750_6 | 1,170,026 | 983,493 | 0.8406 |
10 Mar 2011 16:26:25 | 124409 | 12520074 | famous_v3fq_999_200_006731750_6 | 1,160,666 | 975,423 | 0.8404 |
10 Mar 2011 14:10:57 | 124409 | 12520074 | famous_v3fq_999_200_006731750_6 | 1,151,306 | 967,362 | 0.8402 |
10 Mar 2011 08:53:29 | 124409 | 12520074 | famous_v3fq_999_200_006731750_6 | 1,141,946 | 959,373 | 0.8401 |
10 Mar 2011 06:35:20 | 124409 | 12520074 | famous_v3fq_999_200_006731750_6 | 1,132,586 | 951,431 | 0.8401 |
10 Mar 2011 04:19:12 | 124409 | 12520074 | famous_v3fq_999_200_006731750_6 | 1,123,226 | 943,471 | 0.8400 |
09 Mar 2011 22:50:45 | 124409 | 12520074 | famous_v3fq_999_200_006731750_6 | 1,113,866 | 935,501 | 0.8399 |
09 Mar 2011 20:21:06 | 124409 | 12520074 | famous_v3fq_999_200_006731750_6 | 1,104,506 | 927,549 | 0.8398 |
09 Mar 2011 18:08:22 | 124409 | 12520074 | famous_v3fq_999_200_006731750_6 | 1,095,146 | 919,888 | 0.8400 |
09 Mar 2011 13:02:15 | 124409 | 12520074 | famous_v3fq_999_200_006731750_6 | 1,085,786 | 912,213 | 0.8401 |
09 Mar 2011 07:50:17 | 124409 | 12520074 | famous_v3fq_999_200_006731750_6 | 1,076,426 | 904,530 | 0.8403 |
©2024 cpdn.org