climateprediction.net home page
Task 15134487

Task 15134487

Name hadcm3n_ylrw_1980_40_008154412_0
Workunit 8309536
Created 17 Aug 2012, 12:08:54 UTC
Sent 17 Aug 2012, 12:12:33 UTC
Report deadline 16 Nov 2012, 19:39:44 UTC
Received 28 Aug 2012, 1:36:56 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status -226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS
Computer ID 1186899
Run time 8 days 9 hours 35 min 57 sec
CPU time 8 days 5 hours 49 min 27 sec
Validate state Invalid
Credit 4,665.60
Device peak FLOPS 2.82 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.25</core_client_version>
<![CDATA[
<message>
too many boinc_temporary_exit()s
</message>
<stderr_txt>
11:47:45 (3188): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:02:18 (2640): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:31:13 (2372): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:06:35 (3508): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
10:28:37 (3788): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:00:32 (4584): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4336, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
12:56:14 (4336): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5844, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5844, iMonCtr=1
Model crash detected, will try to restart...
12:56:39 (5928): Can't acquire lockfile (32) - waiting 35s
Signal 22 received, exiting...
Called boinc_finish
12:56:56 (5844): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:57:14 (5928): Can't set up shared mem: -1. Will run in standalone mode.
12:57:14 (3060): Can't set up shared mem: -1. Will run in standalone mode.
Signal 22 received, exiting...
08:18:44 (4412): Can't acquire lockfile (32) - waiting 35s
08:19:19 (4412): Can't acquire lockfile (32) - exiting
08:19:19 (4412): Error: The process cannot access the file because it is being used by another process. (0x20)
09:03:17 (1604): Can't acquire lockfile (32) - waiting 35s
09:03:52 (1604): Can't acquire lockfile (32) - exiting
09:03:52 (1604): Error: The process cannot access the file because it is being used by another process. (0x20)
09:20:04 (4832): Can't acquire lockfile (32) - waiting 35s
09:20:39 (4832): Can't acquire lockfile (32) - exiting
09:20:39 (4832): Error: The process cannot access the file because it is being used by another process. (0x20)
09:30:47 (684): Can't acquire lockfile (32) - waiting 35s
09:31:22 (684): Can't acquire lockfile (32) - exiting
09:31:22 (684): Error: The process cannot access the file because it is being used by another process. (0x20)
10:00:32 (6084): Can't acquire lockfile (32) - waiting 35s
10:01:07 (6084): Can't acquire lockfile (32) - exiting
10:01:07 (6084): Error: The process cannot access the file because it is being used by another process. (0x20)
10:11:18 (5000): Can't acquire lockfile (32) - waiting 35s
10:11:53 (5000): Can't acquire lockfile (32) - exiting
10:11:53 (5000): Error: The process cannot access the file because it is being used by another process. (0x20)
10:22:15 (5468): Can't acquire lockfile (32) - waiting 35s
10:22:50 (5468): Can't acquire lockfile (32) - exiting
10:22:50 (5468): Error: The process cannot access the file because it is being used by another process. (0x20)
10:33:04 (6720): Can't acquire lockfile (32) - waiting 35s
10:33:39 (6720): Can't acquire lockfile (32) - exiting
10:33:39 (6720): Error: The process cannot access the file because it is being used by another process. (0x20)
10:43:42 (1304): Can't acquire lockfile (32) - waiting 35s
10:44:17 (1304): Can't acquire lockfile (32) - exiting
10:44:17 (1304): Error: The process cannot access the file because it is being used by another process. (0x20)
11:00:30 (5612): Can't acquire lockfile (32) - waiting 35s
11:01:05 (5612): Can't acquire lockfile (32) - exiting
11:01:05 (5612): Error: The process cannot access the file because it is being used by another process. (0x20)
11:11:16 (6032): Can't acquire lockfile (32) - waiting 35s
11:11:51 (6032): Can't acquire lockfile (32) - exiting
11:11:51 (6032): Error: The process cannot access the file because it is being used by another process. (0x20)
11:22:01 (7528): Can't acquire lockfile (32) - waiting 35s
11:22:36 (7528): Can't acquire lockfile (32) - exiting
11:22:36 (7528): Error: The process cannot access the file because it is being used by another process. (0x20)
11:32:47 (4988): Can't acquire lockfile (32) - waiting 35s
11:33:22 (4988): Can't acquire lockfile (32) - exiting
11:33:22 (4988): Error: The process cannot access the file because it is being used by another process. (0x20)
11:43:33 (7956): Can't acquire lockfile (32) - waiting 35s
11:44:08 (7956): Can't acquire lockfile (32) - exiting
11:44:08 (7956): Error: The process cannot access the file because it is being used by another process. (0x20)
11:54:19 (7216): Can't acquire lockfile (32) - waiting 35s
11:54:54 (7216): Can't acquire lockfile (32) - exiting
11:54:54 (7216): Error: The process cannot access the file because it is being used by another process. (0x20)
12:05:04 (5376): Can't acquire lockfile (32) - waiting 35s
12:05:39 (5376): Can't acquire lockfile (32) - exiting
12:05:39 (5376): Error: The process cannot access the file because it is being used by another process. (0x20)
12:16:07 (6912): Can't acquire lockfile (32) - waiting 35s
12:16:42 (6912): Can't acquire lockfile (32) - exiting
12:16:42 (6912): Error: The process cannot access the file because it is being used by another process. (0x20)
12:27:00 (6932): Can't acquire lockfile (32) - waiting 35s
12:27:35 (6932): Can't acquire lockfile (32) - exiting
12:27:35 (6932): Error: The process cannot access the file because it is being used by another process. (0x20)
12:45:47 (5748): Can't acquire lockfile (32) - waiting 35s
12:46:22 (5748): Can't acquire lockfile (32) - exiting
12:46:22 (5748): Error: The process cannot access the file because it is being used by another process. (0x20)
12:56:46 (1808): Can't acquire lockfile (32) - waiting 35s
12:57:21 (1808): Can't acquire lockfile (32) - exiting
12:57:21 (1808): Error: The process cannot access the file because it is being used by another process. (0x20)
13:07:57 (5180): Can't acquire lockfile (32) - waiting 35s
13:08:32 (5180): Can't acquire lockfile (32) - exiting
13:08:32 (5180): Error: The process cannot access the file because it is being used by another process. (0x20)
13:19:31 (7340): Can't acquire lockfile (32) - waiting 35s
13:20:06 (7340): Can't acquire lockfile (32) - exiting
13:20:06 (7340): Error: The process cannot access the file because it is being used by another process. (0x20)
13:30:18 (3344): Can't acquire lockfile (32) - waiting 35s
13:30:53 (3344): Can't acquire lockfile (32) - exiting
13:30:53 (3344): Error: The process cannot access the file because it is being used by another process. (0x20)
13:47:38 (5388): Can't acquire lockfile (32) - waiting 35s
13:48:13 (5388): Can't acquire lockfile (32) - exiting
13:48:13 (5388): Error: The process cannot access the file because it is being used by another process. (0x20)
13:58:49 (7580): Can't acquire lockfile (32) - waiting 35s
13:59:24 (7580): Can't acquire lockfile (32) - exiting
13:59:24 (7580): Error: The process cannot access the file because it is being used by another process. (0x20)
14:15:52 (5132): Can't acquire lockfile (32) - waiting 35s
14:16:27 (5132): Can't acquire lockfile (32) - exiting
14:16:27 (5132): Error: The process cannot access the file because it is being used by another process. (0x20)
14:31:54 (7956): Can't acquire lockfile (32) - waiting 35s
14:32:29 (7956): Can't acquire lockfile (32) - exiting
14:32:29 (7956): Error: The process cannot access the file because it is being used by another process. (0x20)
14:50:50 (6784): Can't acquire lockfile (32) - waiting 35s
14:51:25 (6784): Can't acquire lockfile (32) - exiting
14:51:25 (6784): Error: The process cannot access the file because it is being used by another process. (0x20)
15:14:50 (7568): Can't acquire lockfile (32) - waiting 35s
15:15:25 (7568): Can't acquire lockfile (32) - exiting
15:15:25 (7568): Error: The process cannot access the file because it is being used by another process. (0x20)
15:26:01 (7360): Can't acquire lockfile (32) - waiting 35s
15:26:36 (7360): Can't acquire lockfile (32) - exiting
15:26:36 (7360): Error: The process cannot access the file because it is being used by another process. (0x20)
15:36:47 (7592): Can't acquire lockfile (32) - waiting 35s
15:37:22 (7592): Can't acquire lockfile (32) - exiting
15:37:22 (7592): Error: The process cannot access the file because it is being used by another process. (0x20)
15:58:17 (8904): Can't acquire lockfile (32) - waiting 35s
15:58:52 (8904): Can't acquire lockfile (32) - exiting
15:58:52 (8904): Error: The process cannot access the file because it is being used by another process. (0x20)
16:09:53 (8696): Can't acquire lockfile (32) - waiting 35s
16:10:28 (8696): Can't acquire lockfile (32) - exiting
16:10:28 (8696): Error: The process cannot access the file because it is being used by another process. (0x20)
16:30:56 (4480): Can't acquire lockfile (32) - waiting 35s
16:31:31 (4480): Can't acquire lockfile (32) - exiting
16:31:31 (4480): Error: The process cannot access the file because it is being used by another process. (0x20)
16:41:40 (9112): Can't acquire lockfile (32) - waiting 35s
16:42:15 (9112): Can't acquire lockfile (32) - exiting
16:42:15 (9112): Error: The process cannot access the file because it is being used by another process. (0x20)
16:59:52 (7784): Can't acquire lockfile (32) - waiting 35s
17:00:27 (7784): Can't acquire lockfile (32) - exiting
17:00:27 (7784): Error: The process cannot access the file because it is being used by another process. (0x20)
17:11:18 (8332): Can't acquire lockfile (32) - waiting 35s
17:11:53 (8332): Can't acquire lockfile (32) - exiting
17:11:53 (8332): Error: The process cannot access the file because it is being used by another process. (0x20)
17:22:29 (8428): Can't acquire lockfile (32) - waiting 35s
17:23:04 (8428): Can't acquire lockfile (32) - exiting
17:23:04 (8428): Error: The process cannot access the file because it is being used by another process. (0x20)
17:33:40 (7728): Can't acquire lockfile (32) - waiting 35s
17:34:15 (7728): Can't acquire lockfile (32) - exiting
17:34:15 (7728): Error: The process cannot access the file because it is being used by another process. (0x20)
17:46:15 (8860): Can't acquire lockfile (32) - waiting 35s
17:46:50 (8860): Can't acquire lockfile (32) - exiting
17:46:50 (8860): Error: The process cannot access the file because it is being used by another process. (0x20)
18:01:27 (2464): Can't acquire lockfile (32) - waiting 35s
18:02:02 (2464): Can't acquire lockfile (32) - exiting
18:02:02 (2464): Error: The process cannot access the file because it is being used by another process. (0x20)
18:12:56 (136): Can't acquire lockfile (32) - waiting 35s
18:13:31 (136): Can't acquire lockfile (32) - exiting
18:13:31 (136): Error: The process cannot access the file because it is being used by another process. (0x20)
18:29:29 (6392): Can't acquire lockfile (32) - waiting 35s
18:30:04 (6392): Can't acquire lockfile (32) - exiting
18:30:04 (6392): Error: The process cannot access the file because it is being used by another process. (0x20)
18:40:41 (9392): Can't acquire lockfile (32) - waiting 35s
18:41:16 (9392): Can't acquire lockfile (32) - exiting
18:41:16 (9392): Error: The process cannot access the file because it is being used by another process. (0x20)
18:51:59 (9524): Can't acquire lockfile (32) - waiting 35s
18:52:34 (9524): Can't acquire lockfile (32) - exiting
18:52:34 (9524): Error: The process cannot access the file because it is being used by another process. (0x20)
19:03:10 (10012): Can't acquire lockfile (32) - waiting 35s
19:03:45 (10012): Can't acquire lockfile (32) - exiting
19:03:45 (10012): Error: The process cannot access the file because it is being used by another process. (0x20)
19:15:16 (7336): Can't acquire lockfile (32) - waiting 35s
19:15:51 (7336): Can't acquire lockfile (32) - exiting
19:15:51 (7336): Error: The process cannot access the file because it is being used by another process. (0x20)
19:41:42 (8884): Can't acquire lockfile (32) - waiting 35s
19:42:17 (8884): Can't acquire lockfile (32) - exiting
19:42:17 (8884): Error: The process cannot access the file because it is being used by another process. (0x20)
19:53:44 (9384): Can't acquire lockfile (32) - waiting 35s
19:54:19 (9384): Can't acquire lockfile (32) - exiting
19:54:19 (9384): Error: The process cannot access the file because it is being used by another process. (0x20)
20:04:55 (7816): Can't acquire lockfile (32) - waiting 35s
20:05:30 (7816): Can't acquire lockfile (32) - exiting
20:05:30 (7816): Error: The process cannot access the file because it is being used by another process. (0x20)
20:15:30 (10016): Can't acquire lockfile (32) - waiting 35s
20:16:05 (10016): Can't acquire lockfile (32) - exiting
20:16:05 (10016): Error: The process cannot access the file because it is being used by another process. (0x20)

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
26 Aug 2012 13:31:08 1186899 15134487 hadcm3n_ylrw_1980_40_008154412_0 388,800 701,991 1.8055
25 Aug 2012 23:38:24 1186899 15134487 hadcm3n_ylrw_1980_40_008154412_0 362,880 656,325 1.8087
25 Aug 2012 10:35:08 1186899 15134487 hadcm3n_ylrw_1980_40_008154412_0 336,960 609,918 1.8101
24 Aug 2012 21:32:03 1186899 15134487 hadcm3n_ylrw_1980_40_008154412_0 311,040 563,086 1.8103
24 Aug 2012 08:16:08 1186899 15134487 hadcm3n_ylrw_1980_40_008154412_0 285,120 515,699 1.8087
23 Aug 2012 19:31:26 1186899 15134487 hadcm3n_ylrw_1980_40_008154412_0 259,200 468,131 1.8061
22 Aug 2012 16:18:33 1186899 15134487 hadcm3n_ylrw_1980_40_008154412_0 233,280 421,934 1.8087
22 Aug 2012 03:35:10 1186899 15134487 hadcm3n_ylrw_1980_40_008154412_0 207,360 376,937 1.8178
21 Aug 2012 14:36:10 1186899 15134487 hadcm3n_ylrw_1980_40_008154412_0 181,440 330,728 1.8228
21 Aug 2012 01:18:10 1186899 15134487 hadcm3n_ylrw_1980_40_008154412_0 155,520 283,559 1.8233
20 Aug 2012 12:35:15 1186899 15134487 hadcm3n_ylrw_1980_40_008154412_0 129,600 236,481 1.8247
19 Aug 2012 22:07:14 1186899 15134487 hadcm3n_ylrw_1980_40_008154412_0 103,680 187,489 1.8083
19 Aug 2012 09:04:11 1186899 15134487 hadcm3n_ylrw_1980_40_008154412_0 77,760 140,651 1.8088
18 Aug 2012 19:04:24 1186899 15134487 hadcm3n_ylrw_1980_40_008154412_0 51,840 93,771 1.8089
18 Aug 2012 05:58:36 1186899 15134487 hadcm3n_ylrw_1980_40_008154412_0 25,920 47,070 1.8160


©2024 cpdn.org