Einstein@Home keeping TCP connections open?

Yatima
Yatima
Joined: 23 Feb 05
Posts: 1
Credit: 6347715
RAC: 0
Topic 196772

Hello,

I have just noticed that the Einstein@Home program seems to be keeping TCP connections open. More precisely, the TCP connections to the Einstein@Home servers stay in CLOSE_WAIT, apparently forever, even if only the Boinc Client is currently running and no workunits are being crunched.

This being Linux, I use "netstat" to dump the connections, so for example I see:

tcp        1      0 192.168.1.63:39248      130.75.116.34:80        CLOSE_WAIT 
tcp        1      0 192.168.1.63:39249      130.75.116.34:80        CLOSE_WAIT 

i.e. two connections from the local machine to 130.75.116.34:80 (einstein4.aei.uni-hannover.de), staying CLOSE_WAIT.

Probably a bug whereby the connection is not closed locally for some reason. See:

http://www.sunmanagers.org/pipermail/summaries/2006-January/007068.html

"The problem is your program running on the local machine is not closing the socket. It is not a TCP tuning issue. A connection can (and quite correctly) stay in CLOSE_WAIT forever while the program holds the
connection open."

P.S. Could anyone look into increasing the width of this textarea? 55 chars wide is ridiculously small on anything that's past VT220.

Bernd Machenschalk
Bernd Machenschalk
Moderator
Administrator
Joined: 15 Oct 04
Posts: 4275
Credit: 245504102
RAC: 11292

Einstein@Home keeping TCP connections open?

Well, the Einstein@Home application normally shouldn't connect to the outside at all (in some BOINC Windows API signal handlers there is something to load the symbols from a project symstore, but only if the program crashes).

einstein4 is the server results get uploaded to by the BOINC Client. It is certainly possible that there is a bug in the upload routines there.

BM

BM

Neil Newell
Neil Newell
Joined: 20 Nov 12
Posts: 176
Credit: 169699457
RAC: 0

It's very common to see loads

It's very common to see loads of connections in CLOSE_WAIT, especially for HTTP where all sorts of interesting proxying is done by ISPs and carriers, with HTTP 1.1 persistent connections and mobile devices adding to the fun.

There's 8 (4 BRP, 2 Gamma, 2 BOINC) on this box to e@h:80 as I type, with 5 and 4 on a couple of other local hosts. On webservers, I often see much larger numbers of connections in this state.

Zapp
Zapp
Joined: 27 Mar 10
Posts: 5
Credit: 1354983
RAC: 0

I'm seeing this as

I'm seeing this as well:

# netstat -nap | grep CLOSE_WAIT
tcp        1      0 172.23.112.99:34971     193.1.132.201:80        CLOSE_WAIT  22070/hsgamma_FGRP2 
tcp        1      0 172.23.112.99:34893     193.1.132.201:80        CLOSE_WAIT  21591/wcg_hpf2_rose 
tcp        1      0 172.23.112.99:34970     193.1.132.201:80        CLOSE_WAIT  21914/wcgrid_sn2s_v 
tcp        1      0 172.23.112.99:35451     193.1.132.201:80        CLOSE_WAIT  14596/boinc         


The funny thing is that the stuck connection is to boinc.ucd.ie (FMAH), so it seems that the boinc client occasionally forgets to close a descriptor used for server communication and gets inherited by all science apps.

Neil Newell
Neil Newell
Joined: 20 Nov 12
Posts: 176
Credit: 169699457
RAC: 0

RE: Well, the Einstein@Home

Quote:
Well, the Einstein@Home application normally shouldn't connect to the outside at all

Without knowing how it works internally, I'm guessing that the BOINC/application interface results in the application owning the file descriptor - so although there may be no explicit connection, connections are made in the context of the application and *appear* to come directly from it (at least as far as 'netstat' is concerned, anyway - see below).

tcp        1      0 stone.woodbarn.lo:34247 einstein4.aei.uni-h:www CLOSE_WAIT  11385/einsteinbinar 
tcp        1      0 stone.woodbarn.lo:34254 einstein4.aei.uni-h:www CLOSE_WAIT  2281/boinc          
tcp        1      0 stone.woodbarn.lo:34243 einstein4.aei.uni-h:www CLOSE_WAIT  11314/hsgamma_FGRP2 
tcp        1      0 stone.woodbarn.lo:34236 einstein4.aei.uni-h:www CLOSE_WAIT  11258/hsgamma_FGRP2 
tcp        1      0 stone.woodbarn.lo:34248 einstein4.aei.uni-h:www CLOSE_WAIT  11385/einsteinbinar 
tcp        1      0 stone.woodbarn.lo:34246 einstein4.aei.uni-h:www CLOSE_WAIT  11318/einsteinbinar 
tcp        1      0 stone.woodbarn.lo:34242 einstein4.aei.uni-h:www CLOSE_WAIT  11314/hsgamma_FGRP2 
tcp        1      0 stone.woodbarn.lo:33927 einstein.phys.uwm.e:www CLOSE_WAIT  11251/hsgamma_FGRP2 
tcp        1      0 stone.woodbarn.lo:34255 einstein4.aei.uni-h:www CLOSE_WAIT  2281/boinc          
tcp        1      0 stone.woodbarn.lo:34235 einstein4.aei.uni-h:www CLOSE_WAIT  11258/hsgamma_FGRP2 
tcp        1      0 stone.woodbarn.lo:34245 einstein4.aei.uni-h:www CLOSE_WAIT  11318/einsteinbina


Comment viewing options

Select your preferred way to display the comments and click "Save settings" to activate your changes.