  1. ESCAPE DataLake Operations
  2. EDLK-95

LAPP upload failures with rucio

    Description

      For all transfers with LAPP-WEBDAV as the source, the Rucio dashboard shows "no data", but on the FTS dashboard everything seems OK.
      Here are the logs from an attempt to upload a test file there:

      [root@624071128b1d src]# export FILE=`uuidgen` && echo "test">> $FILE && rucio -v upload --rse LAPP-WEBDAV --lifetime 3600 --scope SKA_SKAO_JOSHI-testing --register-after-upload $FILE
      2020-11-05 10:07:06,344 DEBUG   uploadclient.py upload  Num. of files that upload client is processing: 1
      2020-11-05 10:07:07,329 DEBUG   uploadclient.py upload  Input validation done.
      2020-11-05 10:07:07,330 INFO    Preparing upload for file e4eab913-8923-4c20-b742-a852f3ffca46
      2020-11-05 10:07:08,130 DEBUG   uploadclient.py upload  wan domain is used for the upload
      2020-11-05 10:07:08,288 DEBUG   gfal.py connect connecting
      2020-11-05 10:07:08,650 DEBUG   gfal.py exists  path None
      2020-11-05 10:07:08,651 DEBUG   gfal.py __gfal2_exist   path None
      2020-11-05 10:07:08,788 DEBUG   rsemanager.py   exists  Checking if davs://lapp-esc02.in2p3.fr:8001/webdav/SKA_SKAO_JOSHI-testing/6a/b4/e4eab913-8923-4c20-b742-a852f3ffca46 exists
      2020-11-05 10:07:08,789 DEBUG   gfal.py exists  path davs://lapp-esc02.in2p3.fr:8001/webdav/SKA_SKAO_JOSHI-testing/6a/b4/e4eab913-8923-4c20-b742-a852f3ffca46
      2020-11-05 10:07:08,789 DEBUG   gfal.py __gfal2_exist   path davs://lapp-esc02.in2p3.fr:8001/webdav/SKA_SKAO_JOSHI-testing/6a/b4/e4eab913-8923-4c20-b742-a852f3ffca46
      2020-11-05 10:08:09,097 ERROR   The requested service is not available at the moment.
      Details: An unknown exception occurred.
      Details: Connection timed out
      Completed in 62.8069 sec.
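
The upload stalls between the last `__gfal2_exist` DEBUG line and the ERROR line. A minimal sketch that measures that gap from the two timestamps in the log above follows. The helper name `parse_ts` is hypothetical, and the reading that a gap of roughly 60 s points to a client-side connect timeout rather than a server-side rejection is an assumption, not something the log states.

```python
from datetime import datetime

# Timestamps copied from the log above: the last DEBUG line before the stall
# and the ERROR line that ends the upload attempt.
LAST_DEBUG = "2020-11-05 10:07:08,789 DEBUG   gfal.py __gfal2_exist"
ERROR_LINE = "2020-11-05 10:08:09,097 ERROR   The requested service is not available at the moment."


def parse_ts(line):
    """Parse the leading 'YYYY-MM-DD HH:MM:SS,mmm' timestamp of a Rucio client log line."""
    return datetime.strptime(line[:23], "%Y-%m-%d %H:%M:%S,%f")


def stall_seconds(before, after):
    """Seconds elapsed between two log lines."""
    return (parse_ts(after) - parse_ts(before)).total_seconds()


gap = stall_seconds(LAST_DEBUG, ERROR_LINE)
print(f"stalled for {gap:.3f} s before the error")
```

The rest of the 62.8 s reported by the client is the setup before the existence check.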
      

      UPDATED to include LAPP-DCACHE failures. These are seen across 3 machines and at least 2 separate user certificates.

      [root@55a9ed027720 rucio-analysis]# export FILE=`uuidgen` && echo "test">> $FILE && rucio -v upload --rse LAPP-DCACHE --lifetime 3600 --scope SKA_SKAO_JOSHI-testing --register-after-upload $FILE
      2020-11-09 10:42:58,186 DEBUG   uploadclient.py upload  Num. of files that upload client is processing: 1
      2020-11-09 10:42:58,306 DEBUG   uploadclient.py upload  Input validation done.
      2020-11-09 10:42:58,306 INFO    Preparing upload for file f2aba86a-c7cc-4b5f-97de-3c5f84ea15b4
      2020-11-09 10:42:58,417 DEBUG   uploadclient.py upload  wan domain is used for the upload
      2020-11-09 10:42:58,430 DEBUG   gfal.py connect connecting
      2020-11-09 10:42:58,448 DEBUG   gfal.py exists  path None
      2020-11-09 10:42:58,448 DEBUG   gfal.py __gfal2_exist   path None
      2020-11-09 10:42:58,477 DEBUG   rsemanager.py   exists  Checking if davs://lapp-dcache01.in2p3.fr:2880//data/escape/rucio/lapp_dcache/SKA_SKAO_JOSHI-testing/62/58/f2aba86a-c7cc-4b5f-97de-3c5f84ea15b4 exists
      2020-11-09 10:42:58,477 DEBUG   gfal.py exists  path davs://lapp-dcache01.in2p3.fr:2880//data/escape/rucio/lapp_dcache/SKA_SKAO_JOSHI-testing/62/58/f2aba86a-c7cc-4b5f-97de-3c5f84ea15b4
      2020-11-09 10:42:58,477 DEBUG   gfal.py __gfal2_exist   path davs://lapp-dcache01.in2p3.fr:2880//data/escape/rucio/lapp_dcache/SKA_SKAO_JOSHI-testing/62/58/f2aba86a-c7cc-4b5f-97de-3c5f84ea15b4
      2020-11-09 10:43:58,590 ERROR   The requested service is not available at the moment.
      Details: An unknown exception occurred.
      Details: Connection timed out
      Completed in 60.4079 sec.
      

      At the gfal level:

      [root@624071128b1d src]# gfal-ls -la davs://lapp-dcache01.in2p3.fr:2880//data/escape/rucio/lapp_dcache/
      gfal-ls error: 110 (Connection timed out) - Connection timed out
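
Since `gfal-ls` already fails with error 110 (connection timed out), a plain TCP connect from each affected machine may help separate a network or firewall problem from a Rucio, gfal, or certificate problem. The following is only a sketch: the function names `tcp_reachable` and `report` are hypothetical, the host and port pairs are taken from the `davs://` URLs in the logs above, and a successful TCP connect would not prove that the WebDAV door itself is healthy.

```python
import socket

# Host/port pairs taken from the davs:// URLs in the logs above.
ENDPOINTS = [
    ("lapp-esc02.in2p3.fr", 8001),     # LAPP-WEBDAV
    ("lapp-dcache01.in2p3.fr", 2880),  # LAPP-DCACHE
]


def tcp_reachable(host, port, timeout=5.0):
    """Try a plain TCP connect (no TLS, no WebDAV) and return (ok, detail)."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True, "connected"
    except (socket.timeout, TimeoutError):
        # Packets silently dropped, e.g. by a firewall between client and site.
        return False, "timed out"
    except OSError as exc:
        # Connection refused, DNS failure, no route to host, and so on.
        return False, f"{type(exc).__name__}: {exc}"


def report(endpoints=ENDPOINTS, timeout=5.0):
    """Print one line per endpoint; meant to be run on each client machine."""
    for host, port in endpoints:
        ok, detail = tcp_reachable(host, port, timeout)
        print(f"{host}:{port} -> {'OK' if ok else 'FAIL'} ({detail})")
```

Calling `report()` on a machine where the upload fails and on one where `gfal-ls` works would show whether the timeout depends on the client's network location.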
      

      Grange, Yan observes the same issue:

      Singularity> export FILE=`uuidgen` && echo "test">> $FILE && rucio -v upload --rse LAPP-DCACHE --lifetime 3600 --scope LOFAR_ASTRON_GRANGE --register-after-upload $FILE
      2020-11-03 15:00:59,579	DEBUG	uploadclient.py	upload	Num. of files that upload client is processing: 1
      2020-11-03 15:00:59,716	DEBUG	uploadclient.py	upload	Input validation done.
      2020-11-03 15:00:59,716	INFO	Preparing upload for file 292d2866-5732-45ad-9475-917c29b14f79
      2020-11-03 15:00:59,825	DEBUG	uploadclient.py	upload	wan domain is used for the upload
      2020-11-03 15:00:59,841	DEBUG	gfal.py	connect	connecting
      2020-11-03 15:00:59,881	DEBUG	gfal.py	exists	path None
      2020-11-03 15:00:59,881	DEBUG	gfal.py	__gfal2_exist	path None
      2020-11-03 15:00:59,925	DEBUG	rsemanager.py	exists	Checking if davs://lapp-dcache01.in2p3.fr:2880//data/escape/rucio/lapp_dcache/LOFAR_ASTRON_GRANGE/3e/6a/292d2866-5732-45ad-9475-917c29b14f79 exists
      2020-11-03 15:00:59,925	DEBUG	gfal.py	exists	path davs://lapp-dcache01.in2p3.fr:2880//data/escape/rucio/lapp_dcache/LOFAR_ASTRON_GRANGE/3e/6a/292d2866-5732-45ad-9475-917c29b14f79
      2020-11-03 15:00:59,925	DEBUG	gfal.py	__gfal2_exist	path davs://lapp-dcache01.in2p3.fr:2880//data/escape/rucio/lapp_dcache/LOFAR_ASTRON_GRANGE/3e/6a/292d2866-5732-45ad-9475-917c29b14f79
      2020-11-03 15:02:00,090	ERROR	The requested service is not available at the moment.
      Details: An unknown exception occurred.
      Details: Connection timed out
      Completed in 60.5808 sec.
      

      but Di Maria, Riccardo does not:

      [root@escape-crons-78bc6669f8-sg2wk scripts]# gfal-ls -la davs://lapp-dcache01.in2p3.fr:2880//data/escape/rucio/lapp_dcache/
      drwxrwxrwx   0 0     0             0 Oct  1 14:34 gfal_sam	
      drwxrwxrwx   0 0     0             0 Oct 14 08:03 ESCAPE_CERN_TEAM-noise	
      drwxrwxrwx   0 0     0             0 Oct  8 07:02 LSST_CCIN2P3_GOUNON	
      drwxrwxrwx   0 0     0             0 Oct  9 14:07 SKA_SKAO_BARNSLEY-testing	
      drwxrwxrwx   0 0     0             0 Oct  6 15:03 FAIR_GSI_SZUBA	
      drwxrwxrwx   0 0     0             0 Oct 28 09:23 CTA_LAPP_FREDERIC	
      drwxrwxrwx   0 0     0             0 Oct 15 09:38 ESCAPE_DESY_TEAM-testing	
      drwxrwxrwx   0 0     0             0 Oct  5 15:17 fts-testing	
      drwxrwxrwx   0 0     0             0 Oct  6 15:02 rucio-testing	
      drwxrwxrwx   0 0     0             0 Oct 21 16:47 SKA_SKAO_COLL-testing	
      drwxrwxrwx   0 0     0             0 Oct 27 16:44 SKA_SKAO_JOSHI-testing	
      drwxrwxrwx   0 0     0             0 Oct 30 10:16 ATLAS_LAPP_JEZEQUEL
      

              People

                Gillardo, Frederic [X] (Inactive)
                Musset, Paul [X] (Inactive)