WP2 fortnightly meeting
→
Europe/Paris
Description
Join Zoom Meeting https://cern.zoom.us/j/92926380866?pwd=YkNNb0lPM0RCbjJMajh0SmJBUUV2QT09 Meeting ID: 929 2638 0866 Passcode: 373008
-
-
11:00
→
11:10
News 10m Speakers: Rosie Bolton (Square Kilometre Array Organisation), Xavier Espinal (CERN)
- The mid-term review will take place in the second half of November 2020.
- The EC reviewer has been appointed.
- Preparations: ~30 min per WP plus ~20 min discussion; draft to be circulated with project management by Friday 23 October.
- E-EB meeting in the first week of November (2-6) to discuss and review the slides of each WP.
- Milestones and deliverables:
- M2.3 (M22): Second WP2/DIOS Workshop (virtual event)
- Proposal: 9th of December
- At the last meeting we were targeting 26 Nov, but yesterday WP5 sent the agenda for their progress meeting on 26-27 November (https://indico.in2p3.fr/event/22482/).
- M2.4 (M24): Expanded prototype. Verify experiment data access from compute platforms (including commercial clouds)
- Heads-up: the storage resources deployed need to be reviewed, so that the data lake infrastructure evolves towards a meaningful size and allows the data ingestion and throughput challenges to be scaled up.
- FDR exercise dates proposal
- 1st Dress Rehearsal: approx. one week before the workshop (24th Nov)
- 2nd Dress Rehearsal: approx. two weeks after the workshop (15th Dec)
- Misc:
- EC-Projects:
- EOSC-Future (INFRAEOSC03) proposal approved.
- MECHANICS proposal not approved (interest of ESCAPE was related to the integration of commercial cloud resources)
- PRACE-CERN-GÉANT-SKAO kick-off workshop on High Performance Computing 29 September [agenda]
- Presentations by Ian, Rosie and myself
- ESCAPE talks:
- American Geophysical Union conference:
- https://www.agu.org/fall-meeting (date not fixed yet: ~1-17 December)
- PHIDIAS project (13 Oct, webinar):
- Research Data Alliance (RDA) virtual plenary meeting: Talk + BoF session (date TBD)
- Possible contributions:
- ADASS (poster call still open): https://adass2020.es (Nov 8-12)
-
11:10
→
11:15
Pilot data lake assessment: EOS EULAKE update 5m Speaker: Xavier Espinal (CERN)
- The EOS instance at CERN (aka EULAKE) is functional; transfer tests look reasonably good overall.
- Several issues were fixed
- Some of them coming from legacy usage (first prototype for early data lake tests)
- Directory permissions
- Legacy disk layouts, legacy configurations coming from Wigner times.
- Remote FSTs running old FST daemons (or left unattended) have been set to read-only; let me know in case you want them back (IIRC this is tracked by the DepOps team).
- Spotted some syntax problems in the CRIC configuration; they have been fixed.
- 2 more disk servers have been added to allow setting up different QoS and in preparation for the prototype phase: 163 FS and 460TB in total.
- xrdcp, GFAL, FTS and Rucio OK. WebDAV, gsiftp and root protocols enabled.
- A few remaining issues (being followed up by the DepOps team)
- xroot TPC when EOS is the source and dCache is the destination
- When the TPC sockets are opened, EOS FSTs accept only UNIX connections; if the dCache node makes only GSI outgoing connections, the two sides cannot agree on an authentication protocol.
- The fix is in the dCache mover configuration: pool.mover.xrootd.tpc-authn-plugins=gsi,unix
- GSI TPC: the FQDN of the client as seen at the destination differs from the FQDN the client sees when connecting to the source, giving a TPC origin mismatch.
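The authentication mismatch in the xroot TPC issue above can be illustrated with a minimal sketch. It models the negotiation as picking the first protocol offered by the connecting side that the accepting side also supports. This is a deliberate simplification, not the actual XRootD handshake, and the function name `negotiate` is hypothetical; only the plugin list `gsi,unix` comes from the notes.

```python
def negotiate(offered, accepted):
    """Return the first offered protocol that the other side accepts, or None."""
    for proto in offered:
        if proto in accepted:
            return proto
    return None

# Per the notes: EOS FSTs accept only UNIX connections on the TPC socket.
eos_fst_accepts = {"unix"}

# A dCache pool offering only GSI finds no common protocol, so the TPC fails.
before_fix = negotiate(["gsi"], eos_fst_accepts)

# With pool.mover.xrootd.tpc-authn-plugins=gsi,unix the pool can fall back to UNIX.
after_fix = negotiate(["gsi", "unix"], eos_fst_accepts)

print(before_fix)  # None
print(after_fix)   # unix
```

Listing `gsi` first keeps GSI as the preferred protocol for sources that support it, with `unix` used only as a fallback.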
-
11:15
→
11:20
Pilot data lake assessment: Experiment data injection update 5m Speakers: Andrea Ceccanti (INFN), Riccardo Di Maria (CERN), Yan Grange (ASTRON, the Netherlands Institute for Radio Astronomy)
-
11:20
→
11:25
Pilot data lake assessment: QoS/data lifecycles update 5m Speakers: M. Muhammad Aleem Sarwar (ESCAPE Project), Paul Millar (DESY)
-
11:25
→
11:30
Pilot data lake assessment: datalake automated tests and monitoring 5m Speakers: Rizart Dona (CERN), Rosie Bolton (Square Kilometre Array Organisation)
- Dashboards development is ongoing.
- Gfal tests are running and are successful for all sites at the moment (https://monit-grafana.cern.ch/d/TMScKNjWk/gfal-testing?orgId=51).
- FTS tests are running and are ~90% successful at the moment (https://monit-grafana.cern.ch/d/000000420/fts-transfers?orgId=51); details of the failures are being followed up in the DepOps meeting as well as in the JIRA tickets.
- Rucio tests are running.
- The Rucio hermes2 deployment effort is ongoing. hermes1 is not running at the moment, so no Rucio events appear on the dashboard; this will be restored by the end of the week.
- Development of the testing code continues; token-based authZ integration is yet to be implemented, among other things (proper error handling/metadata support for FTS transfers, etc.).
- We need to start coordinating how to perform the periodic data lake debugging process, that is: inspect monitoring, identify the current issues at sites/endpoints, and take action to solve them.
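One possible starting point for the periodic debugging process described above is to compute a per-site success rate from recent transfer records and list the sites that fall below a threshold. The sketch below assumes a hypothetical record format (a site name plus a success flag) and a hypothetical 90% threshold; the site names are placeholders, and the actual data model behind the dashboards may differ.

```python
from collections import defaultdict

def failing_sites(transfers, threshold=0.9):
    """Return (site, success_rate) pairs for sites below the threshold, sorted by site."""
    totals = defaultdict(int)
    succeeded = defaultdict(int)
    for site, success in transfers:
        totals[site] += 1
        if success:
            succeeded[site] += 1
    rates = {site: succeeded[site] / totals[site] for site in totals}
    return sorted((site, rate) for site, rate in rates.items() if rate < threshold)

# Placeholder records: (site, transfer succeeded?)
sample = [
    ("SITE-A", True), ("SITE-A", True),
    ("SITE-B", True), ("SITE-B", False),
    ("SITE-C", False),
]
flagged = failing_sites(sample)
print(flagged)  # [('SITE-B', 0.5), ('SITE-C', 0.0)]
```

The flagged list could then serve as the agenda for each debugging round, with one follow-up action per site.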
-
11:35
→
11:55
AOB/Shadow round table 20m
Please chime in in case you have something to report:
- Sites: CERN, INFN, DESY, GSI, Nikhef, RUG, SURFsara, CC-IN2P3, IFAE-PIC, LAPP, INAF, AARNet
- Experiments: HL-LHC, FAIR, KM3NeT, SKA, CTA