LSST school & workshop: Getting ready to do science with LSST data

Name: LSST school & workshop: Getting ready to do science with LSST data
Start: 2017-06-12T08:15:00+02:00
End: 2017-06-16T18:15:00+02:00
Location: IN2P3 computing center

12–16 June 2017

Time zone: Europe/Paris

Exploring Spark and MongoDb for LSST

15 June 2017, 12:20

30m

Amphitheatre (IN2P3 computing center)

21 Avenue Pierre de Coubertin CS70202 69627 VILLEURBANNE Cedex FRANCE Lat: 45°46'57.8"N Lon: 4°51'54.9"E

Workshop

Mr. Christian Arnault (CNRS)

Spark is a very promising technology offering distributed data and computing mechanisms. At LAL (Orsay) we have started to look at how the typical computing workflows used in LSST could use the Spark ecosystem: how to distribute algorithms in a map-reduce approach, and how to format various data structures so that they can be partitioned in a distributed file system. To this end, an OpenStack-based cluster has been configured at LAL with Spark and its various associated components, and several models are being tested to evaluate performance and configuration parameters (memory, CPU, …).

In the same context, while exploring various technologies related to QServ and to catalog access techniques, we are working on two promising technologies: MongoDB and Spark DataFrames, both of which offer a natural approach to distributing data or processing. The method is the same for both: we take one limited dataset (2 TB of sources and objects) and try to apply the benchmarking queries that were previously applied to QServ. The concepts, the ingestion, and the querying methods are explored, looking in particular at possible functional or performance limitations of each system.

Several platforms are used for this study: the Galactica cluster at Clermont (Petasky context), OpenStack at LAL (VirtualData context), and a test cluster at CCIN2P3.
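The map-reduce and partitioning ideas mentioned in the abstract can be illustrated with a minimal, single-process Python sketch. It is not taken from the talk: the toy catalog, the right-ascension banding scheme, and all function names are assumptions chosen for illustration, and plain Python lists stand in for the distributed partitions that a framework such as Spark would manage.

```python
from collections import defaultdict
from functools import reduce

# Hypothetical toy catalog rows: (source_id, ra_deg, dec_deg, object_id).
SOURCES = [
    (1, 10.2, -5.1, 100),
    (2, 10.3, -5.0, 100),
    (3, 200.7, 12.4, 101),
    (4, 200.8, 12.5, 101),
    (5, 200.9, 12.6, 101),
    (6, 359.9, 45.0, 102),
]


def partition_by_ra(rows, n_partitions):
    """Split rows into n_partitions equal-width right-ascension bands."""
    width = 360.0 / n_partitions
    parts = [[] for _ in range(n_partitions)]
    for row in rows:
        ra = row[1] % 360.0
        # Clamp the index to guard against floating-point edge cases.
        index = min(int(ra // width), n_partitions - 1)
        parts[index].append(row)
    return parts


def map_partition(rows):
    """Map step: count sources per object inside one partition."""
    counts = defaultdict(int)
    for _, _, _, object_id in rows:
        counts[object_id] += 1
    return dict(counts)


def merge_counts(left, right):
    """Reduce step: merge two partial count dictionaries."""
    merged = dict(left)
    for key, value in right.items():
        merged[key] = merged.get(key, 0) + value
    return merged


def sources_per_object(rows, n_partitions=4):
    """Count sources per object with a partition, map, reduce pipeline."""
    parts = partition_by_ra(rows, n_partitions)
    # On a real cluster each partition would be processed on a separate worker.
    partials = [map_partition(part) for part in parts]
    return reduce(merge_counts, partials, {})


if __name__ == "__main__":
    print(sources_per_object(SOURCES))
```

Because the reduce step only merges small per-partition summaries, the same structure carries over to a setting where each partition lives on a different node; the choice of partitioning key (here right ascension) then determines how evenly the work is spread.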

Topic:	Computing infrastructure and data management

Slides

20170615_SparkMongoDb.pptx

Video

https://webcast.in2p3.fr/videos-exploring_spark_and_mongodb_for_lsst
