Kai Polsterer
(HITS gGmbH)
With the exponential increase in available data, new data analysis paradigms are required. The traditional single-source science approaches used with today's data archives do not scale to recent data challenges. Machine learning is one of the key tools for dealing with this data avalanche. This raises new topics that need to be addressed:
- pre-processing and compressed representations of data
- from data formats to data interfaces
- metadata, provenance, versioning, snapshots and others