Author: Clegg, R.J.
Paper Title Page
MOMPR007 Scalable High Demand Analytics Environments with Heterogeneous Clouds 171
MOPHA161   use link to see paper's listing under its alternate paper code  
 
  • K. Woods, R.J. Clegg, R. Millward
    Tessella, Abingdon, United Kingdom
  • F. Barnsely, C. Jones
    STFC/RAL, Chilton, Didcot, Oxon, United Kingdom
 
  Funding: UK Research and Innovation - Science & Technology Facilities Council (UK SBS IT18160)
The Ada Lovelace Centre (ALC) at STFC provides on-demand, data analysis, interpretation and analytics services to scientists using UK research facilities. ALC and Tessella have built software systems to scale analysis environments to handle peaks and troughs in demand as well as to reduce latency by provision environments closer to scientists around the world. The systems can automatically provision infrastructure and supporting systems within compute resources around the world and in different cloud types (including commercial providers). The system then uses analytics to dynamically provision and configure virtual machines in various locations ahead of demand so that users experience as little delay as possible. In this poster, we report on the architecture and complex software engineering used to automatically scale analysis environments to heterogeneous clouds, make them secure and easy to use. We then discuss how analytics was used to create intelligent systems in order to allow a relatively small team to focus on innovation rather than operations.
 
poster icon Poster MOMPR007 [1.650 MB]  
DOI • reference for this paper ※ https://doi.org/10.18429/JACoW-ICALEPCS2019-MOMPR007  
About • paper received ※ 30 September 2019       paper accepted ※ 09 October 2019       issue date ※ 30 August 2020  
Export • reference for this paper using ※ BibTeX, ※ LaTeX, ※ Text/Word, ※ RIS, ※ EndNote (xml)  
 
MOPHA160 Enabling Data Analytics as a Service for Large Scale Facilities 614
 
  • K. Woods, R.J. Clegg, N.S. Cook, R. Millward
    Tessella, Abingdon, United Kingdom
  • F. Barnsely, C. Jones
    STFC/RAL, Chilton, Didcot, Oxon, United Kingdom
 
  Funding: UK Research and Innovation - Science & Technology Facilities Council (UK SBS IT18160)
The Ada Lovelace Centre (ALC) at STFC is an integrated, cross-disciplinary data intensive science centre, for better exploitation of research carried out at large scale UK Facilities including the Diamond Light Source, the ISIS Neutron and Muon Facility, the Central Laser Facility and the Culham Centre for Fusion Energy. ALC will provide on-demand, data analysis, interpretation and analytics services to worldwide users of these research facilities. Using open-source components, ALC and Tessella have together created a software infrastructure to support the delivery of that vision. The infrastructure comprises a Virtual Machine Manager, for managing pools of VMs across distributed compute clusters; components for automated provisioning of data analytics environments across heterogeneous clouds; a Data Movement System, to efficiently transfer large datasets; a Kubernetes cluster to manage on demand submission of Spark jobs. In this paper, we discuss the challenges of creating an infrastructure to meet the differing analytics needs of multiple facilities and report the architecture and design of the infrastructure that enables Data Analytics as a Service.
 
poster icon Poster MOPHA160 [1.665 MB]  
DOI • reference for this paper ※ https://doi.org/10.18429/JACoW-ICALEPCS2019-MOPHA160  
About • paper received ※ 30 September 2019       paper accepted ※ 10 October 2019       issue date ※ 30 August 2020  
Export • reference for this paper using ※ BibTeX, ※ LaTeX, ※ Text/Word, ※ RIS, ※ EndNote (xml)