abstract / cfp / submissions / WIP session / workshop registration / committees
PDSW25 Reproducability Addendum
SUBMISSION DEADLINE: Aug 1st, 2025, 11:59 PM AoE
agenda
Any additional genda information, slides and abstracts will be posted here as soon as it becomes available. You will also be able to view the official agenda on the SC workshop page for the latest information and abstracts for each of the talks at a future date.
WORKSHOP ABSTRACT
Efficient storage, movement, and management of data are crucial to application performance and scientific productivity in both traditional simulation-oriented HPC environments and Cloud, AI/ML/Big Data analysis environments. This issue is further exacerbated by the growing volume of experimental and observational data, the widening gap between the performance of computational hardware and storage hardware, and the emergence of new data-driven algorithms in machine learning. The goal of this workshop is to facilitate in-depth discussions of research and development that address the most critical challenges in large-scale data storage and data processing. PDSW will continue to build on the successful tradition established by its predecessor workshops: the Petascale Data Storage Workshop (PDSW, 2006-2015) and the Data Intensive Scalable Computing Systems (DISCS 2012-2015) workshop. These workshops were successfully combined in 2016, and the resulting joint workshop has attracted up to 45 full paper submissions and 195 attendees per year from 2016 to 2024.
Topics of Interest:
- Scalable Architectures: Distributed data storage, archival, and virtualization.
- New Data Processing Models and Algorithms: Application of innovative data processing models and algorithms for parallel computing and analysis.
- Performance Analysis: Benchmarking, resource management, and workload studies.
- Cloud and Container-Based Models: Enabling cloud and container-based frameworks for large-scale data analysis.
- Storage Technologies: Adaptation to emerging hardware and computing models.
- Data Integrity: Techniques to ensure data integrity, availability, reliability, and fault tolerance.
- Programming Models and Frameworks: Big data solutions for data-intensive computing.
- Hybrid Cloud Data Processing: Integration of hybrid cloud and on-premise data processing.
- Cloud-Specific Opportunities: Data storage and transit opportunities specific to cloud computing.
- Storage System Programmability: Enhancing programmability in storage systems.
- Data Reduction Techniques: Filtering, compression, and reduction techniques for large-scale data.
- File and Metadata Management: Parallel file systems, metadata management at scale.
- In-Situ and In-Transit Processing: Integrating computation into the memory and storage hierarchy for in-situ and in-transit data processing.
- Alternative Storage Models: Object stores, key-value stores, and other data storage models.
- Productivity Tools: Tools for data-intensive computing, data mining, and knowledge discovery.
- Data Movement: Managing data movement between compute and data-intensive components.
- Cross-Cloud Data Management: Efficient data management across different cloud environments.
- AI-enhanced Systems: Storage system optimization and data analytics using machine learning.
- New Memory and Storage Systems: Innovative techniques and performance evaluation for new memory and storage systems.
CALL FOR PAPERS
Call for papers now available [pdf].
Last update June 6, 2025.
Regular paper SUBMISSIONS
All submissions to the PDSW’25 will undergo a rigorous double-anonymous peer review process overseen by the workshop program committee. Successful submissions will be published in the SC25 Workshop Proceedings and featured on the workshop website alongside associated talk slides.
Template and Submission
- A full paper up to 6 pages in length, excluding references and AD/AE appendices.
- Artifact Description (AD) Appendix is mandatory and Artifact Evaluation (AE) Appendix is optional.
- AD due: Aug 8th, 2025, 11:59 PM AoE
- Submissions with AD and AE Appendix will be considered favorably for the PDSW Best Paper award.
- Papers must adhere to the IEEE proceedings template. Download it here.
- FINAL DEADLINE - Submit your papers by Aug 1st, 2025, 11:59 PM AoEat https://submissions.supercomputing.org/
Reproducibility Initiative
Aligned with the SC25 Reproducibility Initiative, we encourage detailed and structured artifact descriptions (AD) using the SC25 format. The AD should include a field for one or more links to data (zenodo, figshare, etc.) and code (Github, GitLab, Bitbucket, etc.) repositories. For the artifacts that will be placed in the code repository, we encourage authors to follow the PDSW 2025 Reproducibility Addendum on how to structure the artifact, as it will make it easier for the reviewing committee and readers of the paper in the future.
Deadlines - Regular Papers and Reproducibility Study Papers
Submissions website: https://submissions.supercomputing.org/
Submissions due: Aug 1st, 2025, 11:59 PM AoE
AD due: Aug 8th, 2025, 11:59 PM AoE
Paper Notification: Sep 5th, 2025
Camera ready due: Sep 27th, 2025, 11:59 PM AoE
Final AD/AE due: Oct 15, 2025, 11:59 PM AoE
Copyright info due: TBD
Slides due before workshop: TBD
Work In Progress (WIP) Session
The WIP session will showcase brief 5-minute presentations on ongoing work that may not yet be ready for a full paper submission. WIP papers will not be included in the proceedings. A one-page abstract is required for participation.
Submissions due: September 12, 2025 AoE
WIP Notification: September 20, 2025
Workshop Registration
Housing Opens June 3, 2025; Registration opens July 9, 2025: This page will allow you to prepare, find further details on registration pricing, and policies affecting registration changes and cancellations.
PDSW 25 Committee Members:
Technical Committee
- Moiz Arif, Micron Technology Inc.
- Oceane Bel, Pacific Northwest National Laboratory
- Francieli Boito, University of Bordeaux/Inria, France
- Jalil Boukhobza, University of Western Brittany, France
- Hariharan Devarajan, Lawrence Livermore National Laboratory
- Andreas Dilger, DDN / Whamcloud
- Qian Gong, Oak Ridge National Laboratory
- Velusamy Kaushik, Argonne National Laboratory
- Youngjae Kim, Sogang University, South Korea
- Johann Lombardi, HPE, France
- Qizhong Mao, Bytedance Inc., China
- Arnab K. Paul, BITS Pilani, K K Birla Goa Campus, India
- Joao Paulo, INESC TEC, Portugal
- M. Mustafa Rafique, Rochester Institute of Technology
- Woong Shin, Oak Ridge National Laboratory
- Masahiro Tanaka,Microsoft
- Osamu Tatebe, University of Tsukuba, Japan
- Lipeng Wan, Georgia State University
- Wei Zhang, Lawrence Berkeley National Laboratory
- Qing Zheng, Los Alamos National Laboratory
- Mai Zheng, Iowa State University
Steering Committee
- John Bent, Cray
- Ali R. Butt, Virginia Tech
- Suren Byna, The Ohio State University
- Philip Carns, Argonne National Laboratory
- Shane Canon, Lawrence Berkeley National Laboratory
- Raghunath Raja Chandrasekar, Amazon Web Services
- Yong Chen, Texas Tech University
- Evan J. Felix, Pacific Northwest National Laboratory
- Gary Grider, Los Alamos National Laboratory
- William D. Gropp, University of Illinois at Urbana-Champaign
- Dean Hildebrand, Google
- Shadi Ibrahim, Inria, France
- Dries Kimpe, KCG, USA
- Glenn Lockwood, Lawrence Berkeley National Laboratory
- Jay Lofstead, Sandia National Laboratories
- Xiaosong Ma, Qatar Computing Research Institute, Qatar
- Kathryn Mohror, Lawrence Livermore National Laboratory
- Robert Ross, Argonne National Laboratory
- Kento Sato, Riken, Japan
- John Shalf, Lawrence Berkeley National Laboratory
- Xian-He Sun, Illinois Institute of Technology
- Rajeev Thakur, Argonne National Laboratory
- Brent Welch, Google
- Bing Xie, Meta
- Amelie Chi Zhou, Hong Kong Baptist University, China