7th Parallel Data Storage Workshopheld in conjunction with
|
pdsw 2012 posters
Discovering Structure in Unstructured I/O
Jun He1,2, John Bent3, Aaron Torres4, Gary Grider4, Garth Gibson5, Carlos Maltzahn6,
Xian-He Sun1
1Illinois Institute of Technology; 2New Mexico Consortium; 3EMC; 4Los Alamos
National Laboratory;
5Carnegie Mellon University; 6University of California Santa Cruz
A Multi-tiered Dataflow and Storage System
Chen Wu1, Andreas Wicenec1, Dave Pallot2, and Alessio Checcucci1
1ICRAR/University of Western Australia;
2ICRAR/Curtin University
Compressing Intermediate Keys between Mappers and Reducers in SciHadoop
Adam Crume, Joe Buck, Carlos Maltzahn, Scott Brandt
University of California, Santa Cruz
Parallel I/O Framework for Data-Intensive Parallel Applications
Rengan Xu1, Mauricio Araya-Polo2, Barbara Chapman1
1University of Houston, USA; 2Repsol, USA.
Hadoop's Adolescence:
A Comparative Workloads Analysis from Three Research Clusters
Data Collection
Kai Ren1, Garth Gibson1, YongChul Kwon2, Magdalena Balazinska2, Bill Howe2
1Carnegie Mellon University; 2UW Seattle
A Case for Scaling HPC Metadata Performance through Deāspecialization
Swapnil Patil, Kai Ren, Kartik Kulkarni, Garth Gibson
Carnegie Mellon University
DataMods: Generalizing File System Services
Noah Watkins and Carlos Maltzahn
Systems Research Lab | UC Santa Cruz