Papers I co-authored and papers where I built the underlying production infrastructure. The two are intentionally separated — author lines and acknowledgements credit different kinds of work.
Acknowledgements#
Papers where I am credited for infrastructure contributions rather than listed as an author.
Virchow2: Scaling Self-Supervised Mixed Magnification Models in Pathology#
E. Zimmermann, S. Liu, et al. · arXiv preprint, 2024
Three vision transformer foundation models — Virchow2 (632M), Virchow2G (1.9B), and Virchow2G Mini (22M distilled) — trained on 3.1 million histopathology whole-slide images. State-of-the-art on 12 tile-level tasks. Later published in Nature Medicine.
Contribution: Designed and operated the GPU compute infrastructure and high-throughput storage environment used to train and validate all three models.
PRISM: A Multi-Modal Generative Foundation Model for Slide-Level Histopathology#
S. Liu, et al. · arXiv preprint, 2024
A multi-modal generative foundation model operating at the slide level for computational pathology, jointly modeling histology and clinical text.
Contribution: Built and maintained the HPC infrastructure supporting large-scale whole-slide image preprocessing, model training, and validation.
Co-authored#
Papers I co-authored from the Caltech CMS / Large Hadron Collider years.
SDN-NGenIA: A Software Defined Next Generation Integrated Architecture for HEP and Data Intensive Science#
J. Balcas, et al. · Journal of Physics: Conference Series, Vol. 898 (CHEP 2016)
A software-defined next-generation integrated architecture supporting high-energy physics and data-intensive science. Presented at the 22nd International Conference on Computing in High Energy and Nuclear Physics (CHEP 2016), San Francisco.
HTTP as a Data Access Protocol: Trials with XrootD in CMS’s AAA Project#
J. Balcas, et al. · Journal of Physics: Conference Series, Vol. 898 (CHEP 2016)
Evaluation of HTTP as a data access protocol for the CMS Any-data Anytime Anywhere (AAA) project, comparing performance and operational characteristics against XrootD’s native protocol.
High Speed Scientific Data Transfers Using Software Defined Networking#
H. Newman, et al. · INDIS ‘15 (SC15), Austin, Texas
Second Workshop on Innovating the Network for Data-Intensive Science, co-located with SC15: The International Conference for High Performance Computing, Networking, Storage and Analysis.