Skip to main content

Publications and Acknowledgements

Papers I co-authored and papers where I built the underlying production infrastructure. The two are intentionally separated — author lines and acknowledgements credit different kinds of work.

Acknowledgements
#

Papers where I am credited for infrastructure contributions rather than listed as an author.

Virchow2: Scaling Self-Supervised Mixed Magnification Models in Pathology
#

E. Zimmermann, S. Liu, et al. · arXiv preprint, 2024

Three vision transformer foundation models — Virchow2 (632M), Virchow2G (1.9B), and Virchow2G Mini (22M distilled) — trained on 3.1 million histopathology whole-slide images. State-of-the-art on 12 tile-level tasks. Later published in Nature Medicine.

Contribution: Designed and operated the GPU compute infrastructure and high-throughput storage environment used to train and validate all three models.

arXiv · PDF


PRISM: A Multi-Modal Generative Foundation Model for Slide-Level Histopathology
#

S. Liu, et al. · arXiv preprint, 2024

A multi-modal generative foundation model operating at the slide level for computational pathology, jointly modeling histology and clinical text.

Contribution: Built and maintained the HPC infrastructure supporting large-scale whole-slide image preprocessing, model training, and validation.

arXiv · PDF

Co-authored
#

Papers I co-authored from the Caltech CMS / Large Hadron Collider years.

SDN-NGenIA: A Software Defined Next Generation Integrated Architecture for HEP and Data Intensive Science
#

J. Balcas, et al. · Journal of Physics: Conference Series, Vol. 898 (CHEP 2016)

A software-defined next-generation integrated architecture supporting high-energy physics and data-intensive science. Presented at the 22nd International Conference on Computing in High Energy and Nuclear Physics (CHEP 2016), San Francisco.

PDF


HTTP as a Data Access Protocol: Trials with XrootD in CMS’s AAA Project
#

J. Balcas, et al. · Journal of Physics: Conference Series, Vol. 898 (CHEP 2016)

Evaluation of HTTP as a data access protocol for the CMS Any-data Anytime Anywhere (AAA) project, comparing performance and operational characteristics against XrootD’s native protocol.

PDF


High Speed Scientific Data Transfers Using Software Defined Networking
#

H. Newman, et al. · INDIS ‘15 (SC15), Austin, Texas

Second Workshop on Innovating the Network for Data-Intensive Science, co-located with SC15: The International Conference for High Performance Computing, Networking, Storage and Analysis.

PDF