publications

Publications by category in reverse chronological order. Generated by jekyll-scholar.

2025

ACL-2025

TiC-LM: A Web-Scale Benchmark for Time-Continual LLM Pretraining

Jeffrey Li^*, Mohammadreza Armandpour^*, Iman Mirzadeh, and 8 more authors

ACL Main (Oral), 2025

ICLR-2025

Language models scale reliably with over-training and on downstream tasks

Samir Yitzhak Gadre, Georgios Smyrnis, Vaishaal Shankar, and 22 more authors

ICLR, 2025

2024

NeurIPS-2024

DataComp-LM: In search of the next generation of training sets for language models

Jeffrey Li^*, Alex Fang^*, Georgios Smyrnis^*, and 56 more authors

NeurIPS Datasets and Benchmarks, 2024

NeurIPS-2024

Stronger Than You Think: Benchmarking Weak Supervision on Realistic Tasks

Tianyi Zhang^*, Linrong Cai^*, Jeffrey Li, and 4 more authors

NeurIPS Datasets and Benchmarks, 2024

EMNLP-Find.-2024

Better Alignment with Instruction Back-and-Forth Translation

Thao Nguyen, Jeffrey Li, Sewoong Oh, and 4 more authors

EMNLP Findings, 2024

2023

NeurIPS-2023

Characterizing the Impacts of Semi-supervised Learning for Weak Supervision

Jeffrey Li, Jieyu Zhang, Ludwig Schmidt, and 1 more author

NeurIPS, 2023

2022

ACM-Comm-2022

Interpretable machine learning: Moving from mythos to diagnostics

Valerie Chen^*, Jeffrey Li^*, Joon Sik Kim, and 2 more authors

Communications of the ACM, 2022

2021

ICLR-2021

A Learning Theoretic Perspective on Local Explainability

Jeffrey Li^*, Vaishnavh Nagarajan^*, Gregory Plumb, and 1 more author

ICLR, 2021

2020

ICLR-2020

Differentially Private Meta-Learning

Jeffrey Li, Mikhail Khodak, Sebastian Caldas, and 1 more author

ICLR, 2020