publications

publications by categories in reversed chronological order. generated by jekyll-scholar.

2025

  1. ACL-2025
    tic-lm-main.png
    TiC-LM: A Web-Scale Benchmark for Time-Continual LLM Pretraining
    Jeffrey Li*, Mohammadreza Armandpour*, Iman Mirzadeh, and 8 more authors
    ACL Main (Oral), 2025
  2. ICLR-2025
    scaling-fig-1.png
    Language models scale reliably with over-training and on downstream tasks
    Samir Yitzhak Gadre, Georgios Smyrnis, Vaishaal Shankar, and 22 more authors
    ICLR, 2025

2024

  1. NeurIPS-2024
    dclm-fig-1.png
    DataComp-LM: In search of the next generation of training sets for language models
    Jeffrey Li, Alex Fang, Georgios Smyrnis, and 56 more authors
    NeurIPS Datasets and Benchmarks, 2024
  2. NeurIPS-2024
    styt.png
    Stronger Than You Think: Benchmarking Weak Supervision on Realistic Tasks
    Tianyi Zhang*, Linrong Cai*, Jeffrey Li, and 4 more authors
    NeurIPS Datasets and Benchmarks, 2024
  3. EMNLP-Find.-2024
    back-and-forth-fig-1.png
    Better Alignment with Instruction Back-and-Forth Translation
    Thao Nguyen, Jeffrey Li, Sewoong Oh, and 4 more authors
    EMNLP Findings, 2024

2023

  1. NeurIPS-2023
    ssl4ws.png
    Characterizing the Impacts of Semi-supervised Learning for Weak Supervision
    Jeffrey Li, Jieyu Zhang, Ludwig Schmidt, and 1 more author
    NeurIPS, 2023

2022

  1. ACM-Comm-2022
    acm-fig-1.png
    Interpretable machine learning: Moving from mythos to diagnostics
    Valerie Chen*, Jeffrey Li*, Joon Sik Kim, and 2 more authors
    Communications of the ACM, 2022

2021

  1. ICLR-2021
    interpretability_theory.png
    A Learning Theoretic Perspective on Local Explainability
    Jeffrey Li*, Vaishnavh Nagarajan*, Gregory Plumb, and 1 more author
    ICLR, 2021

2020

  1. ICLR-2020
    dpml.png
    Differentially Private Meta-Learning
    Jeffrey Li, Mikhail Khodak, Sebastian Caldas, and 1 more author
    ICLR, 2020