publications

publications by categories in reversed chronological order. generated by jekyll-scholar.

An up-to-date list is available on Google Scholar

2025

  1. cornstack.png
    CoRNStack: High-Quality Contrastive Data for Better Code Ranking
    Tarun Suresh, Revanth Gangi Reddy, Yifei Xu, and 4 more authors
    In International Conference on Learning Representations (ICLR), 2025

2024

  1. nomic-embed.jpeg
    Nomic embed: Training a reproducible long context text embedder
    Zach Nussbaum, John X Morris, Brandon Duderstadt, and 1 more author
    arXiv preprint arXiv:2402.01613, 2024
  2. DNA-Diffusion: Leveraging Generative Models for Controlling Chromatin Accessibility and Gene Expression via Synthetic Regulatory Elements
    Lucas Ferreira DaSilva, Simon Senan, Zain Munir Patel, and 8 more authors
    bioRxiv, 2024
  3. dna-diffusion.png
    DNA-Diffusion: Leveraging Generative Models for Controlling Chromatin Accessibility and Gene Expression via Synthetic Regulatory Elements
    Simon Senan, Aniketh Janardhan Reddy, Zach Nussbaum, and 5 more authors
    In ICLR 2024 Workshop on Machine Learning for Genomics Explorations, 2024
  4. Nomic Embed Vision: Expanding the Latent Space
    Zach Nussbaum, Brandon Duderstadt, and Andriy Mulyar
    arXiv preprint arXiv:2406.18587, 2024

2023

  1. gpt4all.jpeg
    Gpt4all: Training an assistant-style chatbot with large scale data distillation from gpt-3.5-turbo
    Yuvanesh Anand, Zach Nussbaum, Brandon Duderstadt, and 2 more authors
    GitHub (2023), 2023
  2. big-rna.png
    An RNA foundation model enables discovery of disease mechanisms and candidate therapeutics
    Albi Celaj, Alice Jiexin Gao, Tammy TY Lau, and 8 more authors
    bioRxiv, 2023
  3. GPT4All: An ecosystem of open source compressed language models
    Yuvanesh Anand, Zach Nussbaum, Adam Treat, and 6 more authors
    arXiv preprint arXiv:2311.04931, 2023

2021

  1. Machine learning methods for extracting structure functions from experimental data
    Andrew Hoyle, Michelle Kuchera, Pawel Ambrozewicz, and 7 more authors
    In APS April Meeting Abstracts, 2021
  2. two-tails.png
    A tale of two long tails
    Daniel D’souza, Zach Nussbaum, Chirag Agarwal, and 1 more author
    arXiv preprint arXiv:2107.13098, 2021

2020

  1. Using Adversarial Networks to Generate Realistic Structure Function Surfaces
    Andrew Hoyle, Michelle Kuchera, Raghu Ramanujan, and 4 more authors
    In APS Division of Nuclear Physics Meeting Abstracts, 2020