cv

Basics

Name Zach Nussbaum
Label Machine Learning Engineer
Email zanussbaum@gmail.com
Url https://www.linkedin.com/in/zach-nussbaum/
Summary Experienced Machine Learning Engineer with a focus on natural language processing, computer vision, and large-scale model deployment.

Work

  • 2023.03 - Present

    New York, NY

    Principal Machine Learning Engineer
    Nomic AI
    Leading the Embedding Team and implementing state-of-the-art embedding models.
    • Implemented Nomic Embed, powering Atlas embedding space visualization for text and multimodal applications
    • Trained and deployed Nomic Embed Text, outperforming OpenAI with 15M+ monthly downloads
    • Trained Nomic Embed Vision, the first multimodal embedding space performing highly on both multimodal and text embedding benchmarks
    • Deployed Nomic Embed models serving as backbone to Atlas, handling billions of tokens monthly
  • 2022.01 - 2023.03

    Remote

    Machine Learning Engineer
    Deep Genomics
    Focused on genomics and RNA research using machine learning techniques.
    • Reimplemented and improved DeepMind gene expression model in Tensorflow by 15-30% on TPUs
    • Led research on representation learning for chemical modifications of RNA molecules using Transformer based architectures
    • Helped standardize and run highly distributed training of attention-based models across many GPUs and TPUs
    • Reduced latency of large Deep Learning models by ~16x utilizing model distillation and TensorRT
    • Ported Maximal Update Parameterization to Tensorflow for zero-shot hyperparameter transfer across model width and depth
  • 2020.09 - 2021.12

    Seattle, Washington

    Machine Learning Engineer
    Amazon
    Worked on Alexa Speech Speaker ID model and optimized AWS infrastructure.
    • Maintained and improved evaluation pipelines for Alexa Speech Speaker ID model for 25M+ global consumers
    • Improved runtime of large PySpark processing jobs by 25% and reduced job failures by 50%
    • Drove redesign and engineering for Alexa SpeakerID weekly evaluation pipeline, reducing manual work by 75%
    • Optimized AWS EMR clusters for ML Ad Ranking Team, reducing operational costs by 50% leveraging AWS Lambda and AWS Event Bridge

Education