CV — Nanshu Wang

Staff Software Engineer at Meta (SuperIntelligence Lab) with 7+ years shipping AI products at scale. Deep expertise in LLM post-training (RLHF, curriculum reinforcement learning, SFT, and DPO), applied to multimodal perception, instruction following, and agentic systems with tool use and RAG. Led 50+ person teams across AI Glass (Ray-Ban Meta), Llama post-training, and Meta AI, with products reaching hundreds of millions of users.

Experience

Meta — Staff Software Engineer 2022 – present

SuperIntelligence Lab · Wearable AI

Tech lead for LiveAI on Ray-Ban Meta Glasses — real-time multimodal perception over ego-centric video and audio, with VLM post-training via RLHF, curriculum reinforcement learning, and proactive reasoning. Founding tech lead for the Meta AI platform: agents, plugins, tool use, and RAG powering Meta AI's 2023 debut and 2024 Llama 3 upgrade.

Meta — Senior Software Engineer 2021 – 2022

AI Assistant · Reality Labs

Tech lead for on-device text assist (Smart Compose, Quick Reply) on Quest/Oculus devices using on-device generative LMs, federated learning, and differential privacy.

Meta — Software Engineer 2018 – 2021

AI Assistant · Messenger

Built Meta's first LM product (Smart Compose) and first on-device compressed model for Messenger. Scaled Smart Compose to billions of MAU on Facebook News Feed.

Education

Carnegie Mellon University 2017 – 2018

M.S. in Software Engineering

Chinese Academy of Sciences 2014 – 2017

M.S. in Computer Science

Renmin University of China 2010 – 2014

B.S. in Human Resources Management

Fun

Coding since high school, and twice won First Prize at the National Olympiad in Informatics in Provinces (NOIP).

Work / Media

Open Source

TorchGlow Meta

ML compiler and execution engine for hardware accelerators

Multi-layer LSTM support.

PyText Meta

Deep-learning NLP modeling framework built on PyTorch

ONNX support and contextualized model implementation.

DeepSpeech Mozilla

End-to-end speech recognition

Data refinement pipeline for loading speech corpora.

Publications & Patents

He, Y., Li, W., Zhang, H., Li, S., …, Wang, N., et al. Rubric-Based Benchmarking and Reinforcement Learning for Advancing LLM Instruction Following. ACL 2026.

Qin, C., Zhou, W., Sankararaman, K.A., Wang, N., et al. Learning Auxiliary Tasks Improves Reference-Free Hallucination Detection in Open-Domain Long-Form Generation. ACL 2025.

Sun, Y., Yin, X., Jiang, J., Sekar, V., Lin, F., Wang, N., et al. CS2P: Improving Video Bitrate Selection and Adaptation with Data-Driven Throughput Prediction. ACM SIGCOMM 2016.

Sun, Y., Jiang, J., Sekar, V., Zhang, H., Lin, F., & Wang, N. Using Video-Based Measurements to Generate a Real-Time Network Traffic Map. ACM HotNets 2014.

Gill, P., Liu, H., Yang, W., Malik, K., Wang, N., & Reiss, D. Methods, mediums, and systems for providing a model for an end-user device. U.S. Patent 11,501,081. 2022.

Gill, P., Liu, H., Yang, W., Malik, K., Wang, N., & Reiss, D. Methods, mediums, and systems for training a model. U.S. Patent 11,455,555. 2022.

Botros, F., Wang, N., Wang, F., et al. Voice-based Auto-Completions and Auto-Responses for Assistant Systems. U.S. Patent Application 17/120,013. 2022.

Gill, P., Liu, H., Yang, W., Malik, K., Wang, N., & Reiss, D. Methods, mediums, and systems for representing a model in a memory of device. U.S. Patent 11,227,122. 2022.