Staff Software Engineer at Meta (SuperIntelligence Lab) with 7+ years shipping AI products at scale. Deep expertise in LLM post-training: RLHF, curriculum reinforcement learning, SFT, and DPO across LLM capability on multimodal perception, instruction following, agentic systems with tool use and RAG. Led 50+ person teams across AI Glass (Ray-Ban Meta), Llama post-training, and Meta AI, with products reaching hundreds of millions of users.
Been coding since high school — and won First Prize at the National Olympiad in Informatics (NOIP) twice.
He, Y., Li, W., Zhang, H., Li, S., …, Wang, N., et al. Rubric-Based Benchmarking and Reinforcement Learning for Advancing LLM Instruction Following. ACL 2026.
Qin, C., Zhou, W., Sankararaman, K.A., Wang, N., et al. Learning Auxiliary Tasks Improves Reference-Free Hallucination Detection in Open-Domain Long-Form Generation. ACL 2025.
Sun, Y., Yin, X., Jiang, J., Sekar, V., Lin, F., Wang, N., et al. CS2P: Improving Video Bitrate Selection and Adaptation with Data-Driven Throughput Prediction. ACM SIGCOMM 2016.
Sun, Y., Jiang, J., Sekar, V., Zhang, H., Lin, F., & Wang, N. Using Video-Based Measurements to Generate a Real-Time Network Traffic Map. ACM HotNets 2014.
Gill, P., Liu, H., Yang, W., Malik, K., Wang, N., & Reiss, D. Methods, mediums, and systems for providing a model for an end-user device. U.S. Patent 11,501,081. 2022. Patent
Gill, P., Liu, H., Yang, W., Malik, K., Wang, N., & Reiss, D. Methods, mediums, and systems for training a model. U.S. Patent 11,455,555. 2022. Patent
Botros, F., Wang, N., Wang, F., et al. Voice-based Auto-Completions and Auto-Responses for Assistant Systems. U.S. Patent Application 17/120,013. 2022. Patent
Gill, P., Liu, H., Yang, W., Malik, K., Wang, N., & Reiss, D. Methods, mediums, and systems for representing a model in a memory of device. U.S. Patent 11,227,122. 2022. Patent