I'm Alexander Wan, a third-year undergraduate at UC Berkeley, majoring in Computer Science, Statistics, and Mathematics. I'm broadly interested in Machine Learning and NLP, particularly in improving the robustness and interpretability of large language models. I work closely with folks at the Berkeley NLP Group and the MSU Heterogeneous Learning and Reasoning lab.
See my: LinkedIn / Github / Google Scholar / Twitter
Nov 2023
I gave a talk at USC ISI's Natural Language seminar on the manipulation of LLMs through data.
Apr 2023
Our paper on poisoning instruction-tuned models was accepted to ICML.
What Evidence Do Language Models Find Convincing?
Alexander Wan, Eric Wallace, Dan Klein
Preprint 2024
Poisoning Language Models During Instruction Tuning
Alexander Wan*, Eric Wallace*, Sheng Shen, Dan Klein
ICML 2023
GLUECons: A Generic Benchmark for Learning Under Constraints
Hossein Rajaby Faghihi, Aliakbar Nafar, Chen Zheng, Roshanak Mirzaee, Yue Zhang, Andrzej Uszok, Alexander Wan, Tanawan Premsri, Dan Roth, Parisa Kordjamshidi
AAAI 2023
I was an instructor at InspiritAI, where I introduced AI concepts and Scratch programming to 5th/6th graders.
I am occasionally active on the Artificial Intelligence StackExchange, answering questions about AI.
Email: first 4 letters of first name + last name [at] berkeley [dot] edu