Portrait image
Portrait image

Hi! 👋

I'm Alexander Wan, a third-year undergraduate at UC Berkeley, majoring in Computer Science, Statistics, and Mathematics. I'm broadly interested in Machine Learning and NLP, particularly in improving the robustness and interpretability of large language models. I work closely with folks at the Berkeley NLP Group and the MSU Heterogeneous Learning and Reasoning lab.

See my: LinkedIn / Github / Google Scholar / Twitter


News

Nov 2023

I gave a talk at USC ISI's Natural Language seminar on the manipulation of LLMs through data.

Apr 2023

Our paper on poisoning instruction-tuned models was accepted to ICML.


Publications

What Evidence Do Language Models Find Convincing?

Alexander Wan, Eric Wallace, Dan Klein

Preprint 2024

arXiv

Poisoning Language Models During Instruction Tuning

Alexander Wan*, Eric Wallace*, Sheng Shen, Dan Klein

ICML 2023

arXiv

GLUECons: A Generic Benchmark for Learning Under Constraints

Hossein Rajaby Faghihi, Aliakbar Nafar, Chen Zheng, Roshanak Mirzaee, Yue Zhang, Andrzej Uszok, Alexander Wan, Tanawan Premsri, Dan Roth, Parisa Kordjamshidi

AAAI 2023

arXiv


Miscellaneous


Contact

Email: first 4 letters of first name + last name [at] berkeley [dot] edu