I (he/him) am an Assistant Professor in the College of Information Sciences and Technology at Penn State, where I lead the Human Language Technologies Lab. My research spans natural language processing, privacy, security, and computational social science.
You can reach me at shomir _at_ psu.edu. If you are on Penn State's University Park campus, you can stop by my office at E310 in the Westgate Building.
Students taking my classes may benefit from browsing my Guide for Interacting With Faculty before contacting me.
Students interested in joining my lab: I sometimes have openings for PhD students, MS students, and undergraduates to work on projects related to natural language processing or privacy. If you're interested in working with me, first consult my Guide for Joining My Lab and then email me with "read your recruiting note" as the subject line. Include a CV and an explanation of your specific interests in my research.
2022-10-17: Here's an article in Penn State News about my PhD students' work identifying biases in language models against terms that describe people with disabilities.
2022-10-05: I've been selected to join the Steering Committee for Penn State's Center for Socially Responsible Artificial Intelligence.
2022-08-14: I will participate in the seminar "Privacy in Speech and Language Technology" at Dagstuhl in late August.
2022-07-01: We've released the GPI ("Government Privacy Instructions") Corpus, a collection of 1,043 privacy laws, regulations, and guidelines from 182 jurisdictions around the world. Read our ArXiv paper about it and download it here.
2022-06-02: I led (with Athina Markopoulou) a breakout session titled "Privacy, Policy, and People" at the NSF SaTC PI Meeting.
2022-04-22: Thanks to Sepideh Ghanavati for virtually hosting me for a talk with her lab at the University of Maine.
2022-02-04: I've been elected to Penn State's University Faculty Senate for a four-year term, starting in Fall 2022 and ending in Spring 2026.
2022-01-15: Our paper "Automated Detection of Doxing on Twitter" has been accepted to CSCW 2022. Here's a preprint on arXiv.
For older news, check the archive.