Publications


Selected titles are prefixed with a caret (^). This list also appears in my curriculum vitae.

Peer-Reviewed Conference Proceedings

Identifying the provision of choices in privacy policy text. Kanthashree Sathyendra, Shomir Wilson, Florian Schaub, Norman Sadeh, and Sebastian Zimmeck. In Proceedings of the Conference on Empirical Methods in Natural Language Processing, Copenhagen, Denmark, 2017.

^ Automated analysis of privacy requirements for mobile apps. Sebastian Zimmeck, Ziqi Wang, Lieyong Zou, Roger Iyengar, Bin Liu, Florian Schaub, Shomir Wilson, Norman Sadeh, Steven M. Bellovin, and Joel Reidenberg. In Proceedings of the Network and Distributed System Security Symposium, San Diego, California, March 2017.

^ The creation and analysis of a website privacy policy corpus. Shomir Wilson, Florian Schaub, Aswarth Abhilash Dara, Frederick Liu, Sushain Cherivirala, Pedro Giovanni Leon, Mads Schaarup Andersen, Sebastian Zimmeck, Kanthashree Mysore Sathyendra, N. Cameron Russell, Thomas B. Norton, Eduard Hovy, Joel Reidenberg, and Norman Sadeh. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, Berlin, Germany, August 2016.

^ Crowdsourcing annotations of websites' privacy policies: Can it really work? Shomir Wilson, Florian Schaub, Rohan Ramanath, Norman Sadeh, Fei Liu, Noah Smith and Frederick Liu. In Proceedings of the 25th International World Wide Web Conference, Montréal, Canada, April 2016. Best Paper Finalist.

This table is different: A WordNet-based approach to identifying references to document entities. Shomir Wilson, Alan W Black, and Jon Oberlander. In Proceedings of The 8th International Global WordNet Conference, Bucharest, Romania, January 2016. [data]

Identifying relevant text fragments to help crowdsource privacy policy annotations. Rohan Ramanath, Florian Schaub, Shomir Wilson, Fei Liu, Norman Sadeh, and Noah Smith. In Proceedings of the Second AAAI Conference on Human Computation and Crowdsourcing, works-in-progress track, Pittsburgh, PA, November 2014.

Determiner-established deixis to communicative artifacts in pedagogical text. Shomir Wilson and Jon Oberlander. In Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, Baltimore, USA, June 23-25, 2014. [data]

Toward automatic processing of English metalanguage. Shomir Wilson. In Proceedings of the 6th International Joint Conference on Natural Language Processing (IJCNLP), Nagoya, Japan, October 14-18, 2013.

^ Privacy manipulation and acclimation in a location sharing application. Shomir Wilson, Justin Cranshaw, Norman Sadeh, Alessandro Acquisti, Lorrie Cranor, Jay Springfield, Sae Young Jeong, and Arun Balasubramanian. In Proceedings of the ACM International Joint Conference on Pervasive and Ubiquitous Computing (Ubicomp), Zurich, Switzerland, September 8-12, 2013.

^ Tweets are forever: A large-scale quantitative analysis of deleted tweets. Hazim Almuhimedi, Shomir Wilson, Bin Liu, Norman Sadeh, and Alessandro Acquisti. In Proceedings of the 2013 ACM Conference on Computer Supported Cooperative Work, San Antonio, TX, February 23-27, 2013.

^ The creation of a corpus of English metalanguage. Shomir Wilson. In Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, South Korea, July 9-11, 2012. [slides] [corpus]

Application of MCL in a dialog agent. Darsana Josyula, Scott Fults, Michael L. Anderson, Shomir Wilson, and Don Perlis. In Papers from the Third Language and Technology Conference, 2007.

Toward domain-neutral human-level metacognition. Michael L. Anderson, Matt Schmill, Tim Oates, Don Perlis, Darsana Josyula, Dean Wright, and Shomir Wilson. In Papers from the 2007 AAAI Spring Symposium on Logical Formalizations of Commonsense Reasoning, 2007.

Peer-Reviewed Symposium Proceedings

Analyzing vocabulary intersections of expert annotations and topic models for data practices in privacy policies. Frederick Liu, Shomir Wilson, Florian Schaub and Norman Sadeh. In Proceedings of the AAAI Fall Symposium on Privacy and Language Technologies, Arlington, VA, November 2016.

Automatic extraction of opt-out choices from privacy policies. Kanthashree Mysore Sathyendra, Florian Schaub, Shomir Wilson and Norman Sadeh. In Proceedings of the AAAI Fall Symposium on Privacy and Language Technologies, Arlington, VA, November 2016.

Analyzing and predicting privacy law compliance of mobile apps. Sebastian Zimmeck, Ziqi Wang, Lieyong Zou, Roger Iyengar, Bin Liu, Florian Schaub, Shomir Wilson, Norman Sadeh, Steven M. Bellovin and Joel Reidenberg. In Proceedings of the AAAI Fall Symposium on Privacy and Language Technologies, Arlington, VA, November 2016.

The Metacognitive Loop: An architecture for building robust intelligent systems. Hamid Haidarian, Wikum Dinalankara, Scott Fults, Shomir Wilson, Don Perlis, Matt Schmill, Tim Oates, Darsana Josyula, and Michael Anderson. In Proceedings of the AAAI Fall Symposium on Commonsense Knowledge (AAAI/CSK'10), Arlington, VA, USA, November 11-13, 2010.

Peer-Reviewed Journal Articles

^ (Accepted) PrivOnto: A semantic framework for the analysis of privacy policies. Alessandro Oltramari, Dhivya Piraviperumal, Florian Schaub, Shomir Wilson, Sushain Cherivirala, Thomas B. Norton, N. Cameron Russell, Peter Story, Joel Reidenberg, and Norman Sadeh. To appear in Semantic Web Journal.

^ Nudges for privacy and security: Understanding and assisting users' choices online. Alessandro Acquisti, Idris Adjerid, Rebecca Balebako, Laura Brandimarte, Lorrie Faith Cranor, Saranga Komanduri, Pedro Giovanni Leon, Norman Sadeh, Florian Schaub, Manya Sleeper, Yang Wang, and Shomir Wilson. ACM Computing Surveys 50(3), August 2017.

In search of the use-mention distinction and its impact on language processing tasks. Shomir Wilson. The International Journal of Computational Linguistics and Applications 2(1-2), pp. 139-154, 2011.

Book Chapters

(Accepted) A bridge from the use-mention distinction to natural language processing. Shomir Wilson. To appear in Saka, P., Johnson, M. (Eds.), The Semantics and Pragmatics of Quotation. Springer, 2017.

The metacognitive loop and reasoning about anomalies. Matthew Schmill, Michael L. Anderson, Scott Fults, Darsana Josyula, Tim Oates, Donald Perlis, Hamid Haidarian Shahri, Shomir Wilson, and Dean Wright. In Cox, M., Raja, A. (Eds.), Metareasoning: Thinking about Thinking. MIT Press, MA, USA, 2010.

Magazine Article

A self-help guide for autonomous systems. Michael L. Anderson, Scott Fults, Darsana P. Josyula, Tim Oates, Don Perlis, Matthew D. Schmill, Shomir Wilson, and Dean Wright. AI Magazine, Summer 2008.

Peer-Reviewed Conference Poster Abstracts

Increasing the salience of data use opt-outs online. Namita Nisal, Sushain K. Cherivirala, Kanthashree M. Sathyendra, Margaret Hagan, Florian Schaub, Shomir Wilson, Lorrie Faith Cranor, and Norman Sadeh. In Proceedings of the Thirteenth Symposium on Usable Privacy and Security, Santa Clara, CA, June 2017.

Mobile app privacy compliance: Automated technology to help regulators, app stores and developers. Sebastian Zimmeck, Lieyong Zou, Bin Liu, Shomir Wilson, Steven M. Bellovin, Ziqi Wang, Roger Iyengar, Florian Schaub, Norman Sadeh, and Joel Reidenberg. In Proceedings of the Thirteenth Symposium on Usable Privacy and Security, Santa Clara, CA, June 2017.

Visualization and interactive exploration of data practices in privacy policies. Sushain K. Cherivirala, Florian Schaub, Mads Schaarup Andersen, Shomir Wilson, Norman Sadeh, and Joel R. Reidenberg. In Proceedings of the Twelfth Symposium on Usable Privacy and Security, Denver, CO, June 2016.

Towards usable privacy policies: Semi-automatically extracting data practices from websites' privacy policies. Norman Sadeh, Alessandro Acquisti, Travis Breaux, Lorrie Cranor, Aleecia McDonald, Joel Reidenberg, Noah Smith, Fei Liu, N. Cameron Russell, Florian Schaub, Shomir Wilson, James Graves, Pedro Leon, Rohan Ramanath, and Ashwini Rao. In Proceedings of the Tenth Symposium on Usable Privacy and Security, Palo Alto, CA, July 2014.

Peer-Reviewed Workshop Proceedings

Demystifying privacy policies with language technologies: Progress and challenges. Shomir Wilson, Florian Schaub, Aswarth Dara, Sushain K. Cherivirala, Sebastian Zimmeck, Mads Schaarup Andersen, Pedro Giovanni Leon, Eduard Hovy, and Norman Sadeh. In Proceedings of the Workshop on Text Analytics for Cybersecurity and Online Safety (TA-COS) at LREC, Portorož, Slovenia, May 2016.

Distinguishing use and mention in natural language. Shomir Wilson. In Proceedings of the NAACL HLT Student Research Workshop, 29-33. Los Angeles, CA: Association for Computational Linguistics. 2010.

The role of metacognition in robust AI systems. Matt Schmill, Tim Oates, Michael L. Anderson, Darsana Josyula, Don Perlis, Shomir Wilson, and Scott Fults. In Papers from the Workshop on Metareasoning at the Twenty-Third AAAI Conference on Artificial Intelligence, 2008.

Ontologies for reasoning about failures in AI systems. Michael L. Anderson, Scott Fults, Darsana Josyula, Tim Oates, Don Perlis, Matt Schmill, and Shomir Wilson. Proceedings of the First International Workshop on Metareasoning in Agent-Based Systems, Hawaii, 2007.

Dissertation

A Computational Theory of the Use-Mention Distinction in Natural Language. Shomir Wilson. University of Maryland, 2011.

Technical Reports

The Usable Privacy Policy Project: Combining crowdsourcing, machine learning and natural language processing to semi-automatically answer those privacy questions users care about. Norman Sadeh, Alessandro Acquisti, Travis Breaux, Lorrie Cranor, Aleecia McDonald, Joel Reidenberg, Noah Smith, Fei Liu, N. Cameron Russell, Florian Schaub, and Shomir Wilson. Technical Report CMU-ISR-13-119, Carnegie Mellon University, 2013.

Automatic categorization of privacy policies: A pilot study. Waleed Ammar, Shomir Wilson, Norman Sadeh, and Noah A. Smith. Technical Report CMU-LTI-12-019 / CMU-ISR-12-114, Carnegie Mellon University, 2012.