| CARVIEW |
|
Brendan T. O'ConnorAssociate Professor, College of Information and Computer SciencesAssociate Director, Computational Social Science Institute University of Massachusetts Amherst brenocon@cs.umass.edu, @brendan642, @brenocon.bsky.social Office: CSB W238, 140 Governors Drive, Amherst, MA 01003 |
I am an associate professor in the College of Information and Computer Sciences at UMass Amherst. My group is the SLANG Lab, part of UMass NLP. I am also an associate director of the Computational Social Science Institute, and affiliated with the UMass CogSci and the Centers for Data Science and Intelligent Information Retrieval.
Links: CV, Bio, Teaching, Talks, Notes, Misc.
Some current things:
- For Fall 2025, I'm teaching CS 685, Advanced Natural Language Processing. (See also my previous teaching.)
Some current/recent collaborative projects:
- Co-Insights: Fostering community collaboration to combat misinformation. We are part of the UMass team, in a multi-site project with several other institutions.
- Understanding variation in African American Language: Corpus and prosodic fieldwork perspectives with Kristine Yu, Lisa Green, and Meghan Armstrong-Abrami; see also our earlier work in disparities in natural language processing.
- Analyzing Cross-country Bias in News Coverage of International Conflicts and Disasters, 2023 Interdisciplinary Research Grant Project, with Przemyslaw Grabowicz, Ethan Zuckerman, and Paul Musgrave.
- SaTC: Identifying the Demographic Representativeness of Social Media Polls, with Przemyslaw Grabowicz.
- Leveraging Large Language Models to Provide Clinically Feasible Tools for Assessing Discourse in Individuals with Communication Impairments, 2024 Interdisciplinary Research Grant Project, with Jacquie Kurland and Anna Liu.
Research:
What can statistical text analysis tell us about society?
I develop
text analysis
methods
that can help answer social science
questions.
I'm interested in
statistical machine learning and
natural language processing,
especially when informed by or applied to areas like
political science or sociolinguistics.
My work often uses text data from news and social media.
There is a rich set of other faculty at UMass interested in areas from computational social science to natural language processing. See the Computational Social Science Institute (CSSI) website, and UMass NLP affiliates.
Background:
I joined UMass after receiving my PhD
from
Carnegie Mellon University's
Machine Learning Department.
I have also been a
Visiting Fellow at Harvard IQSS,
and
interned with the Facebook Data Science team.
Before grad school,
I worked on crowdsourced annotations at CrowdFlower / Dolores Labs,
and
natural language search at Powerset.
I started studying the intersection of AI and social science
as an undergrad/masters student in
Stanford
Symbolic Systems (cognitive science, more or less).
Link: Full bio.
Papers/Publications
(For others, see Google Scholar or my CV.)-
Proceedings of ACL 2025.
- Data (github).
- Earlier version presented at ACM Symposium on CS&Law 2025 (slides).
-
Findings of ACL 2025.
- Data/code (github).
- Earlier version presented at Clinical Aphasiology Conference 2025.
-
Jacquie Kurland, Vishnupriya Varadharaju, Anna Liu, Polly Stokes, Ankita Gupta, Marisa Hudspeth, and Brendan O'Connor.American Journal of Speech-Language Pathology. Mar. 2025.
- arXiv:2505.10798, May 2025.
-
Erica Cai, Xi Chen, Reagan Grey Keeney, Ethan Zuckerman, Brendan O'Connor, and Przemyslaw A. Grabowicz.In ICWSM 2025. (AAAI Conference on Web and Social Media)
- Also presented at IC2S2 2025.
- In WebSci 2025.
-
Journal of Law and Courts, Mar. 2025.
- Earlier version: SocArXiv preprint, Feb. 2023.
- Earlier version presented at New Directions in Analyzing Text as Data (TADA 2022), titled Quantifying the Causal Effect of Gender on Interruptions in Supreme Court Oral Arguments.
- Github repository
- Press coverage: Axios, Balls and Strikes, Strict Scrutiny
-
Maha Alkhairy, Vincent Homer, and Brendan O'Connor.arXiv:2502.08415, Feb 2025.
-
Proceedings of ACL 2024.
- Also plenary lightning talk (slides) at IC2S2 2024, for extended abstract titled "Making sense of public participation in rulemaking using argument explication" (Gupta, Zuckerman, and O'Connor).
- Findings of ACL 2024.
- First Workshop on Machine Learning for Ancient Languages (ML4AL), 2024.
-
NLP+CSS Workshop at NAACL 2024.
- Also panel talk at IC2S2 2024 for extended abstract.
- Proceedings of ACL 2023.
-
arXiv:2305.15051.
NeurIPS 2023 Workshop on Instruction Tuning and Instruction Following.
- Also presented at New England Natural Language Processing, 2023.
-
Ankita Gupta, Marzena Karpinska, Wenlong Zhao, Kalpesh Krishna, Jack Merullo, Luke Yeh, Mohit Iyyer, and Brendan O'Connor.Findings of ACL: EACL 2023. Also earlier arxiv version (2022)
- NLP+CSS workshop at EMNLP 2022 (Proceedings of the Fifth Workshop on Natural Language Processing and Computational Social Science)
- Abstract presented at New Ways of Analyzing Variation (NWAV50), 2022.
-
Proceedings of the Workshop on Noisy User-generated Text (W-NUT) at COLING 2022.
- Link: Code repository, including paper erratum.
-
Corpus-Guided Contrast Sets for Morphosyntactic Feature Detection in Low-Resource English Varieties.Proceedings of the 1st Field Matters Workshop on NLP Applications to Field Linguistics, at COLING 2022.
- Link: Code repository
- Proceedings of the 2022 ACM Conference on Computer Supported Cooperative Work (ACM CSCW 2022).
- ACM Transactions on Interactive Intelligent Systems, 2022.
-
First Workshop on Causal Inference & NLP (CI-NLP) at EMNLP 2021.
Abstract presented at 11th Annual Conference on New Directions in Analyzing Text as Data (TADA 2021). - Abstract presented at the UnImplicit workshop at ACL-IJCNLP 2021.
- Findings of ACL 2021. Also presented at the CASE workshop at ACL-IJCNLP 2021
-
Global Networks. 2021.
- Honorable Mention, Political Ties Award for Best Published Article in 2021, APSA Political Networks section.
- See Kevin and Tuugi's article about this work at The Conversation.
- Links: Journal, PDF
- NLP+CSS workshop at EMNLP 2020 (Proceedings of the Fourth Workshop on Natural Language Processing and Computational Social Science).
- NLP+CSS workshop at EMNLP 2020 (Proceedings of the Fourth Workshop on Natural Language Processing and Computational Social Science).
- Proceedings of ACL 2020.
-
Proceedings of EMNLP 2019.
- Data
- Press coverage: The Undefeated website, Twitter thread
-
Abram Handler and Brendan O'Connor.Proceedings of EMNLP 2019.
- arxiv preprint, 2019.
- Proceedings of EMNLP 2018.
- Proceedings of ACL 2018.
- Proceedings of NAACL 2018.
Relational Summarization for Corpus Analysis. Proceedings of NAACL 2018.- Proceedings of ICTIR 2018.
Johnny Tian-Zheng Wei, Khiem Pham, Brian Dillon, and Brendan O'Connor,BlackboxNLP workshop at EMNLP 2018 (Analyzing and interpreting neural networks for NLP).A Probabilistic Approach for Learning with Label Proportions Applied to the US Presidential Election.Tao Sun, Daniel Sheldon, and Brendan O'Connor.Proceedings of ICDM 2017.Katherine A. Keith, Abram Handler, Michael Pinkham, Cara Magliozzi, Joshua McDuffie, and Brendan O'Connor.Proceedings of EMNLP 2017.- Website
- Video presentation by Katherine A. Keith.
- ACL Anthology page
- Workshop on Data Science + Journalism at KDD 2017.
- Fairness, Accountability, and Transparency in Machine Learning (FAT/ML) workshop at KDD 2017.
Su Lin Blodgett, Johnny Tian-Zheng Wei, and Brendan O'Connor.3rd Workshop on Noisy User-generated Text (W-NUT) at EMNLP 2017.
Best paper award.- Proceedings of WWW 2017.
Bag of What? Simple Noun Phrase Extraction for Text Analysis. NLP+CSS Workshop at EMNLP 2016.- Proceedings of EMNLP 2016.
- Proceedings of CIKM 2016.
- At WHI 2016 - Workshop on Human Interpretability in Machine Learning (workshop at ICML 2016).
- At TPDP 2016 - Theory and Practice of Differential Privacy (workshop at ICML 2016).
- Proceedings of EMNLP 2015.
- PLOS-ONE, November 2014.
- Also arXiv:1210.5268; an earlier version was from Oct. 2012 and poster at NIPS 2012 Workshop on Social Network and Social Media Analysis.
- PhD Thesis, Carnegie Mellon University, 2014.
- ACL Workshop on Interactive Language Learning, Visualization, and Interfaces, June 2014. (Proceedings of ACL 2014.)
- In SemEval-2014 (Proceedings of the International (COLING) Workshop on Semantic Evaluations, Dublin, Ireland, August 2014).
- Proceedings of ACL 2013.
- Proceedings of ACL 2013.
- Proceedings of NAACL 2013
- arXiv:1310.1975, Oct 2013.
- arXiv:1307.7382, Data Analysis Project report, Machine Learning Department, CMU. July 2013.
Nathan Schneider, Brendan O’Connor, Naomi Saphra, David Bamman, Manaal Faruqui, Noah A. Smith, Chris Dyer, and Jason Baldridge.In Linguistic Annotation Workshop, 2013.- In First Monday 17.3, March 2012.
- [web] [pdf]
- Press coverage: BBC, New Scientist, etc.
- In NIPS Workshop on Comptuational Social Science and the Wisdom of Crowds, Sierra Nevada, Spain, December 2011.
Dani Yogatama, Michael Heilman, Brendan O'Connor, Chris Dyer, Bryan R. Routledge, and Noah A. Smith.In Proceedings of EMNLP 2011.Kevin Gimpel, Nathan Schneider, Brendan O'Connor, Dipanjan Das, Daniel Mills, Jacob Eisenstein>, Michael Heilman, Dani Yogatama, Jeffrey Flanigan and Noah A. Smith.In ACL-2011 (short paper).- In NIPS-2010 Workshop on Machine Learning and Social Computing.
- In Proceedings of EMNLP 2010 (presentation).
- Appendix
- Data
- Press coverage: New York Times, All Things Considered, BBC, Washington Post, Wall Street Journal, Associated Press, New Scientist, San Francisco Chronicle, Ars Technica, LA Weekly, MSNBC, etc.
- In ICWSM-2010 (presentation).
- Video, Slides
- Press coverage: Pittsburgh Tribune-Review, Mashable, Ars Technica, New Scientist, CNN Tech, Fast Company, Science Now, Economic Times, BBC Radio 5 (at 13:00) and others.
- In ICWSM-2010 (demo track).
Superficial Data Analysis: Exploring Millions of Social Stereotypes.In Beautiful Data, ed. Toby Segaran and Jeff Hammerbacher. O'Reilly Media. 2009.- In EMNLP-2008 (presentation).
@brenocon@masto.ai