References
Bécue-Bertaut, Monica. 2019. Textual Data Science with
R. CRC Press.
Benoit, Kenneth. 2020. “Text as Data: An Overview.” In
Handbook of Research Methods in Political Science and International
Relations, edited by Luigi Curini and Robert Franzese, 461–97.
Thousand Oaks: Sage.
Benoit, Kenneth, and Akitaka Matsuo. 2020. Spacyr: Wrapper to the
’spaCy’ ’NLP’ Library. https://CRAN.R-project.org/package=spacyr.
Benoit, Kenneth, Kevin Munger, and Arthur Spirling. 2019.
“Measuring and Explaining Political Sophistication Through Textual
Complexity.” American Journal of Political Science 63
(2): 491–508. https://doi.org/10.1111/ajps.12423.
Benoit, Kenneth, Kohei Watanabe, Haiyan Wang, Paul Nulty, Adam Obeng,
Stefan Müller, and Akitaka Matsuo. 2018. “quanteda: An
R Package for the Quantitative Analysis of Textual
Data.” Journal of Open Source Software 3 (30): 774. https://doi.org/10.21105/joss.00774.
Boussalis, Constantine, Travis G. Coan, Mirya R. Holman, and Stefan
Müller. 2021. “Gender, Candidate Emotional Expression, and Voter
Reactions During Televised Debates.” American Political
Science Review 115 (4): 1242–57. https://doi.org/10.1017/S0003055421000666.
Castanho Silva, Bruno, and Sven-Oliver Proksch. 2021. “Politicians
Unleashed? Political Communication on Twitter and in Parliament in
Western Europe.” Political Science Research and Methods
published ahead of print. https://doi.org/10.1017/psrm.2021.36.
Chang, Jonathan. 2015. Lda: Collapsed Gibbs Sampling Methods for
Topic Models. https://CRAN.R-project.org/package=lda.
Crisp, Brian F., Benjamin Schneider, Amy Catalinac, and Taishi Muraoka.
2021. “Capturing Vote-Seeking Incentives and the Cultivation of a
Personal and Party Vote.” Electoral Studies 72: 102369.
https://doi.org/10.1016/j.electstud.2021.102369.
Denny, Matthew W., and Arthur Spirling. 2018. “Text Preprocessing
for Unsupervised Learning: Why It Matters, When It Misleads, and What to
Do about It.” Political Analysis 26 (2): 168–89. https://doi.org/10.1017/pan.2017.44.
Eshima, Shusei, Kosuke Imai, and Tomoya Sasaki. 2023. “Keyword
Assisted Topic Models.” American Journal of Political
Science online first. https://doi.org/10.1111/ajps.12779.
Feinerer, Ingo, Kurt Hornik, and David Meyer. 2008. “Text Mining
Infrastructure in R.” Journal of Statistical
Software 25 (5): 1–54. https://doi.org/10.18637/jss.v025.i05.
Gessler, Theresa, and Sophia Hunger. 2022. “How the Refugee Crisis
and Radical Right Parties Shape Party Competition on
Immigration.” Political Science Research and Methods 10
(3): 524–44. https://doi.org/10.1017/psrm.2021.64.
Grimmer, Justin, Margaret E. Roberts, and Brandon M. Stewart. 2022.
Text as Data: A New Framework for Machine Learning and
the Social Sciences. Princeton: Princeton University Press.
Grün, Bettina, and Kurt Hornik. 2011. “topicmodels: An R Package for Fitting
Topic Models.” Journal of Statistical Software 40 (13):
1–30. https://doi.org/10.18637/jss.v040.i13.
Herzog, Alexander, and Kenneth Benoit. 2015. “The Most Unkindest
Cuts: Speaker Selection and Expressed Government Dissent During Economic
Crisis.” The Journal of Politics 77 (4): 1157–75. https://doi.org/10.1086/682670.
Honnibal, Matthew, Ines Montani, Sophie Van Landeghem, and Adriane Boyd.
2020. “spaCy: Industrial-Strength
Natural Language Processing in Python.” https://doi.org/10.5281/zenodo.1212303.
Hvitfeldt, Emil, and Julia Silge. 2021. Supervised Machine Learning
for Text Analysis in R. Boca Raton: CRC Press. https://smltar.com.
Kwartler, Ted. 2017. Text Mining in Practice with
R. Chichester: John Wiley & Sons.
Lupia, Arthur, Stuart N. Soroka, and Alison Beatty. 2020. “What
Does Congress Want from the National Science Foundation? A Content
Analysis of Remarks from 1995 to 2018.” Science Advances
6 (33): eaaz6300. https://doi.org/10.1126/sciadv.aaz6300.
Manning, Christopher D., Prabhakar Raghavan, and Hinrich Schütze. 2008.
An Introduction to Information Retrieval. New York: Cambridge
University Press. https://nlp.stanford.edu/IR-book/.
Monroe, Burt L., Michael P. Colaresi, and Kevin M. Quinn. 2008. “Fightin’
Words: Lexical Feature Selection and Evaluation for
Identifying the Content of Political Conflict.” Political
Analysis 16 (4): 372–403.
Monroe, Burt L., and Philip A. Schrodt. 2008. “Introduction to the
Special Issue: The Statistical Analysis of Political Text.”
Political Analysis 16 (4): 351–55. https://doi.org/10.1093/pan/mpn017.
Mosteller, Frederick, and David L. Wallace. 1963. “Inference in an
Authorship Problem.” Journal of the American Statistical
Association 58 (302): 275–309.
Mullen, Lincoln A., Kenneth Benoit, Os Keyes, Dmitry Selivanov, and
Jeffrey Arnold. 2018. “Fast, Consistent Tokenization of Natural
Language Text.” Journal of Open Source Software 3: 655.
https://doi.org/10.21105/joss.00655.
Müller, Stefan. 2022. “The Temporal Focus of Campaign
Communication.” The Journal of Politics 84 (1): 585–90.
https://doi.org/10.1086/715165.
Pang, Bo, Lillian Lee, and Shivakumar Vaithyanathan. 2002. “Thumbs up?
Sentiment Classification Using Machine Learning Techniques.”
Proceedings of the Conference on Empirical Methods in Natural
Language Processing (EMNLP), 79–86.
Porter, Martin F. 2001. “Snowball: A Language for Stemming
Algorithms.”
Rauh, Christian, Bart Joachim Bes, and Martijn Schoonvelde. 2020.
“Undermining, Defusing or Defending European Integration?
Assessing Public Communication of European Executives in Times of EU
Politicisation.” European Journal of Political Research
59 (2): 397–423. https://doi.org/10.1111/1475-6765.12350.
Roberts, Margaret E., Brandon M. Stewart, and Dustin Tingley. 2019.
“stm: An R Package for
Structural Topic Models.” Journal of Statistical
Software 91 (2): 1–40. https://doi.org/10.18637/jss.v091.i02.
Rodriguez, Pedro L., Arthur Spirling, and Brandon Stewart. 2023.
conText: ’A La Carte’ on Text (ConText) Embedding Regression.
https://CRAN.R-project.org/package=conText.
Sarica, Serhad, and Jianxi Luo. 2021. “Stopwords in Technical
Language Processing.” PLoS One 16 (8): e0254937. https://doi.org/10.1371/journal.pone.0254937.
Schofield, Alexandra, and David Mimno. 2016. “Comparing Apples to
Apple: The Effects of Stemmers on Topic Models.” Transactions
of the Association for Computational Linguistics 4: 287–300. https://doi.org/10.1162/tacl_a_00099.
Silge, Julia, and David Robinson. 2016. “tidytext: Text Mining and
Analysis Using Tidy Data Principles in R.” Journal of Open
Source Software 1 (3). https://doi.org/10.21105/joss.00037.
———. 2017. Text Mining with R: A Tidy Approach. O’Reilly Media,
Inc.
Slapin, Jonathan B., and Sven-Oliver Proksch. 2008. “A Scaling
Model for Estimating Time-Series Party Positions from Texts.”
American Journal of Political Science 52 (3): 705–22. https://doi.org/10.1111/j.1540-5907.2008.00338.x.
Turenne, Nicolas. 2016. Analyse de Données Textuelles
Sous R. ISTE Group.
Watanabe, Kohei. 2023. LSX: Semi-Supervised Algorithm for Document
Scaling. https://CRAN.R-project.org/package=LSX.
Wickham, Hadley. 2021. Rvest: Easily Harvest (Scrape) Web
Pages. https://CRAN.R-project.org/package=rvest.
Wickham, Hadley, Mine Çetinkaya-Rundel, and Garrett Grolemund. 2023.
R for Data Science: Import, Tidy, Transform, Visualize, and Model
Data. 2nd ed. Sebastopol: O’Reilly. https://r4ds.hadley.nz.
Wilbur, W. John, and Karl Sirotkin. 1992. “The Automatic
Identification of Stop Words.” Journal of Information
Science 18 (1): 45–55. https://doi.org/10.1177/01655515920180010.
Young, Lori, and Stuart N. Soroka. 2012. “Affective News: The
Automated Coding of Sentiment in Political Texts.” Political
Communication 29 (2): 205–31.