Back to search

The Insight-Inference Loop: Efficient Text Classification via Natural Language Inference and Threshold-Tuning

DSEID
DSEID-001-9894397
DOI
10.1177/00491241251326819
Journal
Sociological Methods & Research
Publisher
SAGE Publications
Published
2026-5
Status
metadata_only

Abstract

Modern computational text classification methods have brought social scientists tantalizingly close to the goal of unlocking vast insights buried in text data—from centuries of historical documents to streams of social media posts. Yet three barriers still stand in the way: the tedious labor of manual text annotation, the technical complexity that keeps these tools out of reach for many researchers, and, perhaps most critically, the challenge of bridging the gap between sophisticated algorithms and the deep theoretical understanding social scientists have already developed about human interactions, social structures, and institutions. To counter these limitations, we propose an approach to large-scale text analysis that requires substantially less human-labeled data, and no machine learning expertise, and efficiently integrates the social scientist into critical steps in the workflow. This approach, which allows the detection of statements in text, relies on large language models pre-trained for natural language inference, and a “few-shot” threshold-tuning algorithm rooted in active learning principles. We describe and showcase our approach by analyzing tweets collected during the 2020 U.S. presidential election campaign, and benchmark it against various computational approaches across three datasets.

Metadata is indexed. Open-access discovery has not completed for this record yet.

Publisher or DOI landing page

PDF

No local PDF is available.

GROBID Extracted text; discontinued.

This text is generated from TEI extraction for accessibility, search, and TTS. Formulas, tables, figures, page layout, and references may not perfectly match the original PDF.

No accessible text representation is available. The text extraction service has been discontinued for the time being. If you require this service, for accessibility or any other reason, please submit an issue/request on this page.

Metadata

Title
The Insight-Inference Loop: Efficient Text Classification via Natural Language Inference and Threshold-Tuning
Delta ID
DSEID-001-9894397
Authors
Sandrine Chausson, Marion Fourcade, David J. Harding, Björn Ross, Grégory Renard
Abstract source
crossref
Source URL
None
Access
closed_or_uncertain
Licence
unknown
PDF SHA-256
TEI SHA-256
GROBID

Issues

No public issues have been filed for this DOI.

Submit an issue

Record history

WhenEventFieldOldNew
2026-06-18 19:37:53.011249+00:00identifier_assignedDSEIDDSEID-001-9894397