Generative Multimodal Models for Social Science: An Application with Satellite and Streetscape Imagery

Tina Law, Elizabeth Roberto

DSEID: DSEID-001-7213199
DOI: 10.1177/00491241251339673
Journal: Sociological Methods & Research
Publisher: SAGE Publications
Published: 2025-8
Status: metadata_only

Abstract

Although there is growing social science research examining how generative AI models can be effectively and systematically applied to text-based tasks, whether and how these models can be used to analyze images remain open questions. In this article, we introduce a framework for analyzing images with generative multimodal models, which consists of three core tasks: curation, discovery, and measurement and inference. We demonstrate this framework with an empirical application that uses OpenAI's GPT-4o model to analyze satellite and streetscape images ( n = 1,101) to identify built environment features that contribute to contemporary residential segregation in U.S. cities. We find that when GPT-4o is provided with well-defined image labels, the model labels images with high validity compared to expert labels. We conclude with thoughts for other use cases and discuss how social scientists can work collaboratively to ensure that image analysis with generative multimodal models is rigorous, reproducible, ethical, and sustainable.

Metadata is indexed. Open-access discovery has not completed for this record yet.

Publisher or DOI landing page

PDF

No local PDF is available.

GROBID Extracted text; discontinued.

This text is generated from TEI extraction for accessibility, search, and TTS. Formulas, tables, figures, page layout, and references may not perfectly match the original PDF.

No accessible text representation is available. The text extraction service has been discontinued for the time being. If you require this service, for accessibility or any other reason, please submit an issue/request on this page.

Metadata

Title: Generative Multimodal Models for Social Science: An Application with Satellite and Streetscape Imagery
Delta ID: DSEID-001-7213199
Authors: Tina Law, Elizabeth Roberto
Abstract source: crossref
Source URL: None
Access: closed_or_uncertain
Licence: unknown
PDF SHA-256
TEI SHA-256
GROBID

Issues

No public issues have been filed for this DOI.

Submit an issue

Record history

When	Event	Field	Old	New
2026-06-18 19:37:53.011249+00:00	identifier_assigned	DSEID		DSEID-001-7213199