Back to search

Data Imbalances in Coincidence Analysis: A Simulation Study

DSEID
DSEID-001-7709274
DOI
10.1177/00491241241227039
Journal
Sociological Methods & Research
Publisher
SAGE Publications
Published
2025-5
Status
failed

Abstract

In this paper, we investigate the conditions under which data imbalances, a common data characteristic that occurs when factor values are unevenly distributed, are problematic for the performance of Coincidence Analysis (CNA). We further examine how such imbalances relate to fragmentation and noise in data. We show that even extreme data imbalances, when not combined with fragmentation or noise, do not negatively affect CNA’s performance. However, an extended series of simulation experiments on fuzzy-set data reveals that, when mixed with fragmentation or noise, data imbalances may substantially impair CNA’s performance. Furthermore, we find that the performance impairment is higher when endogenous factors are imbalanced than when exogenous factors are concerned. Our results allow us to quantify these impacts and demarcate degrees at which data imbalances should be considered as problematic. Thus, applied researchers can use our demarcation guidelines to enhance the validity of their studies.

Ingestion failed: Traceback (most recent call last): File "/srv/app/app/worker.py", line 85, in run_once process_job(db, job) File "/srv/app/app/worker.py", line 39, in process_job pdf_path, info = fetch_pdf_temp(candidate["url"]) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/srv/app/app/downloader.py", line 129, in fetch_pdf_temp raise ValueError(f"PDF source returned HTTP {response.status_code}.") ValueError: PDF source returned HTTP 403.

PDF

No local PDF is available.

GROBID Extracted text; discontinued.

This text is generated from TEI extraction for accessibility, search, and TTS. Formulas, tables, figures, page layout, and references may not perfectly match the original PDF.

No accessible text representation is available. The text extraction service has been discontinued for the time being. If you require this service, for accessibility or any other reason, please submit an issue/request on this page.

Metadata

Title
Data Imbalances in Coincidence Analysis: A Simulation Study
Delta ID
DSEID-001-7709274
Authors
Martyna Daria Swiatczak, Michael Baumgartner
Abstract source
crossref
Source URL
https://journals.sagepub.com/doi/pdf/10.1177/00491241241227039
Access
open
Licence
cc-by
PDF SHA-256
TEI SHA-256
GROBID

Issues

No public issues have been filed for this DOI.

Submit an issue

Record history

WhenEventFieldOldNew
2026-06-18 19:37:53.011249+00:00identifier_assignedDSEIDDSEID-001-7709274