Título : Resampling methods for score likelihood ratio based inference for source attribution problems
Autor(es) : Veneri Guarch, Federico A.
Fecha de publicación : dic-2024
Tipo de publicación: Tesis de doctorado
Versión: Publicado
Supervisor(es) : Ommen, Danica M.
Publicado por: Iowa State University
Areas del conocimiento : Ciencias Naturales y Exactas
Matemáticas
Estadística y Probabilidad
Matemática Aplicada
Otros descriptores : Common source problem
Forensic Statistics
Score Likelihood Ratios
Source attribution
Specific source problem
Resumen : This dissertation addresses source attribution problems, an inferential task that contrasts two opposing propositions regarding the origin of items. These inferential problems arise in multiple domains but play a key role in forensic science. Due to the complexity of evidence found in practical applications, machine learning has been proposed as an alternative to evaluate the similarity between items when a probabilistic model is not feasible to construct a traditional Likelihood ratio. Score-based likelihood ratio inference hence provides an alternative framework to assess the strength of statistical evidence in this context. Our work focuses on the common and specific source inferential problems and addresses the dependence structure generated when creating training and estimation sets to develop these inferential systems. We present resampling plans to remedy these shortcomings and how ensemble learning approaches could strengthen the current methods. Chapter 2 introduces Strong Source Resampling (SSR), a source-aware resampling plan for the common source problem. This idea is extended to Weak Source Resampling (WSR) in Chapter 4. These resampling plans are the basis for developing base systems combined into a final value of evidence using an ensemble learning approach proposed in Chapter 2. Chapter 3 focuses on the specific source problem, introducing synthetic source anchoring, which uses synthetic items as data augmentation, allowing the development of specific source score likelihood ratios. Lastly, Chapter 4 introduces discrepancy metrics for score likelihood ratio-based inference that can be used to study model misspecification and the effects of not accounting for dependence. Simulation results and applications in both chapters suggest that combining ensemble learning with a source-aware resampling could provide stronger, more stable statistical evidence value in the correct direction for machine learning and simple score-based likelihood ratios. Chapter 5 provides general conclusions and some avenues for further research
URI / Handle: https://hdl.handle.net/20.500.12381/4051
Otros recursos relacionados: https://dr.lib.iastate.edu/handle/20.500.12876/dv6lp7Xz
DOI: https://doi.org/10.31274/td-20250502-145
Financiadores: Agencia Nacional de Investigación e Innovación
Becas de Posgrado Fulbright
Identificador ANII: POS_FUL_2019_1_1008440
Nivel de Acceso: Acceso abierto
Licencia CC: Reconocimiento 4.0 Internacional. (CC BY)
Aparece en las colecciones: Publicaciones de ANII

Archivos en este ítem:
archivo  Descripción Tamaño Formato
VeneriGuarch_iastate_0097E_21857-1-2.pdfDescargar 11.06 MBAdobe PDF

Las obras en REDI están protegidas por licencias Creative Commons.
Por más información sobre los términos de esta publicación, visita: Reconocimiento 4.0 Internacional. (CC BY)