Search
Now showing items 1-10 of 13
The ‘Hot Hand’ An Investigation into Streakiness in Shooting
(2022-05-23)
The concept of the ‘hot hand’ is highly debated in the fields of statistics, data science, and psychology. The overwhelming consensus for the past four decades has generally revolved around the concept that ‘hot handedness’ ...
Algorithmic Fairness, Metric Embedding, and Metric Learning
(2022-02-24)
As algorithms are increasingly used to classify people in contexts like criminal justice, college admissions, and advertising, it is important to ensure that these algorithms are socially responsible and treat people the ...
Optimizing Methods for Suicide Prediction
(2022-05-23)
Suicide is one of the leading causes of death worldwide, yet clinicians find it difficult to reliably identify individuals at high risk for suicide. Algorithmic approaches for suicide risk detection have been developed in ...
On the Effect of Ranger Patrols on Deterring Poaching: A Bayesian Approach for Causal Inference Using Field Tests as an Instrument
(2022-05-23)
Wildlife conservation relies on ranger patrols to detect and remove animal traps set out by poachers. While these efforts are important, conservation biologists seek to understand the impact of ranger patrols on future ...
The Algorithmic Foundations of Private Computational Social Science
(2022-08-12)
Social scientists, political scientists, economists, and healthcare researchers crucially rely on statistical methods to further the study of individuals, society, and human behavior via inferential analysis. Unfortunately, ...
Invariance versus Adversarial Learning in Domain Generalization with Applications to Neuroscience
(2022-05-23)
We explore the practical application of two modern domain
invariant representation-learning techniques for addressing the domain
generalization problem in statistical machine learning. Specifically, we
investigate the ...
Discriminative Sequence Models Extract Personally Identifiable Information from Public Gene Expression Datasets
(2022-05-25)
The growing scale of functional genomics datasets is enabling researchers to better understand the genetic determinants of gene expression, for example through expression quantitative trait loci (eQTL) studies.
With an ...
Learning Optimal Summaries of Clinical Time-series with Concept Bottleneck Models
(2022-05-23)
Despite machine learning models' state-of-the-art performance in numerous clinical prediction and intervention tasks, their complex black-box processes pose a great barrier to their real-world deployment. Clinical experts ...
Ending Research Subject Overexploitation: Methods to Reduce Respondent Overuse and Privacy Violations while Increasing Insights from Data
(2022-11-23)
The low price of data collection and use in the Internet age has facilitated collective ir- responsibility, where private companies, academics, and governments all fail to internalize the costs to respondents and other ...
OpenDP Programming Framework for Renyi Privacy Filters and Odometers
(2022-05-23)
Data scientists work with large-scale sensitive data, which inevitably leads to privacy risks. Differential Privacy (DP) is a mathematical definition of privacy that aims to mitigate privacy risks inherent in data analysis ...