Search
Now showing items 1-10 of 24
Evaluating Lineups and Complementary Play Styles in the NBA
(2017-10-13)
NBA coaches and general managers are tasked with building lineups and rosters that maximize their chances of winning. Further, basketball is a team sport where interactions between the players in a lineup can be integral ...
The effect of quasi-identifier characteristics on statistical bias introduced by k-anonymization
(2015-04-08)
The de-identification of publicly released datasets that contain personal information is necessary to preserve personal privacy. One such de-identification algorithm, k-anonymization, reduces the risk of the re-identification ...
The Differential Privacy of Bayesian Inference
(2015-04-08)
Differential privacy is one recent framework for analyzing and quantifying the amount of privacy lost when data is released. Meanwhile, multiple imputation is an existing Bayesian-inference based technique from statistics ...
Modeling, Inference and Optimization With Composable Differentiable Procedures
(2016-05-14)
This thesis presents five contributions to machine learning, with themes of differentiability and Bayesian inference.
We present Firefly Monte Carlo, an auxiliary variable Markov chain Monte Carlo algorithm that only ...
Bayesian Methods for Discovering Structure in Neural Spike Trains
(2016-05-18)
Neuroscience is entering an exciting new age. Modern recording technologies enable simultaneous measurements of thousands of neurons in organisms performing complex behaviors. Such recordings offer an unprecedented opportunity ...
A method for identifying predictive markers of mental illness in social media data
(2017-02-14)
Undiagnosed mental illness poses a significant health risk. In-person screenings to identify individuals at-risk of mental illness are expensive, time-consuming, and often inaccurate. This report presents an array of ...
Advances in Monte Carlo Variational Inference and Applied Probabilistic Modeling
(2018-05-10)
Galvanized by the accelerated pace and ease of data collection, researchers in more and more disciplines are turning to large, heterogeneous datasets to answer scientific questions. Divining insight from massive and complex ...
Modeling Musical Influence Through Data
(2018-06-29)
Musical influence is a topic of interest and debate among critics, historians, and general listeners alike, yet to date there has been limited work done to tackle the subject in a quantitative way. In this thesis, we address ...
Characterizing Posterior Uncertainty for the Indian Buffet Process
(2017-07-14)
Many problems in data science and machine learning require identifying latent features that occur in a set of observations. For example, given a set of images, we may want to determine what objects are contained in each ...
Dealing with Interference on Experimentation Platforms
(2018-09-16)
The theory of causal inference, as formalized by the potential outcomes framework, relies on an assumption that the experimental units are independent. When independence is not tenable, we say there is interference, and ...