FISH-DISSERTATION-2018.pdf (1.65 MB)
Download fileNew Models and Algorithms for Data Analysis
thesis
posted on 2018-07-27, 00:00 authored by Benjamin FishIn this thesis, we introduce and analyze new models and new algorithms for problems in data analysis. Many new challenges and constraints for data analysis have arisen as data analysis has become increasingly important. We tackle a few of these challenges here.
First, we explore the challenges that arise when data analysis done adaptively, so that previous inferences guide future analyst’s queries. We give faster algorithms for providing accurate responses to data analyst’s adaptive queries. Secondly, we initiate the study of the theory for learning from label proportions, which captures voting and similar non-standard settings. We compare this type of learning to classical supervised machine learning, and show examples for which learning from label proportions is possible. Finally, we tackle learning social networks from voting data, where individuals communicate in order to decide their votes. We show that modeling assumptions on how votes are influenced by the network can be brittle: a small change to the model may drastically change hardness of learning the network, as well as change the resulting learned networks.
History
Advisor
Reyzin, LevChair
Reyzin, LevDepartment
Mathematics, Statistics, and Computer ScienceDegree Grantor
University of Illinois at ChicagoDegree Level
- Doctoral