Machine Learning
1. Computation Method
- A scalable bootstrap for massive data
2. Crowdsourcing
- Eliminating Spammers and Ranking Annotators for Crowdsourced Labeling Tasks
- Obtaining High-Quality Label by Distinguishing between Easy and Hard Items in Crowdsourcing
- Double or Nothing: Multiplicative Incentive Mechanisms for Crowdsourcing
- Repeated labeling using multiple noisy labelers
- Get Another Label? Improving Data Quality and Data Mining Using Multiple, Noisy Labelers
3. Dimension Reduction
- Visualizing Data using t-SNE)
- Umap: Uniform manifold approximation and projection for dimension reduction
4. Learning to Rank
- A Short Introduction to Learning to Rank
- DeepRank: A New Architecture for Relevance Ranking in Information Retrieval
5. Privacy
- Formal Privacy for Functional Data with Gaussian Perturbations
- Differential Privacy for Functions and Functional Data
- White-box vs Black-box- Bayes Optimal Strategies for Membership Inference
- Privacy Risk in Machine Learning- Analyzing the Connection to Overfitting
- Sublinear Space Private Algorithms Under the Sliding Window Model
6. Subspace Clustering
- Mixtures of Probabilistic Principal Component Analysers
- Robust Recovery of Subspace Structures by Low-Rank Representation
- A tutorial on subspace clustering
7. Test of Goodness of Fit
- A Kernelized Stein Discrepancy for Goodness-of-fit Tests
- A Linear-Time Kernel Goodness-of-Fit Test
8. Traditional Learning Theory
- Boosting the Margin_A New Explanation for the Effectiveness of Voting Methods
- The Weighted Majority Algorithm
- The Strength of Weak Learnability
- The Multiplicative Weights Update Method- A Meta-Algorithm and Applications
- Manifold Regularization: A Geometric Framework for Learning from Labeled and Unlabeled Examples