The Traps That Must Be Encountered in Machine Learning Practices Hu Wei
Total Page:16
File Type:pdf, Size:1020Kb
Contents Instruction ....................................................................................................................................... 1 Program ........................................................................................................................................... 2 Abstracts On Gradient-Based Optimization: Accelerated, Stochastic and Nonconvex Michael I. Jordan .... 4 Value of Information Methods Le Bao ............................................................................................ 5 TBA Jian Cao ..................................................................................................................................... 6 Optimal covariance matrix estimation for high-dimensional noise in high-frequency data Jinyuan Chang ...................................................................................................................................... 7 Multivariate network meta-analysis made simple Yong Chen ........................................................ 8 Functional Canonical Correlation and Functional Prediction Di-Rong Chen ............................... 9 A Power One Test for Unit Roots Based on Sample Autocovariances Guanghui Cheng .............. 10 METRIC LEARNING VIA CROSS-VALIDATION Linlin Dai .................................................. 11 Dynamic Change Detection with False Discovery Rate Control Lilun Du .................................. 12 Valuing commodity options and futures options with changing economic conditions Kun Fan ............................................................................................................................................ 13 An extended Mallows model for ranked data aggregation Xiaodan Fan ..................................... 14 Some challenges in analyzing big data in health and medical research Bo Fu .......................... 15 Estimating Truncated Functional Linear Models with a Nested Group Bridge Approach Tianyu Guan ....................................................................................................................................... 16 Modeling Traffic Crash Risk Feng Guo ......................................................................................... 17 Moderate-Dimensional Inferences on Quadratic Functionals in Ordinary Least Squares Xiao Guo ........................................................................................................................................... 18 Local Inference in Additive Models with Decorrelated Local Linear Estimator Zijian Guo ..... 19 Nonlocal online RPCA for video denoising Zhi Han ..................................................................... 20 Oracle P-value and Variable Screening Ning Hao ........................................................................ 21 Inference in a mixture additive hazards cure model Haijin He ................................................... 22 The Pearson Correlation Between Tree-Shaped Data Sets: Estimating, Graphical Representation and Hypothesis Testing Jie Hu ............................................................................ 23 AI-Based Solution for Financial Risk Assessment and Fraud Detection Ling Huang ................ 24 Causal mediation of semicompeting risks Yen-Tsung Huang .......................................................... 25 Multiple Imputation on enhanced model identification for nonignorable nonresponse Jongho Im .......................................................................................................................................... 26 Generalized Four Moment Theorem and an Application to CLT for Spiked Eigenvalues of Large-dimensional Covariance Matrices Dandan Jiang ................................................................ 27 Functional-coefficient regression models with GARCH errors Jiancheng Jiang ......................... 28 Prediction of hospital readmission frailties with misspecified shared frailty models Xuejun Jiang ...................................................................................................................................... 29 The Operating Principle of Regularized Spectral Clustering Donggyu Kim ............................... 30 Discrepancy between global and local principal component analysis on large-panel high-frequency data Xin-Bing Kong ................................................................................................. 31 Optimal Estimation of Wasserstein Distance on Trees with An Application to Microbiome Studies Hongzhe Li ........................................................................................................................... 32 A supplement to Jiang's asymptotic distribution of the largest entry of a sample correlation matrix Deli Li .................................................................................................................................. 33 High-Dimensional Vector Autoregressive Time Series Modeling via Tensor Decomposition Guodong Li ........................................................................................................................................ 34 Tensor Analysis and Neuroimaging Applications Lexin Li ........................................................... 35 Statistical Learning for Personalized Wealth Management Yingying Li ..................................... 36 Mediation analysis for zero-inflated mediators Zhigang Li .......................................................... 37 Identiability and Non-Convex Algorithm for Multi-Channel Blind Deconvolution Song Li .... 38 A non-randomized multiple testing procedure for large-scale heterogeneous discrete hypotheses based on randomized tests Nan Lin ............................................................................ 39 Deep Neural Networks for Rotation-Invariance Approximation and Learning Shao-Bo Lin ....................................................................................................................................... 40 A Quantile Association-based Variable Selection Yuanyuan Lin ................................................... 41 Some Statistical Methods for Single-cell Genomics Zhixiang Lin ................................................. 42 Weighted multiple-quantile classifiers for functional data with application in multiple sclerosis screening Catherine Liu ..................................................................................................... 43 Optimal Covariance Matrix Estimation for High-dimensional Noise in High-frequency Data Cheng Liu .......................................................................................................................................... 44 Data-adaptive Kernel Support Vector Machine Xin Liu .............................................................. 45 Testing of covariate effects under ridge regression for high-dimensional data Xu Liu ............. 46 Towards Software-Defined Infrastructure for Decentralized Data Governance Xuanzhe Liu .. 47 Distributed learning from multiple EHR databases: Contextual embedding models for predicting medical events Qi Long ................................................................................................. 48 Wavelet Empirical Likelihood Estimator for Stationary and Locally Stationary Long Memory Processes Zhiping Lu ......................................................................................................... 49 GMV Prediction Using Driver Preference Shikai Luo .................................................................. 50 A Nonparametric Bayesian Approach to Simultaneous Subject and Cell Heterogeneity Discovery for Single Cell RNA-Seq Data Xiangyu Luo .................................................................. 51 A Versatile Estimation Procedure without Estimating the Nonignorable Missingness Mechanism Yanyuan Ma ................................................................................................................... 52 Matrix Completion under Low-Rank Missing Mechanism Xiaojun Mao .................................... 53 A mean field theory of two-layers neural networks Song Mei ..................................................... 54 A Dynamic Additive and Multiplicative Effects Model with Application to the United Nations Voting Behaviours Xiaoyue Niu ....................................................................................................... 55 A Super Scalable Algorithm for Short Segment Detection Yue Niu ............................................ 56 Improved doubly robust estimation in learning individualized treatment rules Yinghao Pan .. 57 Predicting terrorist events: opportunities and challenges Andre Python ..................................... 58 On the ‘Off-Label’ Use of Data Normalization for Sample Classification and Prognostication Li-Xuan Qin ....................................................................................................................................... 59 Adaptive Minimax Density Estimation for Huber’s Contamination Model under $L_p$ Losses Zhao Ren ................................................................................................................... 60 Dynamic Spatial Panel Data Models with Endogeneity and Common Factors Wei Shi ........... 61 Bridging the gap between noisy healthcare data and knowledge: automated translation of medical terminology Xu Shi ............................................................................................................ 62 Estimating the