Learning to Rank Model Performance and Review with Pairwise Transformations

Learning to Rank Model Performance and Review with Pairwise Transformations

Iowa State University Capstones, Theses and Creative Components Dissertations Spring 2021 Learning to Rank Model Performance and Review with Pairwise Transformations Ajinkya Vijay Koshti Follow this and additional works at: https://lib.dr.iastate.edu/creativecomponents Part of the Operational Research Commons Recommended Citation Koshti, Ajinkya Vijay, "Learning to Rank Model Performance and Review with Pairwise Transformations" (2021). Creative Components. 757. https://lib.dr.iastate.edu/creativecomponents/757 This Creative Component is brought to you for free and open access by the Iowa State University Capstones, Theses and Dissertations at Iowa State University Digital Repository. It has been accepted for inclusion in Creative Components by an authorized administrator of Iowa State University Digital Repository. For more information, please contact [email protected]. Learning to Rank Model Performance and Review with Pairwise Transformations by Ajinkya Koshti A creative component submitted to the graduate faculty in partial fulfillment of the requirements for the degree of MASTER OF SCIENCE Major: Industrial Engineering Program of Study Committee: Dr. Sigurdur Olafsson, Major Professor Dr. Gary Mirka The student author, whose presentation of the scholarship herein was approved by the program of study committee, is solely responsible for the content of this creative component. The Graduate College will ensure this creative component is globally accessible and will not permit alterations after a degree is conferred. Iowa State University Ames, Iowa 2021 Copyright © Ajinkya Koshti, 2021. All rights reserved. ii DEDICATION I would like to dedicate this creative component to my parents, Seema, and Vijay Koshti. Without their constant encouragement throughout my engineering education, this work would not have been possible. iii TABLE OF CONTENTS Page LIST OF FIGURES .........................................................................................................................v LIST OF TABLES ......................................................................................................................... vi ACKNOWLEDGMENTS ............................................................................................................ vii ABSTRACT ................................................................................................................................. viii CHAPTER 1. INTRODUCTION ....................................................................................................9 CHAPTER 2. LITERATURE REVIEW .......................................................................................14 CHAPTER 3. PAIRWISE TRANSFORMATIONS .....................................................................16 The concept .............................................................................................................................. 16 CHAPTER 4. THE DATASET .....................................................................................................18 CHAPTER 5. ALGORITHMS ......................................................................................................20 Decision Trees ......................................................................................................................... 20 Recursive Binary Splitting ................................................................................................. 22 Hyperparameters ................................................................................................................ 23 Random Forest ......................................................................................................................... 25 Bagging .............................................................................................................................. 26 XGBoost .................................................................................................................................. 28 CHAPTER 6. TESTING METHODOLOGIES AND EVALUATION METRICS .....................30 K-fold Cross-Validation .......................................................................................................... 30 Stratified K-fold Cross-Validation .......................................................................................... 30 ROC Curve .............................................................................................................................. 31 Area Under the Curve (AUC) .................................................................................................. 32 CHAPTER 7. FEATURE SELECTION AND HYPERPARAMETER OPTIMIZATION ..........34 Feature Selection ..................................................................................................................... 34 Grid-Search Optimization ........................................................................................................ 35 Random-Search Optimization ................................................................................................. 35 CHAPTER 8. RESULTS ...............................................................................................................36 Decision Tree Model ............................................................................................................... 36 Decision Tree ..................................................................................................................... 36 Baseline Model Results ...................................................................................................... 37 Stratified K-fold Cross-Validation Results ........................................................................ 37 Area Under the Curve......................................................................................................... 38 Feature Importance Score ................................................................................................... 39 iv Model Performance after Feature Selection ....................................................................... 39 Stratified K-fold Cross-Validation Results after Feature Selection ................................... 40 Area Under the Curve after Feature Selection ................................................................... 40 Optimized Hyperparameters............................................................................................... 41 Model Performance after Hyperparameter Optimization ................................................... 41 Random Forest Model ............................................................................................................. 42 Baseline Model Results ...................................................................................................... 42 Stratified K-Fold Cross-Validation Results ....................................................................... 42 Area Under the Curve......................................................................................................... 43 Feature Importance Score ................................................................................................... 44 Model Performance After Feature Selection ...................................................................... 44 Stratified K-Fold Cross-Validation Results After Feature Selection ................................. 45 Area Under the Curve After Feature Selection .................................................................. 45 Optimized Hyperparameters............................................................................................... 46 Model Performance After Hyperparameter Optimization .................................................. 46 XGBoost Results ..................................................................................................................... 47 Baseline Model Performance ............................................................................................. 47 Stratified K-Fold Cross-Validation Results ....................................................................... 47 Area Under the Curve......................................................................................................... 48 Feature Importance Score ................................................................................................... 49 Baseline Model Results After Feature Selection ................................................................ 49 Stratified K-Fold Cross-Validation Results After Feature Selection ................................. 50 Area Under the Curve After Feature Selection .................................................................. 50 Optimized Hyperparameters............................................................................................... 51 Model Performance after Hyperparameter Optimization ................................................... 51 Borda Count ............................................................................................................................. 52 Borda count generated list example: .................................................................................. 53 CHAPTER 9. CONCLUSION.......................................................................................................54 APPENDIX. THE PYTHON FUNCTION....................................................................................59 v LIST OF FIGURES Page Figure 1. Decision Tree................................................................................................................

View Full Text

Details

  • File Type
    pdf
  • Upload Time
    -
  • Content Languages
    English
  • Upload User
    Anonymous/Not logged-in
  • File Pages
    61 Page
  • File Size
    -

Download

Channel Download Status
Express Download Enable

Copyright

We respect the copyrights and intellectual property rights of all users. All uploaded documents are either original works of the uploader or authorized works of the rightful owners.

  • Not to be reproduced or distributed without explicit permission.
  • Not used for commercial purposes outside of approved use cases.
  • Not used to infringe on the rights of the original creators.
  • If you believe any content infringes your copyright, please contact us immediately.

Support

For help with questions, suggestions, or problems, please contact us