Severstal: Steel Defect Detection

Severstal: Steel Defect Detection 12th place solution overview The Team: Ilya Dobrynin @IlyaDo Pavel Yakubovskiy @qubvel ML trainings, 16.11.2019 Denis Kolpakov @kolpakovd Content 1. Task overview 2. Our team solution 3. Other teams solutions Task overview Task Overview Images 256x1600 (~ 1 GB) ● 12658 train ● 1801 test (public) ● ~3600 test (private) Defect masks ● 4 classes (unknown) ● rle encoded Metric Mean Dice = 2TP / (2TP + FP + FN) Task Overview ● 85.56 % empty masks (empty submit - 0.85560 LB) ● 247 images with defect of class “2” ● no masks intersection ● same train - test class distribution (LB probing) ● different “black area” train - test distribution ● different steel rolls for train/public/private * Task Overview ● not accurate masks ● image duplicates ● image duplicates with different masks https://www.kaggle.com/c/severstal-steel-defect-detection/discussion/107053#latest-621775 Task Overview Synchronous kernel-only competition Limitations: 1. No internet 2. 1 hour run-time 3. 30 hours/week with GPU The hard part - don't forget about “fair” submission code: Task Overview Holdout - 451 sample from the trainset, stratified by class. Local validation ~0.95 Public Leaderboard ~0.91 Steel Adversarial Validation - a classifier can distinguish whether it is a train or test image with 85% accuracy. Solution overview Best Solution Overview Images 3 step pipeline: empty MC 1. Multilabel Classification (MC) empty 2. Multilabel Segmentation (MS) MS - works as additional classifier non-empty 3. Binary Segmentation (BS) BS1 BS2 BS3 BS4 RLE Submission 1. Multilabel Classification Model Senet154, 3 folds Tips: Size Resize 128x800 1. Pseudolabels (+0.002 on LB): Augs Normalize, HFlip Confidence = np.mean(np.abs(np.subtract(prob_cls, 0.5))) Optim RAdam 2. Binarization threshold search based on f1 score. Loss BCE TTA None 2. Multilabel Segmentation Tips Models PSPNet on se-ResNext101_32x4d FPN on Senet154 1. Custom augmentation CropNonEmptyMaskIfExists: PSPNet on Senet154 crop image with defect area if one exists, else make random crop. Size Full images, Crops 256x256 2. Using Senet154 classifier from Stage 1 as a Augs Normalize, HFlip, VFlip, backbone for the segmentation. CropNonEmptyMaskIfExists 3. Remove small holes and objects < 128 px 4. Mask vanishing if the sum of pixels is less than Optim Adam, RAdam (700; 1300; 1000; 1300) Loss BCE, BCE + Jaccard 5. Thresholds (0.45; 0.45; 0.45; 0.45) TTA HFlip 3. Binary Segmentation ● Unet / FPN (SE-ResNeXt50 + SCSE attention in decoder) ● Only images with defects Unet FPN ● CropNonEmptyMaskIfExists 256x768 ● Train 60 epochs with MultiStepLR ● RAdam optimizer ● Loss: (1 - Dice) + Focal(gamma=2) ● Batch size: 8 images, no accumulation SE-ResNext50 ● 5 best checkpoints saving for SWA ● Train augmentations: Flip, Scale, Brightness, Illumination, JpegCompression ● Test augmentations: Flip LB score improvement over time 16 - multi-label classification senet154 + multi-label segmentation (6 models) 14 - add pseudolabels for classifier 13 - add 2 more models for multi-label segmentation 10 - mask min size threshold optimization 04 - add binary segmentation (1 model per class) as third stage 03 - ensemble of binary segmentation models on third stage What did not work ● multi-task training (classification + segmentation) Multi-task network ● binary classification (1 vs all) ● EfficientNets mask Encoder Decoder ● soft labels ● hard augmentations Global Pooling ● complex LR scheduling schemes Linear, Sigmoid ● hard examples mining label ● batchnorm fusion ● classification and segmentation soft merging ○ mean_prob = mean(mask[mask > threshold]) Random Shuffle 1st place solution ● Score 0.92124 / 0.90883 ● Classification Label “1” ○ EfficientNet-B1 ○ ResNet34 ● Large crops 224 x 1558 Label “0” ● Segmentation ○ 3 x Unet(EfficientNet-B3) ○ Public kernel models ● Defect blackout augmentation ● 2 rounds of pseudolabels What if “1st place solution” ● Score 0.91854 / 0.91023 - best submission 1. When the classifier output is high (fault), we leave the ● Classification + Segmentation pixel thresholds at their ● Custom merging approach normal level. 2. When the classifier output ● Pseudolabels is low (no fault), we raise ● RMS averaging costs 1st place the pixel threshold by some factor. ● Grayscale images Averaging Technique Public LB Private LB RMS 0.91844 0.90274 Mean 0.91699 0.90975 4th place solution ● Two head: segmentation + classification ● Inference soft gating: mask.sigmoid() * classifier.sigmoid() ● FPN, Unet ○ densenet201 ○ efficientnetb5 ○ resnet34 ○ seresnext50 ● Focal loss / BCE + Dice loss ● Training with grayscale ● Catalyst ● FP16 Apex Efficiency round Useful Tools [PyTorch] https://github.com/qubvel/segmentation_models.pytorch [PyTorch] https://github.com/qubvel/ttach [Keras] https://github.com/qubvel/segmentation_models [Keras] https://github.com/qubvel/efficientnet [Keras] https://github.com/qubvel/tta_wrapper [Everywhere] https://github.com/albu/albumentations Useful Papers Gradients https://medium.com/huggingface/training-larger-batches-practical-ti accumulation ps-on-1-gpu-multi-gpu-distributed-setups-ec88c3e51255 Stochastic Weight https://towardsdatascience.com/stochastic-weight-averaging-a-new Averaging -way-to-get-state-of-the-art-results-in-deep-learning-c639ccf36a Pseudolabels https://arxiv.org/abs/1904.04445 implementation Contacts Ilya Dobrynin Pavel Yakubovskiy Denis Kolpakov @llyaDo @qubvel @kolpakovd [email protected] [email protected] [email protected] @IlyaDo @qubvel @kolpakovdenis.

Severstal: Steel Defect Detection

Details

Download

Copyright

We respect the copyrights and intellectual property rights of all users. All uploaded documents are either original works of the uploader or authorized works of the rightful owners.

Support