CrackVision12K
We present the CrackVision12k dataset, a collection of 12,000 crack images derived from 13 publicly available crack datasets. The individual datasets were too small to effectively train a deep learning model. Moreover, the masks in each dataset were annotated using different standards, so unifying the annotations was necessary. To achieve this, we applied various image processing techniques to each dataset to create masks that follow a consistent standard.
Crack datasets inherently suffer from class imbalance. To mitigate this issue, we selected images containing crack pixels of more than 5000 pixels and applied data augmentation techniques such as Gaussian noise and rotation. Finally, there is a corresponding refined ground truth for each crack image across the dataset to ensure uniformity and reliability.
The 13 datasets we combined are as follows: Aigle-RN, ESAR, LCMS, CRACK500, CrackLS315, CRKWH100, CrackTree260, DeepCrack, GAPS384, Masonry, Stone331, CFD, and SDNet2018.
- Paper & Code: https://github.com/junegoo94/Hybrid-Segmentor
- Citation:
@misc{goo2024hybridsegmentorhybridapproachautomated,
title={Hybrid-Segmentor: A Hybrid Approach to Automated Fine-Grained Crack Segmentation in Civil Infrastructure},
author={June Moh Goo and Xenios Milidonis and Alessandro Artusi and Jan Boehm and Carlo Ciliberto},
year={2024},
eprint={2409.02866},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2409.02866},
}
Funding
Research Center on Interactive Media, Smart System and Emerging Technologies
European Commission
Find out more...