University College London
Browse
- No file added yet -

CrackVision12K

Download (1.58 GB)
dataset
posted on 2024-09-06, 09:36 authored by June Moh GooJune Moh Goo, Xenios Milidonis, Alessandro Artusi, Jan BoehmJan Boehm, Carlo CilibertoCarlo Ciliberto

We present the CrackVision12k dataset, a collection of 12,000 crack images derived from 13 publicly available crack datasets. The individual datasets were too small to effectively train a deep learning model. Moreover, the masks in each dataset were annotated using different standards, so unifying the annotations was necessary. To achieve this, we applied various image processing techniques to each dataset to create masks that follow a consistent standard.

Crack datasets inherently suffer from class imbalance. To mitigate this issue, we selected images containing crack pixels of more than 5000 pixels and applied data augmentation techniques such as Gaussian noise and rotation. Finally, there is a corresponding refined ground truth for each crack image across the dataset to ensure uniformity and reliability.

The 13 datasets we combined are as follows: Aigle-RN, ESAR, LCMS, CRACK500, CrackLS315, CRKWH100, CrackTree260, DeepCrack, GAPS384, Masonry, Stone331, CFD, and SDNet2018.


  1. Paper & Code: https://github.com/junegoo94/Hybrid-Segmentor
  2. Citation:

@misc{goo2024hybridsegmentorhybridapproachautomated,
title={Hybrid-Segmentor: A Hybrid Approach to Automated Fine-Grained Crack Segmentation in Civil Infrastructure},
author={June Moh Goo and Xenios Milidonis and Alessandro Artusi and Jan Boehm and Carlo Ciliberto},
year={2024},
eprint={2409.02866},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2409.02866},
}


Funding

Research Center on Interactive Media, Smart System and Emerging Technologies

European Commission

Find out more...

History