https://doi.org/10.5194/essd-2024-337
07 Oct 2024
Status: this preprint is currently under review for the journal ESSD.

cigFacies: a massive-scale benchmark dataset of seismic facies and its application

Hui Gao, Xinming Wu, Xiaoming Sun, Mingcai Hou, Hang Gao, Guangyu Wang, and Hanlin Sheng

Abstract. Seismic facies classification is crucial for seismic stratigraphic interpretation and hydrocarbon reservoir characterization, but it remains a tedious, time-consuming task that requires significant manual effort. Data-driven deep learning approaches are highly promising for automating seismic facies classification with high efficiency and accuracy, as they have already achieved significant success in similar image classification tasks in computer vision (CV). However, unlike the CV domain, seismic exploration lacks a comprehensive benchmark dataset for seismic facies, which severely limits the development, application, and evaluation of deep learning approaches to seismic facies classification. To address this gap, we propose a comprehensive workflow to construct a massive-scale benchmark dataset of seismic facies and evaluate its effectiveness in training a deep learning model. Specifically, we first develop a knowledge graph of seismic facies based on geological concepts and seismic reflection configurations. Guided by the graph, we then implement three strategies (field seismic data curation, knowledge-guided synthesization, and GAN-based generation) to construct a benchmark dataset of 8000 diverse samples for five common seismic facies. Finally, we use the benchmark dataset to train a network and apply it to two 3-D seismic datasets for automatic seismic facies classification. The predictions are highly consistent with expert interpretation results, demonstrating that the benchmark dataset is sufficiently diverse and representative to train a network that generalizes well to field data. We have made the dataset, the trained model, and the associated code publicly available for further research and validation of intelligent seismic facies classification.
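
As a rough illustration of how a benchmark like the one described in the abstract might be prepared for training, the sketch below indexes 8000 samples across five classes and three construction strategies, then makes a class-stratified train/validation split. This is not the authors' code. The class names (`facies_0` to `facies_4`), the even distribution of samples over classes and sources, the 20% validation fraction, and the function names are all assumptions made for illustration; the abstract states only the total of 8000 samples, the five facies, and the three strategies.

```python
import random
from collections import Counter

# Hypothetical placeholders: the abstract does not name the five facies.
FACIES = [f"facies_{i}" for i in range(5)]
# The three construction strategies named in the abstract.
SOURCES = ["field", "synthetic", "gan"]


def build_index(n_total=8000, seed=0):
    """Return shuffled (sample_id, facies, source) records.

    Classes and sources are assigned by cycling through them, which is an
    assumption for illustration, not the real dataset composition.
    """
    rng = random.Random(seed)
    records = [
        (i, FACIES[i % len(FACIES)], SOURCES[i % len(SOURCES)])
        for i in range(n_total)
    ]
    rng.shuffle(records)
    return records


def stratified_split(records, val_fraction=0.2, seed=0):
    """Split records so every facies class keeps the same validation share."""
    rng = random.Random(seed)
    by_class = {}
    for rec in records:
        by_class.setdefault(rec[1], []).append(rec)

    train, val = [], []
    for name in sorted(by_class):
        group = by_class[name][:]
        rng.shuffle(group)
        n_val = int(len(group) * val_fraction)
        val.extend(group[:n_val])
        train.extend(group[n_val:])
    return train, val


records = build_index()
train, val = stratified_split(records)
print(len(train), len(val))
print(Counter(rec[1] for rec in val))
```

Stratifying by class keeps each facies equally represented in the validation set, which matters when a benchmark mixes curated, synthesized, and generated samples and per-class accuracy is the quantity of interest.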

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this preprint. The responsibility to include appropriate place names lies with the authors.

Status: open (until 09 Dec 2024)

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor
  • RC1: 'Comment on essd-2024-337', Lorenzo Lipparini, 10 Nov 2024

Data sets

cigFacies: a massive-scale benchmark dataset of seismic facies. Hui Gao, Xinming Wu, Xiaoming Sun, Mingcai Hou, Hang Gao, Guangyu Wang, and Hanlin Sheng. https://zenodo.org/records/10777460

Model code and software

cigFaciesNet. Hui Gao, Xinming Wu, Xiaoming Sun, Mingcai Hou, Hang Gao, Guangyu Wang, and Hanlin Sheng. https://zenodo.org/records/13150879


Viewed

Total article views: 250 (including HTML, PDF, and XML)
  • HTML: 156
  • PDF: 81
  • XML: 13
  • Total: 250
  • BibTeX: 4
  • EndNote: 7

Viewed (geographical distribution)

Total article views: 241 (including HTML, PDF, and XML) Thereof 241 with geography defined and 0 with unknown origin.
Latest update: 20 Nov 2024
Short summary
We propose three strategies (field seismic data curation, knowledge-guided synthesization, and GAN-based generation) to construct a massive-scale, feature-rich, and highly realistic benchmark dataset of seismic facies, and we evaluate its effectiveness in training a deep learning model for automatic seismic facies classification.