基于改进SMOTE和随机森林算法的致密砂岩成岩相测井解释方法

2025年 47卷 第4期
阅读:35
查看详情
A Logging Interpretation Method for Tight Sandstone Diagenetic Facies Based on Improved SMOTE and Random Forest Algorithm
甄艳 康锦涛 赵晓明 葛家旺 代茂林
ZHENYan KANGJintao ZHAOXiaoming GEJiawang DAIMaolin
西南石油大学地球科学与技术学院, 四川 成都 610500 天然气地质四川省重点实验室, 四川 成都 610500
School of Geoscience and Technology, Southwest Petroleum University, Chengdu, Sichuan 610500, China Sichuan Key Laboratory of Natural Gas Geology, Chengdu, Sichuan 610500, China
成岩相测井解释是致密砂岩优质储层预测的关键,相较于常规的数理统计学方法,机器学习方法可以有效提高成岩相测井解释精度,但受样品数量不足的影响,其解释结果仍存在一定的多解性。为有效解决成岩相测井解释中样本数据不平衡问题,在经典SMOTE算法的基础上,顾及新增样本的空间约束,提出了一种RESMOTE算法,对不平衡数据中的少类样本进行新增,并利用随机森林模型进行成岩相的识别与解释。结果表明,RESMOTE算法优于经典SMOTE算法、Borderline-SMOTE算法和ADASYN算法,随机森林模型的精度从原来的77.27%提升至91.06%。采用RESMOTE算法可保证新增数据的准确性,有效解决了常规测井岩相识别分类方法中的过拟合和准确性不高的问题,对致密砂岩优质储层预测具有重要的应用价值。
Diagenetic logging interpretation is the key to predict high-quality tight sandstone reservoirs. Compared with conventional mathematical statistical methods, machine learning methods can effectively improve the accuracy of diagenetic facies logging interpretation. However, due to the insufficient number of samples, there are still some multi-solutions in the interpretation results. In order to effectively solve the problem of sample data imbalance in the logging interpretation of diagenetic facies, this paper considers the spatial constraints of new samples on the basis of the classical SMOTE (Synthetic Minority Over-sampling Technique) algorithm. This paper puts forward a RESMOTE (Repeat SMOTE) algorithm, which adds the few class samples in the imbalanced data, and uses the random forest model to identify and explain the diagenetic facies. The experimental results show that the RESMOTE algorithm is better than classical SMOTE algorithm, Borderline-SMOTE algorithm and ADASYN algorithm, and the accuracy of random forest model is improved from 77.27% to 91.06%. The RESMOTE algorithm can ensure the accuracy of new data, effectively solve the problem of over-fitting and low accuracy in conventional logging lithofacies identification and classification methods, and has important application value for the prediction of high quality tight sandstone reservoirs.
致密砂岩; 成岩相; RESMOTE算法; 随机森林; 测井解释;
tight sandstone; diagenetic facies; RESMOTE algorithm; random forest; log interpretation;
10.11885/j.issn.1674-5086.2022.05.04.01