Machine Learning for Prediction in Hydraulic Fracturing
Published:
Read this work in Analytics Vidhya
In unconventional oilfield, fracturing is the only way to extract the oil from the subsurface. Designing fracturing parameter such as number of clusters per stage is critical to contribute to its success. A cluster is a set of perforations shot into the casing and repeated over a number of intervals or spacing. The group of clusters then form the stage that will be stimulated with the proppant. One stage may consist of as little as 1 to more than 15 clusters. Usually, frac engineer develops numerical model to design the fracturing. However, as complexity in the formation rises, a numerical model can be insufficient. Therefore, data-driven modeling can be useful.
In this work, an open-source dataset from SPE was used. The dataset consists of records from 53 wells consisting of reservoir temperature, pressure, water saturation, gas specific gravity, and perforation geometry. Feature selection using statistical R-test was done to reduce from 25 to only 13 useful features. Next, Decision Tree model was used as regressor. To handle with small dataset, Leave-One-Out Cross-Validation (LOOCV) was used as a strategy to avoid model overfitting. From this LOOCV, two hyperparameters of Decision Tree are determined to achieve the most desirable model.
The trained model was finally used to predict the number of clusters per stage given a mock case of a fracturing company. Modeling based on data-driven approach which is demonstrated in this work makes the fracturing design more efficient.