SHEP

Open-source implementation of the paper:
"Shapley estimated explanation: A fast post-hoc attribution method for interpreting intelligent mechanical fault diagnosis"

Despite significant progress in intelligent fault diagnosis (IFD), the lack of interpretability remains a critical barrier to practical industrial applications, driving the growth of interpretability research in IFD. Post-hoc interpretability has gained popularity due to its ability to preserve network flexibility and scalability without modifying model structures. However, these methods often yield suboptimal time-domain explanations. Recently, combining domain transforms with SHAP has improved interpretability by extending explanations to more informative domains. Nonetheless, the computational expense of SHAP, exacerbated by increased dimensions from domain transforms, remains a major challenge.

To address this, we propose patch-wise attribution and SHapley Estimated Explanation (SHEP). Patch-wise attribution reduces feature dimensions at the cost of explanation granularity, while SHEP simplifies subset enumeration to approximate SHAP, reducing complexity from exponential to linear. Together, these methods significantly enhance SHAP's computational efficiency, providing feasibility for real-time interpretation in monitoring tasks. Extensive experiments confirm SHEP's efficiency, interpretability, and reliability in approximating SHAP. Additionally, with open-source code, SHEP has the potential to serve as a benchmark for post-hoc interpretability in IFD.

Notes

2026-01-23: The paper is published on Engineering Applications of Artificial Intelligence .
2025-10-12: modify SHEPs\Attribution_methods.py to save memory cost.
2025-04-29: The code of SHEP with full demo under different domains and patch sizes is uploaded.
2025-04-03: The preprint is available on .
2025-01-15: We will upload our code after the paper is accepted.

Repository Structure

Core Code

SHEPs/MultiDomain_Attribution.py: Implementation of multi-domain multi-method attribution (Time/Freq/Env/TF/CS domain & SHEP/SHAP method).
SHEPs/Attribution_methods.py: code of SHEP and SHAP.
SHEPs/DomainTransform.py: Signal processing of domain transforms and patch-wsie attribution technique.
SHEPs/utils_SHAP_MyIndependent.py: Modified SHAP utilities to support numpy.ndarray with dtype=object.

Demo Code

Besides, we also provide Demo Code of the simulation dataset and CWRU dataset. Run the Demo, and you will get the same experimental results as descripted in .

The repo structure is organized as follows:

├── Demo # the demo code of of the simulation dataset and CWRU dataset
│   ├── Datasets
│   ├── Models
│   ├── checkpoint
│   ├── train.py                      # 1) training the NN model
│   ├── Demo_attribution_statistic.py # 2) demo code of full attribution analysis and statistic the result
│   └── Demo_attribution.py           # 2) demo code of single attribution analysis
└── SHEPs
    ├── Attribution_methods.py      # SHEP and SHAP
    ├── DomainTransform.py          # signal processing (high-level)
    ├── MultiDomain_Attribution.py  # the main file of multi-domain multi-method attribution analysis
    ├── plot_func.py
    ├── utils_SHAP_MyIndependent.py # shap package modification to support numpy.ndarray with dtype=object
    ├── utils_Transform.py          # signal processing (low-level)
    └── utils_Visualization.py      # visualizating the attribution result

Quick Start

1. Install Dependencies

conda create -n env-SHAP python=3.12.3
conda activate env-SHAP
conda install numpy=1.26.4
pip3 install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118
conda install pandas matplotlib seaborn openpyxl scipy=1.13.1 scikit-learn shap=0.42.1

2. Download Datasets

Simulation: Auto-generated (saved to Demo/Datasets/Buffer-SimulationDataset).

CWRU：

Official: Case School of Engineering;
Mirror: Baidu NetDisk | Kuake NetDisk.

Organize as follows:

$CWRU_dir$ 
├── 12k Drive End Bearing Fault Data
├── 12k Fan End Bearing Fault Data
├── 48k DE fault data and normal baseline data
├── 48k Drive End Bearing Fault Data
└── Normal Baseline Data

3. Run Demos

Step 1: train the model:

# step 0: set the python environment as above, and set current path to the project path

# step 1: train the model for simulation dataset
python Demo/train.py --data_name 'Simulation'

# (step 1): Or train the model for CWRU dataset, but you need set the CWRU directory at the same time
python Demo/train.py --data_name 'CWRU' --data_dir $CWRU_dir$

The result is located in Demo/checkpoint/$checkpoint_name$.

Step 2: conduct attribution analysis:

you can choose different attribution-method / patch-level / domain to analyse the model in $checkpoint_name$ .

# step 2: You can conduct single attribution analysis
# --domain_mode ['time', 'frequency', 'envelope', 'STFT', 'CS','all'], default is 'frequency'
# --patch_mode ['0', '1', '2', '3', '4', '5'], the level of patch size where higher level means bigger patach and coarser granularity, default is '1'
# --method ['SHEP', 'SHAP', 'SHEP_Remove', 'SHEP_Add'], default is 'SHEP'
# --checkpoint_name, default is None which means the first $checkpoint_name$

python Demo/Demo_attribution.py # --domain_mode 'frequency' --patch_mode '1' --method 'SHEP' --checkpoint_name $checkpoint_name$

The result is located in Demo/checkpoint/$checkpoint_name$/PostProcess_of_Attribution_Analysis.

(Step 2):conduct full attribution analysis and statistic the results:

# (step 2): Or you can conduct full attribution analysis and statistic the results
# --checkpoint_name, default is None which means the first $checkpoint_name$

python Demo/Demo_attribution_statistic.py # --checkpoint_name $checkpoint_name$

The result is located in Demo/checkpoint/$checkpoint_name$/PostProcess_of_Attribution_Analysis/Stat.

(Tips) patch size settings:

| | #0 | #1 | #2 | #3 | #4 | #5 | | :------: | :----: | :----: | :----: | :-----: | :-----: | :-----: | | Time | (1,) | (3,) | (6,) | (12,) | (24,) | (48,) | | Freq | (1,) | (3,) | (6,) | (12,) | (24,) | (48,) | | Env | (1,) | (1,) | (2,) | (4,) | (8,) | (16,) | | TF | (1,2) | (1,5) | (2, 5) | (2, 10) | (4, 10) | (4, 20) | | CS | (1,1) | (1,3) | (2, 3) | (2, 6) | (4, 6) | (4, 12) |

Results Preview of Simulation Dataset

Dataset description

Parameter settings:

| Component | $f_c$ (kHz) | $f_m$ (Hz) | Health | Fault #1 | Fault #2 | | :-------: | :----------------: | :-------------------: | :----------: | :----------: | :----------: | | $C_0$ | 1.5 | 50 | $\checkmark$ | $\checkmark$ | $\checkmark$ | | $C_H$ | $\mathcal{U}(1,4)$ | $\mathcal{U}(20,200)$ | $\checkmark$ | | | | $C_1$ | 2.5 | 100 | | $\checkmark$ | | | $C_2$ | 3.5 | 125 | | | $\checkmark$ |

<html> <table style="width:100%; table-layout: fixed;"> <td align="center"> <strong>Dataset presentation</strong><br> <img src="./doc/SimuData.jpg" alt="Dataset presentation" width="50%"> </td> </table> </html>

Attribution visualization

Domain=frequency | Patch=#1 ：

<html> <table style="width:100%; table-layout: fixed;"> <tr> <td align="center"> <strong>SHEP-Remove</strong><br> <img src="./doc/frequency_1_SHEP_Remove_visualization.jpg" alt="SHEP-Remove" width="100%"> </td> <td align="center"><strong>SHEP-Add</strong><br><img src="./doc/frequency_1_SHEP_Add_visualization.jpg" alt="SHEP-Add" width="100%"></td> </tr> <tr> <td align="center"><strong>SHEP</strong><br><img src="./doc/frequency_1_SHEP_visualization.jpg" alt="SHEP" width="100%"></td> <td align="center"><strong>SHAP</strong><br><img src="./doc/frequency_1_SHAP_visualization.jpg" alt="SHAP" width="100%"></td> </tr> </table> </html>

Domain=CS | Patch=#1 ：

<html> <table style="width:100%; table-layout: fixed;"> <tr> <td align="center"> <strong>SHEP-Remove</strong><br> <img src="./doc/CS_1_SHEP_Remove_visualization.jpg" alt="SHEP-Remove" width="100%"> </td> <td align="center"><strong>SHEP-Add</strong><br><img src="./doc/CS_1_SHEP_Add_visualization.jpg" alt="SHEP-Add" width="100%"></td> </tr> <tr> <td align="center"><strong>SHEP</strong><br><img src="./doc/CS_1_SHEP_visualization.jpg" alt="SHEP" width="100%"></td> <td align="center"><strong>SHAP</strong><br><img src="./doc/CS_1_SHAP_visualization.jpg" alt="SHAP" width="100%"></td> </tr> </table> </html>

Attribution similarity (Demo_attribution_statistic.py)

<html> <table style="width:100%; table-layout: fixed;"> <tr> <td align="center"> <strong>Similarity matrix under Patch #1</strong><br> <img src="./doc/Similarity_Matrix_p1.jpg" alt="Similarity_Matrix_p1" width="100%"> </td> </tr> <tr> <td align="center"> <strong>Similarity box</strong><br> <img src="./doc/Similarity_Box.jpg" alt="Similarity box" width="100%">

SHEP

Install / Use

README

SHEP

Notes

Repository Structure

Core Code

Demo Code

Quick Start

1. Install Dependencies

2. Download Datasets

3. Run Demos

Results Preview of Simulation Dataset

Dataset description

Attribution visualization

Attribution similarity (Demo_attribution_statistic.py)