FeatureEngineering
A clean and modular Python toolkit for feature engineering, including data preprocessing, transformation, encoding, scaling and feature selection. Suitable for both exploratory data analysis and production workflows.
Install / Use
/learn @FinDii/FeatureEngineeringREADME
<h1 align="center">🔧 Feature Engineering Toolkit (Python)</h1>
<p align="center">
<img src="https://img.shields.io/badge/Python-3.8%2B-blue.svg" />
<img src="https://img.shields.io/badge/Status-Active-success.svg" />
<img src="https://img.shields.io/badge/Jupyter-Notebook-orange.svg" />
</p>
<p align="center">
A modular and reusable toolkit for performing feature engineering on structured datasets.<br>
This repository provides essential utilities for preprocessing, transforming, and optimizing features for quant finance and machine learning workflows.
</p>
📦 Overview
Feature engineering is one of the most critical steps in quant projects.
Here provides a clean pipeline and practical examples for:
- 🧹 Data preprocessing:missing value handling, outlier detection
- 📊 Exploratory Feature Analysis:time series analysis, classical technical indices, correlation, visual comparison
- 🔣 Feature Transformation:temporal dimension, Cross-sectional features, interaction, contextual dimension, demensional reduction
- 📐 Advanced Engineering:improved engineering methods based on the previous results and comparisons
- ✂️ Feature Selection:selection based on correlation changes, SHAP from Catboost models
This repository can serve both as a reference and a reusable feature engineering module.
📁 Project Structure
- Data used: data_cp.csv (too big for uploading)
- Notebooks:
- Technical Indices.ipynb
- TimeSeriesAnalysis.ipynb
- FeatureEngeering_basic.ipynb
