Thanks to visit codestin.com
Credit goes to github.com

Skip to content

Commit a67c634

Browse files
docs(KDP): reorganising documentation for better UX
1 parent 052454e commit a67c634

File tree

4 files changed

+186
-2
lines changed

4 files changed

+186
-2
lines changed

docs/contributing.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -69,7 +69,7 @@ If you wan't to request a new feature or you have detected an issue, please use
6969
- `init repo structure`
7070
- `adding pre-processing unit-tests`
7171

72-
- [x] Once the code is ready create a Merge Request (MR) into the DEV branch with a proper naming convention
72+
- [x] Once the code is ready create a Merge Request (MR) into the MAIN branch with a proper naming convention
7373

7474
> The name of your MR should follow the same exact convention as your commits (we have a dedicated check for this in the CI):
7575

docs/kdp_overview.md

Lines changed: 142 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,142 @@
1+
# 💡 Understanding KDP
2+
3+
## 🎯 What is KDP?
4+
5+
KDP (Keras Data Processor) is a powerful preprocessing library designed to streamline and enhance data preparation for deep learning models. It combines modern deep learning techniques with traditional preprocessing methods to create a flexible and efficient data processing pipeline.
6+
7+
## 🌟 Key Features
8+
9+
### 1. 🔄 Unified Preprocessing
10+
- Single interface for all preprocessing needs
11+
- Seamless integration with Keras models
12+
- End-to-end differentiable pipeline
13+
14+
### 2. 🎛️ Advanced Feature Processing
15+
- **Numerical Features**
16+
- Multiple scaling options
17+
- Automatic outlier handling
18+
- Missing value imputation
19+
20+
- **Categorical Features**
21+
- Learned embeddings
22+
- Automatic vocabulary management
23+
- Handling of unknown categories
24+
25+
### 3. 🧠 Deep Learning Enhancements
26+
- **Tabular Attention**
27+
- Feature interaction modeling
28+
- Adaptive feature importance
29+
- Multi-head attention support
30+
31+
- **Feature Selection**
32+
- Automatic importance learning
33+
- Dynamic feature filtering
34+
- Interpretable weights
35+
36+
## 🏗️ Architecture Overview
37+
38+
```mermaid
39+
graph TD
40+
A[Raw Data] --> B[Feature Definition]
41+
B --> C[Preprocessing Model]
42+
C --> D[Feature Processing]
43+
D --> E[Deep Learning Extensions]
44+
E --> F[Processed Features]
45+
46+
subgraph "Feature Processing"
47+
D1[Numerical Processing]
48+
D2[Categorical Processing]
49+
end
50+
51+
subgraph "Extensions"
52+
E1[Tabular Attention]
53+
E2[Feature Selection]
54+
E3[Transformer Blocks]
55+
end
56+
```
57+
58+
## 💪 Why Choose KDP?
59+
60+
### 1. 🎯 Simplicity
61+
- Intuitive API design
62+
- Minimal boilerplate code
63+
- Clear documentation
64+
65+
### 2. 🚀 Performance
66+
- Optimized for large datasets
67+
- GPU acceleration support
68+
- Memory-efficient processing
69+
70+
### 3. 🔧 Flexibility
71+
- Customizable preprocessing
72+
- Extensible architecture
73+
- Framework agnostic
74+
75+
### 4. 🤝 Integration
76+
- Seamless Keras integration
77+
- Easy model export/import
78+
- Cloud platform support
79+
80+
## 🛠️ Core Components
81+
82+
### 1. Feature Definitions
83+
- Define data types and processing
84+
- Configure feature-specific parameters
85+
- Set preprocessing strategies
86+
87+
### 2. Preprocessing Model
88+
- Manages feature transformations
89+
- Handles data flow
90+
- Maintains state
91+
92+
### 3. Extensions
93+
- Add advanced capabilities
94+
- Enhance preprocessing
95+
- Improve model performance
96+
97+
## 📈 Use Cases
98+
99+
### 1. 📊 Tabular Data
100+
- Financial data processing
101+
- Customer analytics
102+
- Time series analysis
103+
104+
### 2. 🎯 Feature Engineering
105+
- Automatic feature selection
106+
- Feature interaction modeling
107+
- Dimensionality reduction
108+
109+
### 3. 🔄 Model Integration
110+
- Deep learning pipelines
111+
- AutoML systems
112+
- Production deployments
113+
114+
## 🚀 Getting Started
115+
116+
1. Check out our [Quick Start Guide](quick_start.md)
117+
2. Explore [Key Features](features.md)
118+
3. Try [Complex Examples](complex_examples.md)
119+
120+
## 📚 Learning Path
121+
122+
1. 🎓 **Beginner**
123+
- Basic feature definition
124+
- Simple preprocessing
125+
- Data transformation
126+
127+
2. 🏃 **Intermediate**
128+
- Advanced features
129+
- Custom preprocessing
130+
- Performance optimization
131+
132+
3. 🚀 **Advanced**
133+
- Extension development
134+
- Pipeline optimization
135+
- Production deployment
136+
137+
## 🔗 Next Steps
138+
139+
- [🛠️ Key Features](features.md)
140+
- [🚀 Quick Start](quick_start.md)
141+
- [📚 Complex Examples](complex_examples.md)
142+
- [🤝 Contributing Guide](contributing.md)

docs/quick_start.md

Lines changed: 42 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,42 @@
1+
# 🚀 Quick Start Guide
2+
3+
## 📦 Installation
4+
5+
```bash
6+
pip install keras-data-processor
7+
```
8+
9+
## 🎯 Basic Usage
10+
11+
### 1️⃣ Define Your Features
12+
13+
```python
14+
from kdp.processor import PreprocessingModel
15+
from kdp.features import NumericalFeature, CategoricalFeature
16+
17+
# Define features
18+
features = {
19+
"age": NumericalFeature(),
20+
"income": NumericalFeature(scaling="standard"),
21+
"occupation": CategoricalFeature(embedding_dim=32),
22+
"education": CategoricalFeature(embedding_dim=16)
23+
}
24+
```
25+
26+
### 2️⃣ Create Preprocessing Model
27+
28+
```python
29+
# Initialize the model
30+
model = PreprocessingModel(
31+
features=features,
32+
tabular_attention=True, # Enable attention mechanism
33+
feature_selection=True # Enable feature selection
34+
)
35+
```
36+
37+
38+
## 🔗 Useful Links
39+
40+
- [📚 Full Documentation](https://kdp.readthedocs.io)
41+
- [💻 GitHub Repository](https://github.com/piotrlaczkowski/keras-data-processor)
42+
- [🐛 Issue Tracker](https://github.com/piotrlaczkowski/keras-data-processor/issues)

mkdocs.yml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -37,7 +37,7 @@ theme:
3737

3838
nav:
3939
- 🏠 Home: index.md
40-
# - 💡 Understanding KDP: kdp_overview.md
40+
- 💡 Understanding KDP: kdp_overview.md
4141
- 🛠️ Key Features: features.md
4242
- 🏭 Factory System:
4343
- 🏗️ Layers Factory: layers_factory.md

0 commit comments

Comments
 (0)