| title: PromptPrepML | |
| emoji: π€ | |
| colorFrom: blue | |
| colorTo: indigo | |
| sdk: streamlit | |
| sdk_version: 1.28.0 | |
| python_version: 3.11 | |
| app_file: app.py | |
| pinned: false | |
| license: mit | |
| # PromptPrepML | |
| AI-Powered Machine Learning Data Preprocessing Assistant | |
| Upload your dataset and describe your preprocessing needs in natural language. Our intelligent system will automatically: | |
| - Detect and remove identifier columns | |
| - Extract date features | |
| - Handle categorical encoding | |
| - Scale numeric features | |
| - Generate ML-ready datasets | |
| - Create reusable pipelines | |
| ## π Features | |
| - **π§ Intelligent Preprocessing**: Smart column detection and processing | |
| - **π Automated EDA**: Comprehensive data analysis reports | |
| - **π§ Smart Feature Engineering**: Advanced feature extraction | |
| - **βοΈ Reusable Pipelines**: Scikit-learn pipelines for production | |
| - **π Clean Outputs**: ML-ready train/test datasets | |
| ## π Usage | |
| 1. Upload your CSV dataset | |
| 2. Describe your preprocessing needs in natural language | |
| 3. Click "Process Dataset" | |
| 4. Download your ML-ready results | |
| ## π οΈ Tech Stack | |
| - **Backend**: Python, FastAPI, scikit-learn | |
| - **Frontend**: Streamlit | |
| - **EDA**: ydata-profiling | |
| - **ML**: pandas, numpy | |