| | --- |
| | license: mit |
| | language: en |
| | datasets: |
| | - adilshamim8/social-media-addiction-vs-relationships |
| | tags: |
| | - tabular-data |
| | - scikit-learn |
| | - random-forest |
| | - classification |
| | - addiction |
| | - social-media |
| | - Linkspreed |
| | - Web4 |
| | - Social Networks as a Service |
| | model-index: |
| | - name: LS-W4-Mini-RF_Addiction_Impact |
| | results: |
| | - task: |
| | name: Tabular Classification |
| | type: tabular-classification |
| | dataset: |
| | name: Students Social Media Addiction |
| | type: adilshamim8/social-media-addiction-vs-relationships |
| | metrics: |
| | - name: Accuracy |
| | type: accuracy |
| | value: 0.93 |
| | --- |
| | |
| | # LS-W4-Mini-RF_Addiction_Impact |
| |
|
| | ## Model Summary |
| |
|
| | This is a **Random Forest Classifier** trained to predict whether social media use affects a student's academic performance. The model is based on the "Social Media Addiction vs. Relationships" dataset from Kaggle, which contains survey responses from students aged 16 to 25. |
| |
|
| | ## Usage |
| |
|
| | The model is packaged within a scikit-learn pipeline and can be easily loaded and used within any Python environment. It expects a pandas DataFrame with the same column structure as the original training data. |
| |
|
| | ```python |
| | import joblib |
| | import pandas as pd |
| | |
| | # Load the model |
| | model = joblib.load('LS-W4-Mini-RF_Addiction_Impact.joblib') |
| | |
| | # Example of new data to predict on |
| | new_data = pd.DataFrame({ |
| | 'Gender': ['Female'], |
| | 'Academic_Level': ['Undergraduate'], |
| | 'Most_Used_Platform': ['Instagram'], |
| | 'Relationship_Status': ['Single'], |
| | 'Age': [20], |
| | 'Avg_Daily_Usage_Hours': [5.0], |
| | 'Sleep_Hours_Per_Night': [6], |
| | 'Mental_Health_Score': [7], |
| | 'Addicted_Score': [8], |
| | 'Conflicts_Over_Social_Media': [0] |
| | }) |
| | |
| | # Make a prediction |
| | prediction = model.predict(new_data) |
| | print("Prediction (1 = Yes, 0 = No):", prediction) |
| | |
| | ``` |
| |
|
| | ## Training Data |
| |
|
| | The model was trained on the public dataset **[Social Media Addiction vs. Relationships](https://www.kaggle.com/datasets/adilshamim8/social-media-addiction-vs-relationships/data)**. The dataset consists of 705 records and 13 features with survey responses. The training data and the model file are available within the repository. |
| |
|
| | ## Model Details |
| |
|
| | * **Model Type**: scikit-learn `RandomForestClassifier` |
| | * **Pipeline Structure**: The pipeline includes a `ColumnTransformer` for one-hot encoding categorical features and the `RandomForestClassifier` itself. |
| | * **Key Hyperparameters**: `n_estimators=100`, `random_state=42`. |
| |
|
| | ## Performance |
| |
|
| | The model's performance was evaluated on a held-out test set from the original dataset. |
| |
|
| | * **Accuracy**: 0.93 |
| |
|
| | ## Limitations and Ethical Considerations |
| |
|
| | * **Not a Diagnostic Tool**: This model should be used as a statistical tool for trend analysis and should **not** be used for clinical or psychological diagnosis of addiction. The data is based on self-reported survey responses. |
| | * **Generalizability**: The model was trained on a specific sample of students and may not generalize well to other populations, age groups, or time periods. |
| | * **Data Bias**: The model's predictions reflect the biases present in the original dataset. The results should be interpreted with caution. |