Web4
/

LS-W4-Mini-RF_Addiction_Impact

Social Networks as a Service

Eval Results (legacy)

Model card Files Files and versions

LS-W4-Mini-RF_Addiction_Impact / README.md

Web4's picture

Update README.md

161e82d verified 7 months ago

|

history blame contribute delete

3.12 kB

	---
	license: mit
	language: en
	datasets:
	- adilshamim8/social-media-addiction-vs-relationships
	tags:
	- tabular-data
	- scikit-learn
	- random-forest
	- classification
	- addiction
	- social-media
	- Linkspreed
	- Web4
	- Social Networks as a Service
	model-index:
	- name: LS-W4-Mini-RF_Addiction_Impact
	results:
	- task:
	name: Tabular Classification
	type: tabular-classification
	dataset:
	name: Students Social Media Addiction
	type: adilshamim8/social-media-addiction-vs-relationships
	metrics:
	- name: Accuracy
	type: accuracy
	value: 0.93
	---

	# LS-W4-Mini-RF_Addiction_Impact

	## Model Summary

	This is a Random Forest Classifier trained to predict whether social media use affects a student's academic performance. The model is based on the "Social Media Addiction vs. Relationships" dataset from Kaggle, which contains survey responses from students aged 16 to 25.

	## Usage

	The model is packaged within a scikit-learn pipeline and can be easily loaded and used within any Python environment. It expects a pandas DataFrame with the same column structure as the original training data.

	```python
	import joblib
	import pandas as pd

	# Load the model
	model = joblib.load('LS-W4-Mini-RF_Addiction_Impact.joblib')

	# Example of new data to predict on
	new_data = pd.DataFrame({
	'Gender': ['Female'],
	'Academic_Level': ['Undergraduate'],
	'Most_Used_Platform': ['Instagram'],
	'Relationship_Status': ['Single'],
	'Age': [20],
	'Avg_Daily_Usage_Hours': [5.0],
	'Sleep_Hours_Per_Night': [6],
	'Mental_Health_Score': [7],
	'Addicted_Score': [8],
	'Conflicts_Over_Social_Media': [0]
	})

	# Make a prediction
	prediction = model.predict(new_data)
	print("Prediction (1 = Yes, 0 = No):", prediction)

	```

	## Training Data

	The model was trained on the public dataset [Social Media Addiction vs. Relationships](https://www.kaggle.com/datasets/adilshamim8/social-media-addiction-vs-relationships/data). The dataset consists of 705 records and 13 features with survey responses. The training data and the model file are available within the repository.

	## Model Details

	* Model Type: scikit-learn `RandomForestClassifier`
	* Pipeline Structure: The pipeline includes a `ColumnTransformer` for one-hot encoding categorical features and the `RandomForestClassifier` itself.
	* Key Hyperparameters: `n_estimators=100`, `random_state=42`.

	## Performance

	The model's performance was evaluated on a held-out test set from the original dataset.

	* Accuracy: 0.93

	## Limitations and Ethical Considerations

	* Not a Diagnostic Tool: This model should be used as a statistical tool for trend analysis and should not be used for clinical or psychological diagnosis of addiction. The data is based on self-reported survey responses.
	* Generalizability: The model was trained on a specific sample of students and may not generalize well to other populations, age groups, or time periods.
	* Data Bias: The model's predictions reflect the biases present in the original dataset. The results should be interpreted with caution.