| | --- |
| | library_name: tf-keras |
| | tags: |
| | - structured-data |
| | - tabular-data |
| | - classification |
| | --- |
| | # Tensorflow Keras Implementation of Structured data classification from scratch |
| | This repo contains models and notebook for [Structured data classification from scratch](https://keras.io/examples/structured_data/structured_data_classification_from_scratch/). |
| | This example demonstrates how to do structured data classification, starting from a raw CSV file. Our data includes both numerical and categorical features. We will use Keras preprocessing layers to normalize the numerical features and vectorize the categorical ones. |
| | Full credits to [François Chollet](https://twitter.com/fchollet), creator of Keras! |
| |
|
| | ## Model description |
| |
|
| | The model is a very simple MLP with only one hidden layer. |
| | This example showcases how to perform preprocessing of common tabular data *inside* a Keras model. |
| | It uses tensorflow.keras.layers.{IntegerLookup, Normalization, StringLookup} to process numerical and categorical (integer or string) features. |
| |
|
| |
|
| |
|
| | ## Intended uses & limitations |
| |
|
| | This tool does not provide medical advice It is intended for informational purposes only. It is not a substitute for professional medical advice, diagnosis or treatment. |
| |
|
| | ## Training and evaluation data |
| |
|
| | [Our dataset](https://archive.ics.uci.edu/ml/datasets/heart+Disease) is provided by the Cleveland Clinic Foundation for Heart Disease. It's a CSV file with 303 rows. Each row contains information about a patient (a sample), and each column describes an attribute of the patient (a feature). We use the features to predict whether a patient has a heart disease (binary classification). |
| | The model is trained on 80% of data and evaluated on remaining 20%. |
| |
|
| | ## Training procedure |
| | Training proceeds for 50 epochs with default Adam optimizer on binary crossentropy. |
| |
|
| | ### Training hyperparameters |
| |
|
| | The following hyperparameters were used during training: |
| | - optimizer: {'name': 'Adam', 'learning_rate': 0.001, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-07, 'amsgrad': False} |
| | - training_precision: float32 |
| |
|
| | ## Training Metrics |
| | Training achieves >83% accuracy on the held-out validation data. |
| | Check TensorBoard "Metrics" tab above for details. |
| | ## Model Plot |
| |
|
| | <details> |
| | <summary>View Model Plot</summary> |
| |
|
| |  |
| |
|
| | </details> |