VKAT Ganti PRO

vkatg

azithteja91

updated 4 models about 2 months ago

vkatg/exposureguard-policynet

Text Classification • Updated Mar 17 • 5

vkatg/exposureguard-dcpg-encoder

Feature Extraction • Updated Mar 17 • 2

vkatg/exposureguard-dagplanner

Text Classification • Updated Mar 17 • 2

vkatg/exposureguard-synthrewrite-t5

Text Generation • 0.2B • Updated Mar 15 • 2.33k

posted an update about 2 months ago

A patient's name in a clinical note. Their voice in an ASR transcript. A waveform header with their DOB. Three separate records, each one low-risk by itself. Together, they're enough to re-identify someone.

Per-record de-identification doesn't see this. It can't. It has no memory of what came before.
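To make the gap concrete, here is a minimal sketch of the two views. The risk values, the threshold, and the noisy-OR combination rule are illustrative assumptions chosen for the example, not figures or formulas taken from AMPHI.

```python
from math import prod

# Hypothetical per-record re-identification risks (illustrative values only).
records = [
    ("clinical_note", 0.30),    # patient name in text
    ("asr_transcript", 0.30),   # voice in a transcript
    ("waveform_header", 0.30),  # DOB in a header
]
THRESHOLD = 0.50  # assumed level at which stronger masking would be needed


def stateless_flags(records, threshold=THRESHOLD):
    """Per-record check: each record is judged in isolation."""
    return [risk >= threshold for _, risk in records]


def cumulative_risk(records):
    """Noisy-OR combination (an assumed rule): 1 - product of (1 - r_i)."""
    return 1.0 - prod(1.0 - risk for _, risk in records)


per_record = stateless_flags(records)
combined = cumulative_risk(records)

print(per_record)          # no single record crosses the threshold
print(round(combined, 3))  # the combined exposure does
```

A per-record pipeline only ever runs the first check, so it never sees the combined number.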

We built a system that does. AMPHI tracks PHI exposure across modalities and time, maintains a risk score per patient, and escalates masking automatically as exposure accumulates. When a text record and an audio record co-reference the same patient via embedding similarity, the system catches it and responds before the next record arrives.
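The loop described above can be sketched roughly as follows. This is a toy illustration under stated assumptions, not the AMPHI implementation: `ExposureTracker`, `PatientState`, the 0.9 link threshold, the tier names, and the noisy-OR risk update are all hypothetical stand-ins.

```python
from dataclasses import dataclass, field
from math import sqrt

# Every name and number here is an illustrative assumption, not the AMPHI API.
LINK_THRESHOLD = 0.9  # assumed cosine similarity needed to link two records
TIERS = [(0.0, "weak_mask"), (0.4, "pseudonymize"), (0.6, "redact")]


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = sqrt(sum(x * x for x in a)) * sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


@dataclass
class PatientState:
    embedding: list
    risk: float = 0.0
    modalities: set = field(default_factory=set)


class ExposureTracker:
    """Keeps one running risk score per (linked) patient."""

    def __init__(self):
        self.patients = []

    def _match(self, embedding):
        # Link to the most similar known patient, if similar enough.
        best = max(self.patients,
                   key=lambda p: cosine(p.embedding, embedding),
                   default=None)
        if best is not None and cosine(best.embedding, embedding) >= LINK_THRESHOLD:
            return best
        return None

    def observe(self, modality, embedding, record_risk):
        state = self._match(embedding)
        if state is None:
            state = PatientState(embedding=list(embedding))
            self.patients.append(state)
        # Accumulate exposure (noisy-OR), then pick the highest tier reached.
        state.risk = 1.0 - (1.0 - state.risk) * (1.0 - record_risk)
        state.modalities.add(modality)
        policy = [name for floor, name in TIERS if state.risk >= floor][-1]
        return policy, state.risk


tracker = ExposureTracker()
p1, r1 = tracker.observe("text", [1.0, 0.0, 0.1], 0.3)
p2, r2 = tracker.observe("audio", [0.98, 0.05, 0.12], 0.3)
p3, r3 = tracker.observe("waveform", [0.99, 0.02, 0.1], 0.3)
print(p1, p2, p3)
```

Because `observe` returns the policy as soon as a record is scored, the stronger masking tier is already in place when the next record for that patient shows up.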

The core results: the adaptive policy holds privacy at 0.991 on high-risk bursty workloads while keeping utility at 0.847. Static redaction matches the privacy number but destroys utility. Static weak masking keeps utility but leaks on high-risk bursts. The adaptive system doesn't trade one for the other.

Full system is open-source. Five models, three datasets, two demo spaces, 141 passing tests.

Spaces: [AMPHI Demo](vkatg/amphi-rl-dpgraph) | [DCPG Scorer](vkatg/dcpg-scorer-demo)

Models: [DCPG Encoder](vkatg/exposureguard-dcpg-encoder) | [Cross-Modal Risk Scorer](vkatg/dcpg-cross-modal-phi-risk-scorer) | [PolicyNet](vkatg/exposureguard-policynet) | [FedCRDT](vkatg/exposureguard-fedcrdt-distill) | [DAGPlanner](https://huggingface.co/vkatg/exposureguard-dagplanner)

Datasets: [Multimodal PHI Masking](vkatg/multimodal-phi-masking-benchmark) | [Streaming De-ID Benchmark](vkatg/streaming-phi-deidentification-benchmark) | [DAG Remediation Traces](vkatg/dag_remediation_traces)

Code: [phi-exposure-guard on GitHub](https://github.com/azithteja91/phi-exposure-guard)

updated a model about 2 months ago

vkatg/exposureguard-fedcrdt-distill

Text Classification • Updated Mar 14 • 7

updated a dataset about 2 months ago

vkatg/multimodal-phi-masking-benchmark

Viewer • Updated Mar 14 • 18k • 52

updated a Space about 2 months ago

Stateful Exposure-Aware De-Identification for Multimodal Streaming Data

Adaptive PHI de-identification for streaming multimodal data

updated a model about 2 months ago

vkatg/dcpg-cross-modal-phi-risk-scorer

Text Classification • Updated Mar 13 • 6

updated a Space about 2 months ago

DCPG Cross-Modal PHI Risk Scorer

Score PHI risk across clinical data modalities

updated 2 datasets about 2 months ago

vkatg/dag_remediation_traces

Viewer • Updated Mar 13 • 8.5k • 25

vkatg/streaming-phi-deidentification-benchmark

Viewer • Updated Mar 13 • 328 • 164
