Privacy-Preserving Clinical NLP

PLACID

Privacy-preserving Large language models for Acronym Clinical Inference and Disambiguation

Local inference  ·  Context-aware  ·  On-premises deployment

Product Concepts

Mobile translation and reviewer workbench.

A snapshot of two PLACID concepts: a patient-facing mobile note translator and a reviewer-facing clinical acronym workbench.

PLACID mobile app and web workbench demo screenshot

The Problem

Clinical notes are riddled with ambiguous acronyms.

Electronic health records contain tens of thousands of acronyms. "PT" could mean patient, physical therapy, prothrombin time, or posterior tibial, depending on context. Misinterpretation contributes to clinical errors and hinders downstream NLP tasks.

Existing solutions often require sending sensitive patient data to external APIs, creating governance and privacy barriers in healthcare. PLACID addresses both problems at once.

Acronym Ambiguity Example

PT
Patient
82%
Physical Therapy
10%
Prothrombin Time
6%
Posterior Tibial
2%

Distribution depends entirely on clinical context.

Methodology

Privacy as a core design constraint.

PLACID was built backwards from the constraint that no clinical text can leave the institution. Every component, from model selection through inference pipeline to evaluation harness, was chosen so the entire system can run on-premises with hospital-grade infrastructure.

An LLM system designed for hospital deployment.

Privacy is the single hardest constraint in clinical NLP. PLACID's design reflects that throughout: a model small enough to run on-prem, a context window long enough to capture the surrounding note, and an evaluation protocol built around real clinical notes rather than benchmark abstractions.

Engineering skills

Local LLM inference, prompt engineering, structured output, model quantisation.

Clinical skills

Clinical note analysis, acronym taxonomy, domain-specific evaluation.

Privacy skills

PHI handling, on-prem deployment, institutional review board alignment.

Evaluation skills

Ground-truth labelling, contextual accuracy metrics, comparative model studies.

At a glance

A model small enough to run on-prem, accurate enough to deploy.

The pipeline is deliberately compact so it can sit inside the institution's existing infrastructure without bespoke hardware procurement.

On-prem

Deployment

Local

Inference

PHI-safe

By design

01 Survey

Acronym taxonomy

Mapped the space of clinical acronyms and their expansions across specialties, identifying which acronyms are ambiguous and which are stable.

7B
13B
Quant.
CPU

02 Select

Model selection

Evaluated multiple open-weight LLMs against the privacy constraint, weighing accuracy against memory footprint and inference time on hospital-spec hardware.

03 Prompt

Contextual prompting

Designed prompts that supply the surrounding clinical note as context, so the model can disambiguate based on the same signals a clinician would use.

Accuracy

04 Validate

Empirical testing

Benchmarked PLACID against ground-truth labels from clinical experts, with per-acronym accuracy reported by specialty and acronym frequency.

On-prem

05 Deploy

Inside the institution

Packaged for on-premises deployment with no external API calls. PHI stays inside the institutional perimeter at every stage of inference.

The System

Three principles. One pipeline.

01 · Local Inference

Patient data never leaves the institution.

PLACID runs entirely on-premises. There are no external API calls and no data sharing with third-party providers — the same constraint clinical IT teams apply to every other PHI-handling system.

This privacy constraint shaped the system architecture: open-weight LLM selection, calibration for available hospital hardware, and the structure of the inference pipeline all followed from the deployment environment.

On-premises No external APIs PHI-safe
Hospital perimeter
PLACID
On-prem inference
No external calls

All PHI stays inside the institutional perimeter.

Context selector

"PT tolerated gait training and stairs."

Physical therapy78%
Prothrombin time8%
Patient14%

Same acronym, different context, different answer.

02 · Context-aware

The same acronym means different things in different notes.

PLACID disambiguates by looking at the surrounding clinical context, the same signal a clinician uses. "PT" in an orthopaedics discharge note rarely means prothrombin time; in a haematology lab report it almost always does.

The output is a probability distribution over candidate expansions rather than a single guess, so downstream systems and clinicians both have visibility into how confident the model is.

Contextual Probability distribution Specialty-aware

03 · Structured output

Machine-readable, designer-readable.

Every disambiguation is returned as a structured record — the original token, the expanded meaning, the confidence score, and the context window the model used. Downstream NLP tools can query it programmatically, and clinical reviewers can read it directly.

This structure is what makes PLACID composable. Other clinical pipelines can call it as a service without having to handle free-text parsing.

JSON output Composable Auditable
placid_response.json
{
"acronym": "PT",
"expansion": "Patient",
"confidence": 0.84,
"alternatives": [
["Physical therapy", 0.11],
["Prothrombin time", 0.04]
],
"context_used": "reports SOB after exertion"
}

Web Stack

A local LLM system integrated into the clinical workflow.

PLACID is designed as a deployable web application with the LLM embedded into the institutional stack. The browser workbench sends de-identified note text to a local inference service, receives structured acronym mappings, and renders confidence, alternatives, and evidence without clinical text leaving the hospital network.

The same local model service can support the reviewer workbench, patient-facing translation views, and downstream NLP pipelines through a consistent JSON contract.

Front end

Responsive clinical workbench

Browser-based note review, acronym highlighting, evidence panels, confidence meters, and accept/edit workflows.

LLM layer

Integrated local model

Open-weight LLM inference runs locally/on-prem, tuned for acronym expansion with surrounding note context.

Service contract

Structured JSON output

Returns acronym, expansion, confidence, alternatives, and context evidence for auditing and downstream reuse.

Deployment

No external API calls

Packaged for hospital infrastructure so PHI stays inside the institutional perimeter throughout inference.

Why It Matters

Cleaner records. Safer downstream AI.

On-prem

Deployment model

Context

Disambiguation signal

JSON

Composable output

Active

Project status