Loading articles...
News Update:
March 2026 Special Issue Announcement – Exclusive Publication Discounts: 60% for Editorial Board Members, 40% for Previous Authors, 25% for New Authors | Fast-Track Review (7–10 Days) | E-Certificate of Publication | Special Issue (Minimum 5 Papers) | Inviting Applications for Editorial Board Membership.
WhatsApp
Back to Articles
Computer Science Open Access Peer Reviewed

Privacy preserving clinical information extraction pipeline


Authors

Pradeep Kumar, Srimathi, Sanjeevi Vishnu, Senthil prakash*


Abstract

This paper presents a privacy-preserving Clinical Information Extraction Pipeline designed to extract structured medical knowledge from unstructured clinical narratives while ensuring robust patient data confidentiality. The proposed pipeline integrates state-of-the-art Natural Language Processing (NLP) techniques including Named Entity Recognition (NER), relation extraction, and medical concept normalization with privacy-enhancing technologies such as differential privacy, federated learning, and de-identification modules compliant with HIPAA and GDPR standards. Clinical entities such as diagnoses, medications, procedures, and laboratory findings are accurately identified and extracted without exposing personally identifiable information (PII). Experimental evaluations conducted on benchmark clinical datasets demonstrate that the pipeline achieves competitive extraction accuracy while maintaining strong privacy guarantees, with minimal utility loss. The results highlight the feasibility of deploying privacy-aware NLP systems in real-world healthcare environments, paving the way for secure secondary use of clinical data in medical research, pharmacovigilance, and clinical decision support systems.


Keywords

Clinical information extraction, privacy-preserving NLP, de-identification, differential privacy, federated learning, named entity recognition, electronic health records, HIPAA compliance.

Publication Details

Published In

Volume 1, Issue 1