TY - GEN
T1 - Privacy Preserving Semantic Communications Using Vision Language Models
T2 - 2025 IEEE Military Communications Conference, MILCOM 2025
AU - Chang, Haoran
AU - Chen, Mingzhe
AU - Wang, Huaxia
AU - Zhang, Qianqian
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - Semantic communication has emerged as a promising paradigm for next-generation wireless systems, improving the communication efficiency by transmitting high-level semantic features. However, reliance on unimodal representations can degrade reconstruction under poor channel conditions, and privacy concerns of the semantic information attack also gain increasing attention. In this work, a privacy-preserving semantic communication framework is proposed to protect sensitive content of the image data. Leveraging a vision-language model (VLM), the proposed framework identifies and removes private-content regions from input images prior to transmission. A shared privacy database enables semantic alignment between the transmitter and receiver to ensure consistent identification of sensitive entities. At the receiver, a generative module reconstructs the masked regions using learned semantic priors and conditioned on the received text embedding. Simulation results show that generalizes well to unseen image processing tasks, improves reconstruction quality at the authorized receiver by over 10% using text embedding, and reduces identity leakage to the eavesdropper by more than 50%.
AB - Semantic communication has emerged as a promising paradigm for next-generation wireless systems, improving the communication efficiency by transmitting high-level semantic features. However, reliance on unimodal representations can degrade reconstruction under poor channel conditions, and privacy concerns of the semantic information attack also gain increasing attention. In this work, a privacy-preserving semantic communication framework is proposed to protect sensitive content of the image data. Leveraging a vision-language model (VLM), the proposed framework identifies and removes private-content regions from input images prior to transmission. A shared privacy database enables semantic alignment between the transmitter and receiver to ensure consistent identification of sensitive entities. At the receiver, a generative module reconstructs the masked regions using learned semantic priors and conditioned on the received text embedding. Simulation results show that generalizes well to unseen image processing tasks, improves reconstruction quality at the authorized receiver by over 10% using text embedding, and reduces identity leakage to the eavesdropper by more than 50%.
UR - https://www.scopus.com/pages/publications/105031772946
UR - https://www.scopus.com/pages/publications/105031772946#tab=citedBy
U2 - 10.1109/MILCOM64451.2025.11310050
DO - 10.1109/MILCOM64451.2025.11310050
M3 - Conference contribution
AN - SCOPUS:105031772946
T3 - Proceedings - IEEE Military Communications Conference MILCOM
SP - 1365
EP - 1370
BT - 2025 IEEE Military Communications Conference, MILCOM 2025
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 6 October 2025 through 10 October 2025
ER -