Policy based synthesis: Data generation and augmentation methods for RF machine learning

Research output: Chapter in Book/Report/Conference proceedingConference contribution

4 Scopus citations

Abstract

The current dataset generation methods for RF Machine Learning (RFML) tasks consist of either completely synthetically generated data or completely raw digitized data from an RF front end. The synthetic datasets are often unrealistic in terms of waveforms or protocols, and the raw captures are typically unlabeled (or often mislabeled), and can skew machine learning algorithms to focus on non-salient features. Further, the associated storage and processing requirements are quite large. In this work, a novel dataset generation and augmentation method called policy-based synthesis is presented that aims to address the short-comings of either approach by combining basic protocol knowledge with simulated channel and device impairments to supplement over-the-air captures made in a controlled environment. This method permits the learning of salient features and regularizes radio and device anomalies that are not of interest. Practical considerations for collecting and processing data for this hybridized approach are also detailed and examples are provided on a dataset that includes protocols commonly used in the 2.4 GHz ISM band such as Bluetooth and Wi-Fi.

Original languageEnglish (US)
Title of host publicationGlobalSIP 2019 - 7th IEEE Global Conference on Signal and Information Processing, Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781728127231
DOIs
StatePublished - Nov 2019
Externally publishedYes
Event7th IEEE Global Conference on Signal and Information Processing, GlobalSIP 2019 - Ottawa, Canada
Duration: Nov 11 2019Nov 14 2019

Publication series

NameGlobalSIP 2019 - 7th IEEE Global Conference on Signal and Information Processing, Proceedings

Conference

Conference7th IEEE Global Conference on Signal and Information Processing, GlobalSIP 2019
Country/TerritoryCanada
CityOttawa
Period11/11/1911/14/19

All Science Journal Classification (ASJC) codes

  • Information Systems
  • Information Systems and Management
  • Artificial Intelligence
  • Computer Vision and Pattern Recognition
  • Signal Processing

Fingerprint

Dive into the research topics of 'Policy based synthesis: Data generation and augmentation methods for RF machine learning'. Together they form a unique fingerprint.

Cite this