Healthcare Data Quality Assessment for Cybersecurity Intelligence
Li, Yang; Yang, Jiachen; Zhang, Zhuo; Wen, Jiabao; Kumar, Prabhat (2022-07-12)
Post-print / Final draft
Li, Yang
Yang, Jiachen
Zhang, Zhuo
Wen, Jiabao
Kumar, Prabhat
12.07.2022
IEEE Transactions on Industrial Informatics
IEEE
School of Engineering Science
Julkaisun pysyvä osoite on
https://urn.fi/URN:NBN:fi-fe2022080552892
https://urn.fi/URN:NBN:fi-fe2022080552892
Tiivistelmä
Considering the efficiency and security of healthcare data processing, indiscriminate data collection, annotation, and transmission are unwise. In this work, we propose the normalized double entropy (NDE) method to assess image data quality in the form of meta-task. In specific, the probability entropy and distance entropy are both adopted and normalized to evaluate the data quality. The experimental results show the stable ability of the NDE to distinguish good and bad data in terms of information contribution. Further, the model’s diagnostic performances driven by selected good and bad data are compared, and a
clear gap exists between them under the premise of the same amount of data. Screening 70% of the dataset can achieve almost the same accuracy as that based on all data. This work focuses on healthcare data quality and data redundancy and provides a practical evaluation tool to facilitate the identification and collection of valuable data, which is beneficial to improve efficiency and protect cybersecurity in healthcare systems.
clear gap exists between them under the premise of the same amount of data. Screening 70% of the dataset can achieve almost the same accuracy as that based on all data. This work focuses on healthcare data quality and data redundancy and provides a practical evaluation tool to facilitate the identification and collection of valuable data, which is beneficial to improve efficiency and protect cybersecurity in healthcare systems.
Lähdeviite
Li Y., Yang J., Zhang Z., Wen J., Kumar P. (2022). Healthcare Data Quality Assessment for Cybersecurity Intelligence. IEEE Transactions on Industrial Informatics. DOI: 10.1109/TII.2022.3190405
Alkuperäinen verkko-osoite
https://ieeexplore.ieee.org/document/9827613Kokoelmat
- Tieteelliset julkaisut [1463]