CATEGORY: Ontologies and Machine Learning Techniques

SOURCE: MDPI Open Access Journals, Sensors, March 2024, 24(6), 1739;

Advanced Data Processing of Pancreatic Cancer Data Integrating Ontologies and Machine Learning Techniques to Create Holistic Health Records


George Manias1, Ainhoa Azqueta-Alzúaz2, Athanasios Dalianis3, Jacob Griffiths4, Maritini Kalogerini3, Konstantina Kostopoulou5, Eleftheria Kouremenou1, Pavlos Kranas6, Sofoklis Kyriazakos5, Danae Lekka5, Fabio Melillo7, Marta Patiño-Martinez2, Oscar Garcia Perales4, Aristodemos Pnevmatikakis5, Salvador Garcia Torrens8, Usman Wajid4 and Dimosthenis Kyriazis1

1Department of Digital Systems, University of Piraeus, 18534 Piraeus, Greece
2Facultad de Informática, Universidad Politécnica de Madrid, 28040 Madrid, Spain
3Athens Technology Center S.A., 15233 Athens, Greece
4Information Catalyst, S.L., 46800 Xàtiva, Spain
5Innovation Sprint, 1200 Brussels, Belgium
6LeanXscale, 28223 Madrid, Spain
7Engineering Ingegneria Informatica SpA, 00144 Rome, Italy
8Hospital de Denia Marina Salud S.A., 03700 Alicante, Spain
*Author to whom correspondence should be addressed.



The modern healthcare landscape is overwhelmed by data derived from heterogeneous IoT data sources and Electronic Health Record (EHR) systems. Based on the advancements in data science and Machine Learning (ML), an improved ability to integrate and process the so-called primary and secondary data fosters the provision of real-time and personalized decisions. In that direction, an innovative mechanism for processing and integrating health-related data is introduced in this article. It describes the details of the mechanism and its internal subcomponents and workflows, together with the results from its utilization, validation, and evaluation in a real-world scenario. It also highlights the potential derived from the integration of primary and secondary data into Holistic Health Records (HHRs) and from the utilization of advanced ML-based and Semantic Web techniques to improve the quality, reliability, and interoperability of the examined data. The viability of this approach is evaluated through heterogeneous healthcare datasets pertaining to personalized risk identification and monitoring related to pancreatic cancer. The key outcomes and innovations of this mechanism are the introduction of the HHRs, which facilitate the capturing of all health determinants in a harmonized way, and a holistic data ingestion mechanism for advanced data processing and analysis.


More Publications