Western Sydney University · PhD Research

Explore biomedical data
with your voice.

MediVerse turns natural-language questions into validated SQL and renders the results as interactive 3D visualisations inside a head-mounted display. No code, no menus - just ask.

95%
retrieval accuracy
68.2%
cohort compression
1.1-1.7s
end-to-end latency
348
patients evaluated
About the research

A unified framework for immersive biomedical data analysis

MediVerse is the visualisation arm of a doctoral research programme at Western Sydney University that addresses four field-level gaps in biomedical informatics: the absence of validated hybrid edge-cloud architectures for real-time analytics, the lack of cohort-aware compression methods for omics data, limited integration of natural-language interfaces with immersive analytics, and the absence of a unified evaluation framework for AI-driven immersive visualisation.

Existing infrastructures remain fragmented, failing to simultaneously address scalability, efficiency, privacy, and accessibility for non-technical users. Multi-dimensional genomic, clinical, and imaging data also exceed what 2D desktop interfaces can effectively communicate. This work proposes a unified architecture that lets frontline clinicians and researchers query and explore complex datasets through their voice, inside a virtual reality environment - without writing SQL and without leaving an immersive workspace.

Impact at a glance

Numbers from the published evaluations

Validated on paediatric leukaemia and de-identified paediatric tumour cohorts.

95%
Voice-to-visualisation accuracy on the 20-query PRICAI benchmark.
68.2%
Lossless compression on 340-patient cytometry cohort (vs. 57.9% ZSTD).
57%
Faster upload through edge-side pre-processing and pseudonymisation.
1.7s
Worst-case end-to-end latency from voice to rendered 3D view.
Research program

Four interrelated contributions

The research is structured as four contributions covering how biomedical data are stored, compressed, analysed, and visualised. Three are already published; the fourth is in preparation.

01

Hybrid edge-cloud framework

A validated architecture distributing computation between edge devices (local pre-processing and pseudonymisation) and cloud infrastructure (scalable analytics) for real-time biomedical workflows.

56% data reduction57% faster uploadB-ALL & CLL cytometry
02

Trie-based shared dictionary compression

A lossless compression method that exploits global redundancy patterns across entire patient cohorts rather than individual files, using N-gram analysis to build dictionaries that are deployed back to edge nodes.

68.2% reduction vs. ZSTD 57.9%178 B-ALL + 162 CLL patients
03

MediVerse - voice-driven VR analytics

A generalised voice-driven immersive analytics architecture that translates natural-language questions into validated SQL via an LLM, then renders results as interactive 3D visualisations inside a head-mounted display.

95% retrieval accuracy95% correct visualisation1100-1740 ms end-to-end
04

IVEM - evaluation metric suite

A unified metric suite of six operationalised metrics across three dimensions - Rendering Quality, AI Decision Quality, and Explanation Quality - enabling diagnostic precision and cross-system comparison for AI-driven immersive visualisations.

6 metrics3 dimensionsManuscript in preparation
Demonstration

See MediVerse in action

A short demonstration of the voice-driven workflow on a paediatric leukaemia case study.

Peer-reviewed publications

Research output

The MediVerse programme is supported by a series of peer-reviewed publications covering the underlying edge-cloud framework, an efficient omics-data compression scheme, the immersive voice-driven analytics system itself, and a forthcoming evaluation metric suite.

PRICAI 2025 Conference paper

MediVerse: AI-Powered Interactive Voice-Driven Virtual Reality for Health Data Analytics

2025

R. Adam, D. R. Catchpoole, S. J. Simoff, Z. Qu, P. J. Kennedy, Q. V. Nguyen.

Pacific Rim International Conference on Artificial Intelligence (PRICAI 2025), pp. 35-50. Springer Nature Singapore.

95%retrieval accuracy 95%correct visualisation 1.1-1.7send-to-end
Abstract

This paper introduces MediVerse, a system that integrates voice-based natural-language interfaces, immersive VR visualisation, and large-language-model query translation to enable real-time, hands-free interaction with complex biomedical datasets. The platform uses head-mounted VR displays for voice input, a cloud-based LLM for schema-aware query interpretation, and real-time 3D rendering of the resulting data. Evaluation across 20 seed queries on paediatric leukaemia (247 patients) and a de-identified paediatric tumour gene-expression cohort (101 samples) demonstrated accurate user-intent interpretation, low-latency responsiveness, and the correct selection of immersive visualisations in 95% of cases.

JSAN Journal article

Lossless Compression with Trie-Based Shared Dictionary for Omics Data in Edge-Cloud Frameworks

2025

R. Adam, D. R. Catchpoole, S. J. Simoff, Z. Qu, P. J. Kennedy, Q. V. Nguyen.

Journal of Sensor and Actuator Networks, vol. 14, no. 2, art. 41 (2025).

68.2%compression vs. 57.9%ZSTD baseline 340patient cohort
Abstract

The growing complexity and volume of genomic and omics data present critical challenges for storage, transfer, and analysis on edge-cloud platforms. This paper introduces a lossless compression method that integrates Trie-based shared dictionaries within an edge-cloud architecture. N-gram analysis identifies repeated sequences across entire patient cohorts; the resulting global dictionary is deployed to edge nodes for localised preprocessing, reducing redundancy before cloud transmission, where additional compression is then applied. Evaluation on B-ALL (178 patients, 7.92 GB) and CLL (162 patients, 3.98 GB) datasets achieved 68.2% reduction in file size compared with 57.9% for traditional ZSTD - an additional saving of approximately US $676 per year at this scale - while maintaining compatibility with standard bioinformatics pipelines.

IDHDB Journal article

Novel Hybrid Edge-Cloud Framework for Efficient and Sustainable Omics Data Management

2024

R. Adam, D. R. Catchpoole, S. J. Simoff, P. J. Kennedy, Q. V. Nguyen.

Innovations in Digital Health, Diagnostics, and Biomarkers, vol. 4, pp. 81-88 (2024).

56%data reduction 57%faster upload ~$4.7k/yr saved (40 TB)
Abstract

The healthcare landscape is rapidly integrating diverse data sources - electronic health records, omics, and genomic data - into unified patient profiles, enhancing personalised medicine and interoperability. This transformation, however, faces growing challenges in data integration and analysis. This study introduces a novel hybrid edge-cloud framework designed to manage the surge of multidimensional genomic and omics data in healthcare. It combines the localised processing of edge computing with the scalable resources of cloud computing, evaluated on simulated B-cell acute lymphoblastic leukaemia (B-ALL) and chronic lymphocytic leukaemia (CLL) cytometry datasets. The architecture achieved 56% data reduction, 57% faster upload times, enhanced privacy through local pseudonymisation, and projected annual cost savings of approximately US $4,679 on a 40 TB workload.

In preparation Pending submission

IVEM: A Unified Metric Suite for AI-Driven Visualisation in Virtual Reality

2026

R. Adam, D. R. Catchpoole, S. J. Simoff, Z. Qu, P. J. Kennedy, Q. V. Nguyen.

Manuscript in preparation.

6operationalised metrics 3quality dimensions
Abstract

Existing evaluation practices for AI-driven immersive visualisation systems rely on static, single-axis metrics that fail to capture the joint quality of rendering, model decisions, and post-hoc explanation. IVEM (Immersive Visualisation Evaluation Model) introduces a unified metric suite of six operationalised metrics across three dimensions - Rendering Quality, AI Decision Quality, and Explanation Quality - validated on paediatric leukaemia and de-identified paediatric tumour gene-expression datasets. The suite enables diagnostic precision and objective cross-system comparison for AI-driven immersive analytics, complementing the MediVerse architecture introduced in earlier work.

Research team

Supervisory panel

A cross-institutional team spanning computer science, AI, biomedical research, and paediatric oncology.

Candidate

Rani Adam

PhD candidate, School of Computer, Data and Mathematical Sciences, Western Sydney University.

Primary supervisor

Assoc. Prof. Quang Vinh Nguyen

School of Computer, Data and Mathematical Sciences, Western Sydney University.

Co-supervisor

Prof. Daniel R. Catchpoole

The Tumour Bank, Children's Cancer Research Unit, Kids Research, The Children's Hospital at Westmead; The University of Sydney; UTS.

Co-supervisor

Prof. Simeon J. Simoff

School of Computer, Data and Mathematical Sciences, Western Sydney University.

Co-supervisor

Dr. Zhonglin Qu

School of Computer, Data and Mathematical Sciences, Western Sydney University.

Co-supervisor

Prof. Paul J. Kennedy

Australian Artificial Intelligence Institute, Faculty of Engineering and IT, University of Technology Sydney.

Institutional partners

Where the research is hosted

MediVerse is a research collaboration supported by an Australian Government Research Training Program scholarship.

Western Sydney University
Children's Hospital at Westmead
Get in touch

Demos, collaborations, and beta access

If you would like to see a live demonstration, discuss a collaboration, or request beta access to the platform, please reach out.

rani.adam@westernsydney.edu.au