Evaluation of standard and semantically-augmented distance metrics for neurology patients

doi:10.21203/rs.3.rs-20018/v4

Download PDF

Research article

Evaluation of standard and semantically-augmented distance metrics for neurology patients

https://doi.org/10.21203/rs.3.rs-20018/v4

This work is licensed under a CC BY 4.0 License

Journal Publication

published 26 Aug, 2020

Read the published version in BMC Medical Informatics and Decision Making →

You are reading this latest preprint version

Background: Patient distances can be calculated based on signs and symptoms derived from an ontological hierarchy. There is controversy as to whether patient distance metrics that consider the semantic similarity between concepts can outperform standard patient distance metrics that are agnostic to concept similarity. The choice of distance metric can dominate the performance of classification or clustering algorithms. Our objective was to determine if semantically augmented distance metrics would outperform standard metrics on machine learning tasks.

Methods: We converted the neurological findings from 382 published neurology cases into sets of concepts with corresponding machine-readable codes. We calculated patient distances by four different metrics (cosine distance, a semantically augmented cosine distance, Jaccard distance, and a semantically augmented bipartite distance). Semantic augmentation for two of the metrics depended on concept similarities from a hierarchical neuro-ontology. For machine learning algorithms, we used the patient diagnosis as the ground truth label and patient findings as machine learning features . We assessed classification accuracy for four classifiers and cluster quality for two clustering algorithms for each of the distance metrics.

Results: Inter-patient distances were smaller when the distance metric was semantically augmented. Classification accuracy and cluster quality were not significantly different by distance metric.

Conclusion: Although semantic augmentation reduced inter-patient distances, we did not find improved classification accuracy or improved cluster quality with semantically augmented patient distance metrics when applied to a dataset of neurology patients. Further work is needed to assess the utility of semantically augmented patient distances.

Medical Informatics

Neurology

Patient distances

Semantic augmentation

Ontologies

Machine learning

Patient clustering

Patient classification

Distance metrics

neurology

Table 1. Illustration of case abstraction method. The first column is findings from a case of Parkinson disease in Neuroanatomy through Clinical Cases [47] and is reproduced with the permission of the author. The second column is the abstractor’s interpretation of the finding, and the third column is the UMLS CUI [24].

Original Finding	Interpretation	CUI
“micrographia”	micrographia	C0240341
“mask-like decreased facial expression”	mask-like facies	C0424448
“asymmetrical bradykinesia”	bradykinesia	C0233565
“cogwheel rigidity”	cogwheel rigidity	C0151564
“en bloc turning”	difficulty turning body	C0555095
“Exhibited retropulsion of two steps when pulled gently backward”	retropulsion	C0277845
“no extinction of the glabellar reflex (Myerson sign)”	Myerson sign	C4293666
“4 Hz tremor of the head and all extremities, worse at rest”	resting tremor	C0234379
“Slow, stiff gait with stooped posture, short steps, decreased arm swing”	decreased arm swing	C2938985
	stooped posture	C4476759
	slow gait	C1851908
	marche a petit pas	C0427169

Table 2. Four test groups and 32 diagnoses used in clustering and classification analyses. The first column is an abbreviation used in Tables and Figures. Typical findings are listed illustratively for non-neurologists and are not meant to be a definitive reference on each condition.

	Test Group	Typical Findings	N
	Patient with weakness	Group 1	148
GBS	Guillain Barré syndrome*	weakness, areflexia, sensory loss, paresthesias	20
MYL	myelopathy	weakness, sensory level, urinary retention, hyperreflexia	29
CE	cauda equina	leg weakness, urinary retention, sensory loss	6
ALS	amyotrophic lateral sclerosis	weakness, hyperreflexia, fasciculations	21
MS	multiple sclerosis	weakness, sensory changes, hyperreflexia, diplopia	19
MYO	myopathy	proximal muscle weakness	15
MG	myasthenia gravis	weakness, diplopia, ptosis	18
PN	polyneuropathy	weakness, sensory loss, hyporeflexia	20
	Patient with abnormal movements	Group 2	75
HD	Huntington disease*	chorea, personality change	16
PAR	Parkinson disease*	tremor, bradykinesia, rigidity	19
PSP	progressive supranuclear palsy	bradykinesia, rigidity, gaze palsies	8
SND	striatonigral degeneration	bradykinesia, rigidity	8
ET	essential tremor	tremor	7
HB	hemiballismus	hemiballismus	4
DYS	dystonia	dystonia	9
WIL	Wilson disease*	tremor, ataxia, dystonia, bradykinesia, personality change	4
	Patient with altered mental status	Group 3	102
LBD	Lewy body dementia	dementia, bradykinesia, hallucinations	6
B12	B₁₂ deficiency	paresthesias, confusion, weakness, sensory loss	9
NPH	normal pressure hydrocephalus	urinary incontinence, dementia, gait apraxia	14
AW	acute Wernicke encephalopathy*	confusion, diplopia, ataxia, disorientation	19
CJD	Creutzfeldt-Jakob disease*	myoclonus, personality change, memory loss, disorientation	12
ALZ	Alzheimer disease*	amnesia, dementia	16
FTD	frontotemporal dementia	aphasia, dementia, executive dysfunction	14
SDH	subdural hematoma	headache, lethargy, weakness, confusion	12
	Patient with cranial neuropathy	Group 4	67
BPV	benign positional vertigo	vertigo	9
MNR	Meniere disease*	vertigo, dizziness, hearing loss	7
RH	Ramsay Hunt syndrome*	facial weakness, hearing loss	6
BEL	Bell palsy*	facial weakness	10
THD	third nerve palsy	diplopia, ptosis	8
AN	acoustic neuroma	tinnitus, hearing loss, nystagmus	11
ON	optic neuritis	blurred vision, papilledema	6
TN	trigeminal neuralgia	face pain	10

*The non-possessive form of eponymous diseases has been used uniformly [70]

Download PDF

Journal Publication

published 26 Aug, 2020

Read the published version in BMC Medical Informatics and Decision Making →

Editorial decision: Accept
12 Aug, 2020
Editor assigned by journal
10 Aug, 2020
Submission checks completed at journal
09 Aug, 2020
Editor invited by journal
09 Aug, 2020

You are reading this latest preprint version

Evaluation of standard and semantically-augmented distance metrics for neurology patients

Status:

Journal Publication

Version 4

Abstract

Figures

Full Text

Tables

difficulty turning body

Status:

Journal Publication

Version 4