is a standard for health care data exchange, published by HL7®.
Provides metadata about the document so that the document can be discovered and managed
Through the Provenance one can
describe entities and processes involved in producing and delivering or otherwise influencing that resource
It is therefore natural to apply computer techniques to automatically generate structured data from the medical records. For that there are several implementations available both on the market and also fully open source. For example cTAKES, CLAMP, NOBLE, ClarityNLP and others are all freely available solutions targeting this task.
They all address the specific need of generating structured data from unstructured medical notes, however they all deliver the structure using their own format, that eventually could be converted into FHIR.
However, a central problem is on how to represent the Provenance of the extracted information, since FHIR is - to the best of my knowledge - missing the way of connecting to the precise location within the DocumentReference object of where the information has been extracted from , with which technology, and which is the level of "quality" of the extracted information.
Before submitting a Change Request https://gforge.hl7.org/gf/project/fhir/tracker/?action=TrackerItemBrowse to the FHIR normative, it is recommended to expose the issue to the widest community and the stackoverflow.com is one of the main recommended channels.
For this purpose I am hereby looking forward opinions on the matter, and namely on how to specify the provenance of FHIR Resources generated by applying NLP over medical narratives. For example, taking an example from the Adverse Event Corpus of Gurulingappa et al https://doi.org/10.1016/j.jbi.2012.04.008 ,
10030778|Intravenous azithromycin-induced ototoxicity.|ototoxicity|43|54|azithromycin|22|34
123456789012345678901234567890123456789012345678901234567890
1 2 3 4 5
The question is how to represent into FHIR that such drug induced problem has been extracted from the specific bytes positions 22-34 (drug) and 43-54 (problem) from the text (the Title of the paper 1999 in this example).
{
"resourceType": "AdverseEvent",
"id": "example",
"actuality": "actual",
"category": [
{
"coding": [
{
"system": "http://terminology.hl7.org/CodeSystem/adverse-event-category",
"code": "product-use-error",
"display": "Product Use Error"
}
]
}
],
"event": {
"coding": [
{
"system": "http://snomed.info/sct",
"code": "9062008",
"display": "Ototoxicity (disorder)"
}
],
"text": "10030778|Intravenous azithromycin-induced ototoxicity."
},
"subject": {
"reference": "Patient/example"
},
"date": "1999-02-29T00:00:00+00:00",
"seriousness": {
"coding": [
{
"system": "http://terminology.hl7.org/CodeSystem/adverse-event-seriousness",
"code": "Non-serious",
"display": "Non-serious"
}
]
},
"severity": {
"coding": [
{
"system": "http://terminology.hl7.org/CodeSystem/adverse-event-severity",
"code": "mild",
"display": "Mild"
}
]
},
"recorder": {
"reference": "Pharmacotherapy. 1999 Feb;19(2):245-8."
},
"suspectEntity": [
{
"instance": {
"reference": "Azithromycin"
}
}
]
}
Currently the FHIR standard does not allow to represent the precise byte position, the quality of the extraction, and the method used to perform it.