Methods Inf Med 1998; 37(04/05): 315-326
DOI: 10.1055/s-0038-1634563
Original Article
Schattauer GmbH

Alternative Ways for Knowledge Collection, Indexing and Robust Language Retrieval

R. H. Baud
1   Division of Medical Informatics, Geneva University Hospital, Switzerland
,
C. Lovis
2   Department of Internal Medicine, Geneva University Hospital, Switzerland
,
A.-M. Rassinoux
3   Division of Biomedical Informatics, Vanderbilt University, Nashville, TN, USA
,
J.-R. Scherrer
1   Division of Medical Informatics, Geneva University Hospital, Switzerland
› Institutsangaben
Weitere Informationen

Publikationsverlauf

Publikationsdatum:
15. Februar 2018 (online)

Abstract

Definitions are provided of the key entities in knowledge representation for Natural Language Processing (NLP). Starting from the words, which are the natural components of any sentence, both the role of expressions and the decomposition of words into their parts are emphasized. This leads to the notion of concepts, which are either primitive or composite depending on the model where they are created. The problem of finding the most adequate degree of granularity for a concept is studied. From this reflection on basic Natural Language Processing components, four categories of linguistic knowledge are recognized, that are considered to be the building blocks of a Medical Linguistic Knowledge Base (MLKB). Following on the tracks of a recent experience in building a natural language-based patient encoding browser, a robust method for conceptual indexing and query of medical texts is presented with particular attention to the scheme of knowledge representation.

 
  • REFERENCES

  • 1 Wittgenstein LJJ. Philosophical Investigations. Oxford: Basil Blackwell; 1953
  • 2 de Saussure F. (1915) Cours de linguistique générale Bally & Sechehaye, Ed Payot. 1966
  • 3 Tuttle MS. et al Concept, Code, Term and Word: Preserving the Distinction, proceedings. SCAMC 95.;
  • 4 Ceusters W, Buekens F, De Moor G, Waag-meester A. The distinction between Linguistic and Conceptual Semantics in Medical Terminology and its Implication for NLP-Based Knowledge Acquisition. Proceedings of the IMIA WG6 Working Conference. Jacksonville, US; 1997
  • 5 Eco U. A Theory of Semiotics Bloomington. Indiana U.P.: 1976
  • 6 Pierce CS. Collected papers Cambridge. Harvard: University Press; 1931. -5.
  • 7 McCray AT. et al. Special Issue on Concepts, Knowledge, and Language in HealthCare Information Systems (IMIA), proceedings of the Working Conference of WG6. June 1994 Vevey, Switzerland: Meth Inform Med; 1995. (34) 1-4.
  • 8 Baud RH, Lovis C, Rassinoux A-M, Michel P-A, Scherrer J-R. Automatic Extraction of Linguistic Knowledge from an International Classification. In: Cesnik B. et al. eds. MEDINFO '98. Amsterdam: IOS Press; 1998: 581-5.
  • 9 Baud RH, Lovis C, Alpay L, Rassinoux AM, Scherrer JR, Nowlan A. Rector A Modelling for Natural Language Understanding. In: Safran C. (eds.) Proceedings of SCAMC 93. New York: McGraw-Hill, Inc; 1993: 289-93.
  • 10 Rector A. Compositional Models of Medical Concepts: Towards Re-usable Application-Independent Medical Terminologies. In: Barahona P, Christensen JP. Knowledge and Decisions in Health Telematics IOS Press. 1994: 109-14.
  • 11 Cimino JJ. et al. Managing Vocabulary for a Centralized Clinical System. Proceedings MEDINFO '95. Greene RA. et al. (eds.) IMIA. 1995: 117-20.
  • 12 McCray AT, Hole WT. The Scope and Structure of the First Version of the UMLS Semantic Network. In: Miller RA. (eds.) Proceedings of SCAMC 90 Los Alamitos: IEEE Computer Society Press. 1990: 126-30.
  • 13 Rassinoux AM, Miller RA, Baud RH, Scherrer JR. Modeling Just the Important and Relevant concepts in Medicine for Medical Language Understanding: A survey of the Issues Proceedings of the IMIA WG6 Working Conference. Jacksonville, US: 1997
  • 14 Côté RA, Rothwell DJ, Palotay JL, Beckett RS. SNOMED International Northfield, IL: College of American Pathologists. 1994
  • 15 Rezneck L. The Nature of disease. London: Routledge & Kegan; 1987
  • 16 Pratt AW. On the Matter of Medical Linguistics Proceedings MEDINFO '77. Shires DB. et al. (eds.) 1977 IMIA 223-4.
  • 17 Pacak MG, Cousineau L, White W. The segmentation Approach to Dictionary Construction Proceedings of the Annual Meeting of the Association of Canadian Pathologists. Sherbrooke, Canada: 1972
  • 18 Pacak MG, Norton LM, Dunham GS. Morpho-semantic Analysis of it is Forms in Medical Language. Meth Inform Med 1980; 19: 99-105.
  • 19 Norton LM, Pacak MG. Morpho-semantic Analysis of Compound Word Forms Denoting Surgical Procedures. Meth Inform Med 1983; 22: 29-36.
  • 20 Wingert W. Word Segmentation and Morpheme Dictionary for Pathology Data Processing Proceedings. MEDINFO '74. Stockholm, IFIP; 915-21.
  • 21 Rector AL, Rogers JE, Pole P. The GALEN High Level Ontology Medical Informatics Europe '96. Brender J. et al. (eds.) IOS Press; 1996: 174-8.
  • 22 Sowa JF. Conceptual Structures: Information Processing in Mind and Machine. Reading, MA: Addison-Wesley Publishing Company; 1984
  • 23 Rassinoux AM, Wagner JC, Lovis C, Baud RH, Rector A, Scherrer JR. Analysis of Medical Texts Based on a sound Medical Model. In: Gardner RM. (eds.) Proceedings of the 1995 AMIA Annual Fall Symposium (Formerly SCAMC). Philadelphia: Hanley & Belfus, Inc; 1995: 27-31.
  • 24 Baud RH, Rassinoux AM, Lovis C, Wagner J, Griesser V, Michel PA, Scherrer JR. Knowledge Sources for Natural Language Processing. In: Cimino JJ. (eds.) Proceedings of the 1996 AMIA Annual Fall Symposium (Formerly SCAMC). Philadelphia: Hanley & Belfus, Inc; 1996: 70-4.
  • 25 Baud RH, Lovis C, Rassinoux AM, Michel PA. et al. Towards a Medical Linguistic Knowledge Base. In: Greenes RA. et al. (eds.) Proceedings of MEDINFO 95. Alberta: HC&CC; 1995: 13-7.
  • 26 Lindberg DAB, Humphreys BL, McCray AT. The Unified Medical Language System. Meth Inform Med 1993; 32: 281-91.
  • 27 International Classification of Diseases Different versions World Health Organization. Geneva, Switzerland.:
  • 28 Cimino JJ. Formal Descriptions and Adaptative Mechanisms for Changes in Controlled Medical Vocabularies. Meth Inform Med 1996; 35: 202-10.
  • 29 Lovis C, Michel PA, Borst F, Baud RH, Griesser V. JR Scherrer: Medico-Economic Patient Encoding in the University Hospital of Geneva. Medical Informatics Europe '96. Brender J. et al. (eds.) IOS Press; 1996
  • 30 Lovis C, Michel PA, Baud RH, Scherrer JR. Word Segmentation Processing: A Way to Exponentially Extend Medical Dictionaries. Proceedings MEDINFO '95. Greenes RA. et al. (eds.) IMLA; 1995: 28-32.
  • 31 Lovis C, Baud RH, Rassinoux AM, Michel PA, Scherrer JR. Methodology Used and Implementation of a Multiclassification, Multilingual Patient Encoder. In: proceedings of Artificial Intelligence in Medicine Europe, AIME '97 Grenoble France. Keravnou E. et al. (eds.) Berlin, Heidelberg: Springer Verlag; 1997: 373-80.
  • 32 Ceusters W. et al. The ANTHEM Representation Formalism for the Alphabetic Index of ICD Proceedings MEDINFO '95. Greenes RA. et al. (eds.) IMIA; 1995: 113-6.