Subscribe to RSS
DOI: 10.1055/s-0038-1635550
Discriminating Powers of Partial Agreements of Names for Linking Personal Records
Part I: The Logical BasisPublication History
Publication Date:
19 February 2018 (online)
Abstract:
Machines have difficulty when using people’s names to link medical and other records pertaining to the same individuals because of nicknames, ethnic synonyms, truncations, misspellings and typographical errors. Present algorithms used to compute the discriminating powers (or ODDS) associated with partial agreements of names are based, inappropriately, on the degrees of outward similarity alone. They are particularly ineffective in dealing with names that look alike but are unrelated, and with related names that have little apparent similarity. A fundamentally different rationale is, therefore, proposed which, like the human mind, assesses the relatedness of two alternative forms of a name in terms of how often they are used, interchangeably in practice. This must be taken into account if the associated discriminating powers (ODDS) are to be correctly computed. A way of implementing this more precise approach is described and illustrated, using the given names on linked records from an earlier epidemiological study. This first study of two describes the logical basis for record linkage, a second one the empirical test.
Key-words:
Record Linkage - Probabilistic Linkage - Name Comparisons - Partial Agreements - Value Specific Odds* Available from Ted Hill, Research and General Systems Subdivision, Systems Development Division, Rm 2405, Main Building, Statistics Canada, Tunney’s Pasture, Ottawa, Canada KIA OT6.
** Available from Howard B. Newcombe, P. O. Box 135, Deep River, Ontario, Canada KOJ IPO.
-
REFERENCES
- 1 Smith ME, Newcombe HB. Automated follow-up facilities in Canada for monitoring delayed health effects. Amer. J. Publ. Health 1980; 70: 1261-8.
- 2 Smith ME, Newcombe HB. Use of the Canadian mortality data base for epidemiological follow-up. Canad. J. Publ. Health 1982; 73: 39-46.
- 3 Howe GR, Lindsay J. A generalized iterative record linkage system for use in medical follow-up studies. Comp. Biomed. Res 1981; 14: 327-40.
- 4 Hill T. Generalized Iterative Record Linkage System: GIRLS (glossary, concepts, strategy guide, user guide). Ottawa Ont: Statistics Canada; 1981
- 5 Hill T, Pring-Mill F. Generalized iterative record linkage system: GIRLS (revised ed). Statistics Canada: 1985–6*
- 6 Newcombe HB. Handbook of Record Linkage: Methods for Health and Statistical Studies, Administration and Business. Oxford: Oxford University Press; 1988
- 7 Newcombe HB, Fair ME, Lalonde P. Concepts and practices that improve probabilistic linkage. Proceedings of the International Symposium on Statistical Uses of Administrative Data, 1987. Ottawa Ont: Statistics Canada; 1987. **
- 8 Eagen M, Hill BE. Record linkage – methodology and its application. Proceedings of. the International Symposium on Statistical Uses of Administrative Data, 1987. Ottawa Ont: Statistics Canada; 1987. *
- 9 Newcombe HB, Fair ME, Lalonde P. Discriminating powers of partial agreements of names for linking personal records. Part II: The empirical test. Meth. Inform. Med 1989; 28: 92-96.