Basketball, P. (2000). During the P. Basketball, H. F. Spirer, & L. Spirer (Eds.), Putting some Instance: Exploring Major Peoples Liberties Abuses Playing with Advice Solutions and you will Investigation Investigation. AAAS.
Belin, T. Roentgen., & Rubin, D. B. (1995). A method to possess calibrating incorrect-fits rates in number linkage. Log of Western Mathematical Organization, 90(430), 694–707.
Bilenko, Yards., & Mooney, Roentgen. J. (2003). Transformative Duplicate Recognition Having fun with Learnable Sequence Similarity Methods. For the KDD ’03 (pp. 39–48). ACM.
Christen, P. (2008). Automated Checklist Linkage Using Seeded Nearby Neighbor and Assistance Vector Servers Category. Into the KDD ’08 (pp. 151–159). ACM.
Christen, P. (2012). A study out-of indexing approaches for scalable listing linkage and deduplication. IEEE Transactions to your Degree and you will Research Technology, 24(9), 1537–1555.
Cohen, W., Raviku). An evaluation out-of string metrics to have complimentary names and you may facts. When you look at the KDD workshop into data clean and you may target consolidation (Vol. step 3, pp. 73–78).
Copas, J., & Hilton, F. (1990). Checklist linkage: Mathematical activities getting matching computers facts. Record of your own Regal Mathematical Area, Series A beneficial, 153(3), 287–320.
Dai, A great. Yards., & Storkey, A great. J. (2011). The fresh grouped author-question model to own unsupervised entity solution. Within the Fake neural networks and servers learning–icann Etiopia-naiset 2011 (pp. 241–249). Springer.
Fortini, Yards., Liseo, B., Nuccitelli, A great., & Scanu, Meters. (2001). On Bayesian List Linkage. Research for the Authoritative Analytics, 4(1), 185–198.
Gutman, R., Afendulis, C., & Zaslavsky, An excellent. (2013). An effective bayesian procedure for file linking to research avoid- of-life medical will cost you. Journal of your American Mathematical Association, 108(501), 34–47.
Hsu, W., Lee, Yards. L., Liu, B., & Ling, T. W. (2000). Mining Exploration inside Diabetics Databases: Conclusions and Findings. For the KDD ’00 (pp. 430–436). ACM.
A split-mix Markov strings Monte Carlo means of the latest Dirichlet techniques mix model
Jewell, N. P., Spagat, Yards., & Jewell, B. L. (2013). MSE and Casualty Matters: Assumptions, Translation, and Demands. From inside the T. B. Seybolt, J. D. Aronson, & B. Fischhoff (Eds.), Relying Civil Casualties: An introduction to Recording and you may Quoting Nonmilitary Fatalities in conflict. Oxford, UK: Oxford College or university Press.
Larsen, Yards. D. (2002)ments into Hierarchical Bayesian Checklist Linkage. For the Process of your combined analytical conferences, area into the questionnaire look strategies (pp. 1995–2000). Brand new Western Analytical Organization.
Steorts, R
Larsen, Yards. D. (2005). Enhances for the Checklist Linkage Idea: Hierarchical Bayesian Checklist Linkage Theory. Inside the Procedures of one’s shared mathematical conferences, area to the questionnaire look tips (pp. 3277–3284). The brand new American Analytical Relationship.
Larsen, M. D., & Rubin, D. B. (2001). Iterative automatic listing linkage having fun with blend patterns. Diary of Western Mathematical Organization, 96(453), 32–41.
Lum, K., Rate, M. E., & Banking institutions, D. (2013). Software regarding Numerous Expertise Quote into the Individual Rights Research. Brand new Western Statistician, 67(4), 191–200.
Marchant, N. G., C., Kaplan, A beneficial., Rubinstein, B. We. P., & Elazar, D. N. (2019). D-blink: Marketed prevent-to-prevent bayesian entity solution.
McCallum, An excellent., & Wellner, B. (2004). Conditional Models of Name Uncertainty having App in order to Noun Coreference. In the Enhances from inside the sensory advice control expertise (nips ’04) (pp. 905–912). MIT Force.
Miller, P. L., Frawley, S. J., & Sayward, F. G. (2000). IMM/Scrub: A domain name-Specific Product on Deduplication off Vaccination Record Ideas in the Youngsters Immunization Registriesputers and you will Biomedical Browse, 33(2), 126–143.
Murphy, J., Brackbill, R. M., Thalji, L., Dolan, M., Pulliam, P., & Walker, D. J. (2007). Calculating and Promoting Publicity internationally Trading Cardiovascular system Fitness Registry. Statistics for the Drug, 26(8), 1688–1701.
Murray, J. S. (2016). Probabilistic number linkage and you can deduplication shortly after indexing, clogging, and filtering. Diary of Privacy and you may Confidentiality, 7(1), 3–24.
Newcombe, H. B., Kennedy, J. M., Axford, S. J., & James, An effective. P. (1959). Automated linkage from public record information hosts are often used to extract” follow-up” statistics out of group from documents out of techniques details. Research, 130(3381), 954–959.
Sadinle, Meters. (2014). Detecting Copies during the a homicide Registry Using an effective Bayesian Partitioning Strategy. Annals out of Used Analytics, 8(4), 2404–2434.
Sariyar, Meters., Borg, A., & Pommerening, K. (2012). Productive Studying Suggestions for the fresh Deduplication of Electronic Patient Study Having fun with Group Trees. Log of Biomedical Informatics, 45(5), 893–900.
C., Hall, Roentgen., & Fienberg, S. Age. (2016). A great Bayesian Approach to Graphical Record Linkage and you may Deduplication. Journal of one’s Western Mathematical Association, 111(516), 1660–1672.
Tancredi, A beneficial., & Liseo, B. (2011). A good hierarchical Bayesian way of listing linkage and you can population proportions difficulties. Annals of Used Analytics, 5(2B), 1553–1585.