Seminar on Data matching research at the Australian National University

  Title: Data matching research at the Australian National University Speakers: Prof Peter Christen, Mr Thilina Ranbaduge Research School of Computer Science ANU College of Engineering and Computer Science The Australian National University Canberra ACT 0200 Australia peter.christen@anu.edu.au http://users.cecs.anu.edu.au/~christen/ thilina.ranbaduge@anu.edu.au https://cecs.anu.edu.au/people/thilina-ranbaduge Abstract: Techniques for matching and integrating data from different

Start

End

 

Title: Data matching research at the Australian National University

Speakers: Prof Peter Christen, Mr Thilina Ranbaduge

Research School of Computer Science
ANU College of Engineering and Computer Science
The Australian National University
Canberra ACT 0200
Australia

peter.christen@anu.edu.au
http://users.cecs.anu.edu.au/~christen/

thilina.ranbaduge@anu.edu.au
https://cecs.anu.edu.au/people/thilina-ranbaduge

Abstract:

Techniques for matching and integrating data from different sources are
becoming increasingly important in many data science and data analytics
projects. Application domains of data matching include health and
social science research, national censuses, social welfare, crime and
fraud detection, and national security.

Today, data matching (also known as entity resolution, duplicate
detection, and record linkage) not only faces computational challenges
due to the increasing size of data collections and their complexity,
but also operational challenges as many applications move from static
environments into real-time processing and analytics of potentially
large and fast data streams, where real-time matching of records is
required. Finally, with the growing concerns by the public of the use
of their data, privacy and confidentiality often need to be considered
when personal information is being matched and shared between
organisations.

This talk will start with a short introduction to data matching and
describe the above discussed challenges in more detail, and then
provide an overview of three areas of research conduct in data matching
at the Australian National University:
1) Techniques for scalable real-time record linkage
2) Advanced classification techniques based on active learning
3) Privacy-preserved linking of multiple large databases

Biographies:

Peter Christen is a Professor at the Australian National University
(ANU) Research School of Computer Science. He graduated with a PhD in
Computer Science in 1999 from the University of Basel, Switzerland,
and has been at the ANU since 2000. He has led various research
projects, including industry collaborations with NSW Health, Google,
and Fujitsu Laboratories. He has published over 130 articles in the
areas of record linkage and data mining, including in 2012 the
monograph “Data Matching” published by Springer. He is the principle
developer of the Febrl (Freely Extensible Biomedical Record Linkage)
open source data cleaning, deduplication and record linkage system.

Thilina Ranbaduge is a final year PhD student at the Australian National
University (ANU) Research School of Computer Science. He is working on
privacy-preserving record linkage techniques (PPRL) for multiple
databases. His main focus is to develop scalable techniques for efficient
and effective indexing/blocking of multiple database and his research
has been published in several top tier data mining conferences. Before
starting his PhD in 2014, he received his PG.Dip and BSc Honours from the
University of Moratuwa, Sri Lanka, in 2013 and 2009, respectively.

 

Share:

Contact Us

Mailing Address

University of Colombo School of Computing
UCSC Building Complex,
35 ,Reid Avenue, Colombo 7
SRI LANKA

T: +94 -11- 2581245/ 7

F: +94-11-2587239

E: info[at]ucsc.cmb.ac.lk

Subscribe to Our mailing list

Social Links


Subscribe on Youtube:

Like our Facebook:

TOP