Cheriton School of Computer Science
Language Skills
Language Read Write Speak Understand Peer Review
English Yes Yes Yes Yes Yes
Degrees
- 2004/8 Doctorate, Computer Science, Purdue University
Supervisors: Walid Aref and Ahmed Elmagarmid, 1999/9 - 2004/8 - 1999/8 Master's Thesis, Computer Science, Alexandria University
Supervisors: Nagwa Elmakky, 1997/5 - - 1995/6
Recognitions
Bachelor's, Computer Science, Alexandria University
2014/9 - 2015/9 Google Faculty Award - 45,000 Google
Prize / Award
Faculty research Award
2014/9 ACM Distinguished Scientist - 0 Association for Computing Machinery Distinction
Recognizes distinguished research members of the ACM 2014/4 - 2017/4 NSERC Accelerator Supplement - 120,000
National Research Council Canada Prize / Award
40000 each year for three years
2013/7 - 2016/7 David R. Cheriton Faculty Fellowship - 15,000 University of Waterloo
Honor
to recognize a faculty member whose scholarly work has gained national or
international attention, whose teaching ability is exceptional, and who has displayed a high level of commitment and dedication to her/his department, the Faculty of
Mathematics, and the University
User Profile
Research Specialization Keywords: Databases, Data Cleaning, Data Quality, Information Retrieval, Ranking, Uncertain Data
Professor Ihab Ilyas
111
Employment
2014/7 Professor
Computer Science, Faculty of Math, University of Waterloo Full-time, Term, Professor
Tenure Status: Tenure 2013/1 Advisor and Co-Founder
Data Tamer
Co-founded the company with my former student, George Beskales and Prof. Mike Stonebraker from MIT and a number of other colleagues. Currently serving as an advisor on the technical platform
2008/11 Consultant - VP Research Primal Fusion
Primal is a startup in the Waterloo area. I am Leading the research team on transferring the technical vision to product. Working mainly on using ontologies and semantic technologies to create scalable user models.
2009/7 - 2014/6 Associate Professor
Computer Science, University of Waterloo Full-time, Term, Associate Professor Tenure Status: Tenure
2011/6 - 2013/9 Principal Scientist
Qatar Computing Research Institute
Founded and leading the Data Analytics Center. Hired a team of 5 scientists and 3 software engineers to conduct research relevant to the state of Qatar and having global impact.
2004/7 - 2009/7 Assistant Professor
Computer Science, University of Waterloo Full-time, Term, Assistant Professor Tenure Status: Tenure Track
2003/5 - 2003/9 Research Intern
IBM Almaden Research Center
Working with the database group at Almaden on database research focusing on robust query processing and optimization.
2002/5 - 2002/9 Research Intern
IBM Almaden Research Center
Working with the database group at Almaden on database research focusing on robust query processing and optimization.
Professor Ihab Ilyas
112
Leaves of Absence and Impact on Research
2011/9 - 2013/8 Unpaid, Qatar Computing Research Institute
Built and led a world-class database group at the newly founded Qatar Computing
Research Institute (QCRI). The experience had many positive effects on my research and leadership career: (1) The opportunity to start a group from scratch focusing on recruiting top researchers from all over the world; (2) Starting high-impact research projects in the field on data analytics, focusing on data quality; (3) Hiring my graduate students from Waterloo as research interns gave them a unique experience in working with various top researchers on projects that became integral part of their thesis research; (4)
Commercializing and licensing our home grown research and founding a startup (Data Tamer) with collaborators from MIT as a strong example of technology transfer; and (5) Dealing with national-level challenges and stakeholders to solve large scale problem. We published multiple papers in top conferences, filed more than 10 patents, and released open-source software.
2011/9 - 2011/8 Sabbatical, University of Waterloo, Primal Fusion and QCRI
Interacted with multiple colleagues from various organizations from industry, research labs and academia. I kept a close relationship with my students; we started investigating multiple new research directions focusing mainly on data quality. The interaction with industry gave us an appreciation of real world problems and helped us address more impactful issues in the field of data analytics and data cleaning.
Research Funding History
Awarded [n=7] 2015/9 - 2020/9 Principal Investigator
Schema-Later Analytics with Proactive Cleaning, Grant Funding Sources:
Thomson Reuters Faculty Grant
Total Funding - 120,000
Portion of Funding Received - 120,000 Funding Competitive?: No
2017/1 - 2020/1 Principal Applicant
Distributed Data Profiling, Grant Funding Sources:
Huawei Faculty Grant
Total Funding - 250,000
Portion of Funding Received - 250,000 Funding Competitive?: Yes
2014/4 - 2019/4 Principal Applicant
Cleaning and Analysis of Large Uncertain and Inconsistent Data Sources, Grant Funding Sources:
Natural Sciences and Engineering Research Council of Canada (NSERC) DISCOVERY
Total Funding - 380,000
Portion of Funding Received - 380,000 Funding Competitive?: Yes
2014/4 - 2017/4 Principal Investigator
Cleaning and Analysis of Large Uncertain and Inconsistent Data Sources, Grant Funding Sources:
Natural Sciences and Engineering Research Council of Canada (NSERC) Research Accelerator Supplement
Professor Ihab Ilyas
113 Total Funding - 120,000
Portion of Funding Received - 120,000 Funding Competitive?: Yes
2014/9 - 2015/9 Information Extraction for Ling Tail Entities, Grant Principal Investigator
Funding Sources: Google
Faculty Award
Total Funding - 50,000
Portion of Funding Received - 50,000 Funding Competitive?: Yes
2010/4 - 2012/4 Principal Applicant
NSERC DISCOVERY: Probabilistic Retrieval and Cleaning of Large Uncertain and Inconsistent Databases, Grant
Funding Sources:
Natural Sciences and Engineering Research Council of Canada (NSERC) DISCOVERY
Total Funding - 215,000
Portion of Funding Received - 86,000 Funding Competitive?: Yes
2011/1 - 2011/12 Co- applicant
Completed [n=2]
NSERC Business Intelligence Strategic Network (BIN): Modelling Uncertainty in record De-duplication, Grant
Funding Sources:
Natural Sciences and Engineering Research Council of Canada (NSERC) NSERC Business Intelligence Strategic Network
Total Funding - 52,000
Portion of Funding Received - 26,000 Funding Competitive?: Yes
Co-applicant : Tamer Ozsu
2012/1 - 2013/12 Co-applicant
NSERC Business Intelligence Strategic Network (BIN): Modelling and Querying Uncertainty in Record De-duplication and Repairing FD Violations, Grant Funding Sources:
Natural Sciences and Engineering Research Council of Canada (NSERC) Business Intelligence Strategic Network
Total Funding - 62,000
Portion of Funding Received - 31,000 Funding Competitive?: Yes
Professor Ihab Ilyas
114 2008/8 - 2013/8
Principal Applicant
Early Researcher Award, Grant Funding Sources:
University of Waterloo ERA - Matching
Total Funding - 50,000
Portion of Funding Received - 50,000 Funding Competitive?: Yes
Government of Ontario (Ottawa, ON) Early Researcher Award
Total Funding - 100,000
Portion of Funding Received - 100,000 Funding Competitive?: Yes
Student/Postdoctoral Supervision
Bachelor’s [n=2] 2018/1 - 2018/5 Principal Supervisor
Jiexuan Zheng (In Progress) , University of Waterloo Student Degree Expected Date: 2018/5
Thesis/Project Title: NSERC - URSA: HoloClean System Implemntation Present Position: undergrad
2017/9 - 2017/12 Principal Supervisor
Jiexuan Zheng (Completed) , University of Waterloo
Thesis/Project Title: HoloClean: System development and deployment Present Position: undergrad
Master’s non-Thesis [n=2] 2015/1 - 2015/11
Principal Supervisor
Hella Hoffmann (Completed) , University of Waterloo
Thesis/Project Title: Holistic Cleaning of Heterogeneous Datasets usingConditional Denial Constraints - Master Research Paper
Present Position: Thomson Reuters 2011/9 - 2015/12
Principal Supervisor
John Morcos (Completed) , University of Waterloo
Thesis/Project Title: Crowd Sourcing for Data Cleaning Tasks - Master Research Paper Present Position: Software Engineer, Microsoft
Master’s Thesis [n=6] 2016/9 - 2018/1
Principal Supervisor
Anam Shadab (Completed) , University of Waterloo
Thesis/Project Title: Data Driven Techniques for Schema Discovery in RDF Present Position: Software Engineer
2016/1 - 2019/1 Principal Supervisor
Ahmed Aljimai (In Progress) , University of Waterloo Thesis/Project Title: Distributed Profiling
Present Position: grad student 2014/9 - 2018/6
Co-Supervisor
Shichao Jin (In Progress) , University of Waterloo
Thesis/Project Title: Hybrid Column-Row Stores for Mixed Workloads Present Position: Grad student
2010/9 - 2013/9 Principal Supervisor
Artur Galiullin (Completed) , University of Waterloo
Thesis/Project Title: Probabilistic Query Answering on Dirty Databases Present Position: Facebook
Professor Ihab Ilyas
115 2009/9 - 2012/3
Principal Supervisor
Mina Farid (Completed) , University of Waterloo
Thesis/Project Title: Query Optimization for On-Demand Information Extraction Tasks over Text Databases
Present Position: Phd Student 2008/9 - 2012/9
Principal Supervisor
Mina Saleeb (Completed) , University of Waterloo
Thesis/Project Title: Ad-hoc Holistic Ranking Aggregation Present Position: Oracle, USA
Doctorate [n=12] 2017/9 - 2022/9 Principal Supervisor
Georgios Michalopoulos (In Progress) , University of Waterloo Thesis/Project Title: Knowledge Fusion using HoloClean Present Position: Phd Student
2017/9 - 2022/9 Principal Supervisor
Michael Azmy (In Progress) , University of Waterloo Student Degree Expected Date: 2022/9
Thesis/Project Title: the DSTLR Project Present Position: Phd Student
2017/1 - 2022/1 Principal Supervisor
Alireza Heydari (In Progress) , University of Waterloo
Thesis/Project Title: Sampling in HoloClean: Sampling from Dirty Data Present Position: Phd Student
2016/9 - 2021/12 Principal Supervisor
Chang Ge (In Progress) , University of Waterloo Student Degree Expected Date: 2021/12
Thesis/Project Title: Cleaning Private Data Sets Present Position: University of Waterloo
2015/9 - 2020/9 Principal Supervisor
Hemant Saxena (In Progress) , University of Waterloo Student Degree Expected Date: 2020/9
Thesis/Project Title: Scalable Data Profiling Present Position: PhD Student
2014/9 - 2019/9 Principal Supervisor
Jian Li (In Progress) , University of Waterloo Student Degree Expected Date: 2019/9 Thesis/Project Title: Profiling RDF Data Present Position: PhD Student
2014/1 - 2019/12 Principal Supervisor
Mina Farid (In Progress) , University of Waterloo Student Degree Expected Date: 2019/12
Thesis/Project Title: DSTLR: Profiling, Cleaning and Efficient Processing of Dirty RDF Data
Present Position: PhD Student 2011/9 - 2017/9
Principal Supervisor
Xu Chu (Completed) , University of Waterloo
Thesis/Project Title: Scalable and Holistic Qualitative Data Cleaning Present Position: Assistant Professor, Georgia Institute of Technology 2009/9 - 2013/8
Co-Supervisor
Jeffrey Pound (Completed) , University of Waterloo
Thesis/Project Title: Interpreting and Answering Keyword Queries using Web Knowledge Bases
Present Position: SAP - Canada 2009/9 - 2019/12
Principal Supervisor
Anup Kumar Chalamalla (In Progress) , University of Waterloo Student Degree Expected Date: 2019/12
Thesis/Project Title: Discovering Interaction Patterns in Heterogeneous Networks Present Position: Software Engineer
Professor Ihab Ilyas
116 2005/9 - 2012/9
Principal Supervisor
George Besklaes (Completed) , University of Waterloo
Thesis/Project Title: Modeling and Querying Uncertainty in Data Cleaning Present Position: Data Tamer, Boston, USA
2005/1 - 2012/5 Principal Supervisor
Amr El-Helw (Completed) , University of Waterloo
Thesis/Project Title: Query Optimization in Dynamic Environments Present Position: Greenplum, USA
Post-doctorate [n=2] 2016/1 - 2017/6 Principal Supervisor
Lanjun Wang (Completed) , University of Waterloo
Thesis/Project Title: Cleaning Machine Generated Data Streams Present Position: Research Staff, Huawei
2015/1 - 2016/12 Principal Supervisor
Alexandra Roatis (Completed) , University of Waterloo Thesis/Project Title: Cleaning RDF data sets
Present Position: N/A
Event Administration
2016/1 - 2021/1 VLDB Board of Trustees Member, The VLDB Endownment, Association, 2016/1 - 2021/1 2017/1 - 2020/1 SIGMOD Vice President, ACM SIGMOD, Association, 2017/1 - 2020/1
2018/7 - 2019/2 Core PC Member (Group Leader), The ACM SIGMOD International Conference on Management of Data, SIGMOD 2019, Conference, 2019/6 - 2019/6
2017/4 - 2018/3 PC Member, The International Conference on Very Large Databases 2018, Conference, 2018/8 - 2018/8
2017/10 - 2018/2 Workshop Co-Chair, The ACM SIGMOD International Conference on Management of Data, SIGMOD 2018, Conference, 2018/6 - 2018/6
2017/10 - 2018/2 Tutorial Co-Chair, The ACM SIGMOD International Conference on Management of Data, SIGMOD 2018, Conference, 2018/6 - 2018/6
2017/5 - 2018/2 TKDE Poster Track Co-Chair, IEEE International Conference on Data Engineering, ICDE 2018, Conference, 2018/4 - 2018/4
2016/4 - 2017/3 PC Member, The International Conference on Very Large Databases 2017, Conference, 2017/8 - 2017/8
2016/7 - 2017/2 PC Member, The ACM SIGMOD International Conference on Management of Data, SIGMOD 2017, Conference, 2017/6 - 2017/6
2015/4 - 2016/3 Associate Editor, The International Conference on Very Large Databases 2016, Conference, 2016/9 - 2016/9
2015/7 - 2016/2 PC Group Leader, The ACM SIGMOD International Conference on Management of Data, SIGMOD 2016, Conference, 2016/6 - 2016/6
2015/6 - 2016/2 PC Member, IEEE International Conference on Data Engineering, ICDE 2016, Conference, 2016/4 - 2016/4
2014/5 - 2015/2 PC Member, IEEE International Conference on Data Engineering, ICDE 2015, Conference, 2015/4 - 2015/4
2013/7 - 2014/9 PC Member, The International Conference on Very Large Databases 2014, Conference, 2014/9 - 2014/9
2013/7 - 2014/6 PC Member, The ACM SIGMOD International Conference on Management of Data, SIGMOD 2014, Conference, 2014/6 - 2014/6
Professor Ihab Ilyas
117
2013/3 - 2014/3 PC Member, he International Conference on Extending Database Technology, EDBT 2014, Conference, 2014/3 - 2014/3
2013/3 - 2013/8 Panel Track Co -Chair, The International Conference on Very Large Databases, VLDB 2013, Conference, 2013/8 - 2013/8
2006/8 - 2013/8 Proposer and Steering Committee, DBRank Workshops yearly from 2006 to 2013, Workshop, 2006/8 - 2013/8
2012/11 - 2013/6 PC Area Chair, The ACM SIGMOD International Conference on Management of Data, SIGMOD 2013, Conference, 2013/6 - 2013/6
2012/3 - 2013/4 PC Member, IEEE International Conference on Data Engineering, ICDE 2013, Conference, 2013/4 - 2013/4
2011/3 - 2012/8 PC Member, The International Conference on Very Large Databases 2012, Conference, 2012/8 - 2012/8
2010/4 - 2011/9 PC Member, The International Conference on Very Large Databases 2011, Conference, 2011/8 - 2011/9
2010/10 - 2011/6 PC Member, The ACM SIGMOD International Conference on Management of Data, SIGMOD 2011, Conference, 2011/6 - 2011/6
2010/1 - 2011/4 PC Member, IEEE International Conference on Data Engineering, ICDE 2011, Conference, 2011/4 - 2011/4
Editorial Activities
2014/1 - 2019/1 Associate Editor, Foundations and Trends in Databases (FnTDB)., Journal 2014/1 - 2019/1 Associate Editor, ACM Transactions on Database Systems (TODS), Journal 2013/9 - 2014/9 Area Editor, Encyclopedia of Database Systems, Book