Ph.D Symposium Program

8.30AM – 9.40AM: Panel (Industry vs Academia: Avenues ahead and turning points)


Stratos Idreos is an associate professor of Computer Science at Harvard University where he leads the Data Systems Laboratory. His research is towards creating a grammar for data systems, making it easy and even automatic to design workload and hardware conscious data structures and data systems with applications on relational, NoSQL, and data science problems. For his PhD thesis on adaptive indexing, Stratos was awarded the 2011 ACM SIGMOD Jim Gray Doctoral Dissertation award and the 2011 ERCIM Cor Baayen award from the European Research Council on Informatics and Mathematics. In 2015 he was awarded the IEEE TCDE Rising Star Award from the IEEE Technical Committee on Data Engineering for his work on adaptive data systems and in 2020 he received the ACM SIGMOD Contributions award for his work on reproducible research. Stratos is also a recipient of the National Science Foundation Career award and the Department of Energy Early Career award.

Dr. Yuanyuan Tian is a Principal Research Staff Member at IBM Almaden Research Center, and an ACM Distinguished Member. She obtained her Ph.D. in computer science from the University of Michigan. Her research interests include HTAP, SQL-on-Hadoop, big data federation, graph analytics platforms, and large-scale systems for machine learning. She has published two books and over 40 articles in top database venues with 4300+ citations. Dr. Tian has served in the editorial board for the new encyclopedia for Big Data, as an Associate Editor for VLDB Journal and PVLDB, and chaired various tracks in top database conferences. She has also served in several NSF panels. She is the recipient of the SIGMOD 2019 Research Highlight Award, EDBT 2018 Best Paper Award, the Outstanding Technical Achievement Award from IBM in 2020, 2019 and 2016, the Research Division Award from IBM in 2019, and the Distinguished Academic Achievement Award from the University of Michigan in 2008.

Xin Luna Dong is a Senior Principal Scientist at Amazon, leading the efforts of constructing Amazon Product Knowledge Graph. She was one of the major contributors to the Google Knowledge Vault project, and has led the Knowledge-based Trust project, which is called the “Google Truth Machine” by Washington’s Post. She has co-authored book “Big Data Integration”, was awarded ACM Distinguished Member, and VLDB Early Career Research Contribution Award for “advancing the state of the art of knowledge fusion”. She serves in VLDB endowment and PVLDB advisory committee, and is a PC co-chair for WSDM’2022, VLDB’2021, KDD’2020 ADS Invited Talk Series.

Ihab Ilyas is a professor in the Cheriton School of Computer Science and the NSERC-Thomson Reuters Research Chair on data quality at the University of Waterloo. His main research focuses on the areas of Data Science and data management , with special interest in data quality and integration, managing uncertain data, machine learning for data curation, and information extraction. Ihab is a co-founder of Tamr, a startup focusing on large-scale data integration, and the co-founder of inductiv (acquired by Apple), a Waterloo-based startup on using AI for structured data cleaning. He is an ACM Fellow, and a recipient of the Ontario Early Researcher Award, a Cheriton Faculty Fellowship, an NSERC Discovery Accelerator Award, and a Google Faculty Award. Ihab is an elected member of the VLDB Endowment board of trustees (2016-2021), elected SIGMOD vice chair (2016-2021), an associate editor of the ACM Transactions of Database Systems (2014-2020), and an associate editor of Foundations of Database Systems. He holds a PhD in computer science from Purdue University, West Lafayette.

Volker Markl is a German Professor of Computer Science. He leads the Chair of Database Systems and Information Management at TU Berlin and the Intelligent Analytics for Massive Data Research Department at DFKI. In addition, he is Director of the Berlin Institute for the Foundations of Learning and Data (BIFOLD). He is a database systems researcher, conducting research at the intersection of distributed systems, scalable data processing, and machine learning. Volker led the Stratosphere project, which resulted in the creation of Apache Flink. Volker has received numerous honors and prestigious awards, including best paper awards at ACM SIGMOD, VLDB, and EDBT. In 2014, he was elected one of Germany‘s leading “Digital Minds“ (Digitale Köpfe) by the German Informatics Society. He was elected an ACM Fellow for his contributions to query optimization, scalable data processing, and data programmability. He is currently President of the VLDB Endowment, and serves as advisor to academic institutions, governmental organizations, and technology companies. Volker holds eighteen patents and has been co-founder and mentor to several startups

Pınar Tözün is an Associate Professor at IT University of Copenhagen. Before ITU, she was a research staff member at IBM Almaden Research Center. Prior to joining IBM, she received her PhD from EPFL. Her thesis received ACM SIGMOD Jim Gray Doctoral Dissertation Award Honorable Mention in 2016. Her research focuses on performance characterization of data-intensive workloads, scalability and efficiency of data-intensive systems on modern processors and storage, and resource-aware machine learning.

09.40AM – 09.50AM: Break

09.50AM – 11.20AM: Talks Session 1

Edge Sparsification for Graphs via Meta-LearningGuihong Wan (The University of Texas at Dallas)
Monilog: An automated log-based anomaly detection system for cloud computing infrastructuresArthur Vervaet (3DS OUTSCALE)
Graph Based Approach to Real-Time Metro Passenger Flow Anomaly DetectionWeiqi ZHANG (Hong Kong University of Science and Technology)
Combining Anatomical Constraints and Deep learning for 3-D CBCT Dental Image Multi-label SegmentationJiayu Huang (Arizona State University)

11.20AM – 11.30AM: Break

11.30AM – 12.30PM: Talks Session 2

Tensor Topic Models with Graphs and Applications on Individualized Travel PatternsZIYUE LI (The Hong Kong University of Science and Technology)
BERT-based Dynamic clustering of Subway Stations Based on Flow InformationMan LI (HKUST)