Paper ID | Paper Title | Authors |
Tuesday 20th April 2021 |
Data Integration and Cleaning 1 (Tuesday 20th April 2021 / 08.30-10.00) TRACK 1 |
691 | Profiles of Schema Evolution in Free Open Source Software Projects | Panos Vassiliadis (University of Ioannina)* |
181 | CleanML: A Study for Evaluating the Impact of Data Cleaning on ML Classification Tasks | Peng Li (GATECH)*; Xi Rao (ETH); Jeffinifer Blase (GATECH); Yue Zhang (GATECH); Xu Chu (GATECH); Ce Zhang (ETH) |
204 | Approximate Order Dependency Discovery | Yifeng Jin (Fudan University); Zijing Tan (Fudan University)*; Weijun Zeng (Fudan University); Shuai Ma (Beihang University) |
875 | DBSCOUT: A density-based method for scalable outlier detection in very large datasets | Matteo Corain (Politecnico di Torino)*; Paolo Garza (Politecnico di Torino); Abolfazl Asudeh (University of Illinois at Chicago) |
915 | Bootstrapping Information Extraction via Conceptualization | Jiaqing Liang (Fudan University)*; Suo Feng (Fudan University); Chenhao Xie (Fudan University); Yanghua Xiao (Fudan University); Jindong Chen (Fudan University); Seungwon Hwang (Yonsei University) |
606 | Capturing Semantics for Imputation with Pre-trained Language Models | Yinan Mei (Tsinghua University); Shaoxu Song (Tsinghua University)*; Chenguang Fang (Tsinghua University); Haifeng Yang (Huawei Technologies Co., Ltd.); Jingyun Fang (Huawei Technologies Co., Ltd.); Jiang Long (Huawei Technologies Co., Ltd.) |
Graph Data Management 1 (Tuesday 20th April 2021 / 08.30-10.00) TRACK 2 |
354 | Manipulating Black-Box Networks for Centrality Promotion | Wentao Li (University of Technology Sydney); Min Gao (Chongqing University)*; Fu Wu (Chongqing University); Wenge Rong (Beihang University); Junhao Wen (Chongqing University); Lu Qin (UTS) |
29 | Efficient and Effective Community Search on Large-scale Bipartite Graphs | Kai Wang (University of New South Wales)*; Wenjie Zhang (University of New South Wales); Xuemin Lin (University of New South Wales); Ying Zhang (University of Technology Sydney); Lu Qin (UTS); Yuting Zhang (University of New South Wales) |
30 | Efficient Community Search with Size Constraint | BOGE LIU (University of New South Wales); Fan Zhang (Guangzhou University)*; Wenjie Zhang (University of New South Wales); Xuemin Lin (University of New South Wales); Ying Zhang (University of Technology Sydney) |
471 | Multi-attributed Community Search in Road-social Networks | Fangda Guo (Northeastern University)*; Ye Yuan (Beijing Institute of Technology); Guoren Wang (Beijing Institute of Technology); Xiangguo Zhao (Northeastern University); Hao Sun (Northeastern University) |
499 | Peer Learning Through Targeted Dynamic Groups Formation | Dong Wei (NJIT); Ioannis Koutis (NJIT); Senjuti Basu Roy (NJIT)* |
208 | Efficient 2-Hop Labeling Maintenance in Dynamic Small-World Networks | Mengxuan Zhang (The University of Queensland)*; Lei Li (University of Queensland); Wen Hua (The University of Queensland); Xiaofang Zhou (The Hong Kong University of Science and Technology) |
Data Privacy (Tuesday 20th April 2021 / 08.30-10.00) TRACK 3 |
52 | Differentially Private Publication of Multi-Party Sequential Data | Peng Tang (Shandong University)*; Rui Chen (Samsung Research America); Sen Su (Beijing University of Posts and Telecommunications); Shanqing GUO (Shandong University); Lei Ju (Shandong University); Gaoyuan Liu (Shandong University) |
65 | Secure Dynamic Skyline Queries Using Result Materialization | Sepanta Zeighami (University of Southern California)*; Gabriel Ghinita (Univ. of Massachusetts Boston); Cyrus Shahabi (Computer Science Department. University of Southern California) |
264 | P3GM: Private High-Dimensional Data Release via Privacy Preserving Phased Generative Model | Shun Takagi (Kyoto University); Tsubasa Takahashi (LINE Corporation)*; Yang Cao (Kyoto University); Masatoshi Yoshikawa (Kyoto University) |
564 | Feature Inference Attack on Model Predictions in Vertical Federated Learning | Xinjian Luo (National University of Singapore); Yuncheng Wu (National University of Singapore)*; Xiaokui Xiao (National University of Singapore); Beng Chin Ooi (NUS) |
694 | Enabling Efficient Cyber Threat Hunting With Cyber Threat Intelligence | Peng Gao (University of California, Berkeley)*; Fei Shao (Case Western Reserve University); Xiaoyuan Liu (University of California, Berkeley); Xusheng Xiao (Case Western Reserve University); Zheng Qin (Nanjing University); Fengyuan Xu (Nanjing University); Prateek Mittal (Princeton University); Sanjeev Kulkarni (Princeton University); Dawn Song (UC Berkeley) |
844 | TWINE: An Embedded Trusted Runtime for WebAssembly | Jämes Ménétrey (University of Neuchâtel)*; Marcelo Pasin (University of Neuchatel); Pascal Felber (University of Neuchatel); Valerio Schiavoni (University of Neuchatel) |
Crowdsourcing (Tuesday 20th April 2021 / 10.10-11.40) TRACK 1 |
39 | Modeling Citywide Crowd Flows using Attentive Convolutional LSTM | Chi Harold Liu (Beijing Institute of Technology)*; Chengzhe Piao (Beijing Institute of Technology ); Xiaoxin Ma (Beijing Institute of Technology ); Ye Yuan ( Beijing Institute of Technology); Jian Tang (Syracuse University); Guoren Wang (Beijing Institute of Technology); Kin K. Leung (Imperial College ) |
236 | A Privacy-enhanced and Personalized Safe Route Planner with Crowdsourced Data and Computation | Fariha Tabassum Islam (Bangladesh University of Engineering and Technology)*; Tanzima Hashem (Bangladesh University of Engineering and Technology); Dr. Rifat Shahriyar (BUET) |
250 | Coalition-based Task Assignment in Spatial Crowdsourcing | Yan Zhao (Department of Computer Science, Aalborg University)*; Jiannan Guo (China Mobile Cloud Centre); Xuanhao Chen (University of Electronic Science and Technology of China); Jianye Hao (Tianjin University); Xiaofang Zhou (The Hong Kong University of Science and Technology); Kai Zeng (University of Electronic Science and Technology of China) |
461 | Crowdsensing Data Trading based on Combinatorial Multi-Armed Bandit and Stackelberg Game | Baoyi An (University Of Science And Technology Of China); Mingjun Xiao (University of Science and Technology of China)*; An Liu (Soochow University); Xike Xie (University of Science and Technology of China); Xiaofang Zhou (The Hong Kong University of Science and Technology) |
522 | Fairness-aware Task Assignment in Spatial Crowdsourcing: Game-Theoretic Approaches | Yan Zhao (Aalborg University)*; Kai Zeng (University of Electronic Science and Technology of China); Jiannan Guo (China Mobile Cloud Centre); Bin Yang (Aalborg University); Torben Bach Pedersen (Aalborg University); Christian S Jensen (Aalborg University) |
615 | A Human-in-the-loop Approach to Social Behavioral Targeting | Jingru Yang (Renmin University of China); Xiaoman Zhao (Renmin University of China); Ju Fan (Renmin University of China)*; Gong Chen (Tencent); Chong Peng (Tencent); Sheng Yao (2Tencent); Xiaoyong Du (Renmin University of China) |
952 | CrowdRL: An End-to-End Reinforcement Learning Framework for Data Labelling | Kaiyu Li (Tsinghua University); Guoliang Li (Tsinghua University)*; Yong Wang (Tsinghua University); Yan Huang (TAL Education Group); Zitao Liu (TAL AI Lab); Zhongqin Wu (Tomorrow Advancing Life) |
Spatial and Temporal Data Management 1 (Tuesday 20th April 2021 / 10.10-11.40) TRACK 2 |
167 | Rebuilding City-Wide Traffic Origin Destination from Road Speed Data | Guanjie Zheng (Shanghai Jiao Tong University)*; Chang Liu (Shanghai Jiao Tong University); Hua Wei (Penn State University); Chacha Chen (Pennsylvania State University); Zhenhui (Jessie) Li (Penn State University) |
60 | Constrained Route Planning over Large Multi-Modal Time-Dependent Networks | Yishu Wang (School of Computer Science and Engineering of Northeastern University)*; Ye Yuan (Beijing Institute of Technology); Hao Wang (KAUST); Xiangmin Zhou (RMIT University); Congcong Mu (Northeastern University); Guoren Wang (Beijing Institute of Technology) |
604 | Online Route Planning over Time-Dependent Road Networks | Di Chen (Northeastern University)*; Ye Yuan (Beijing Institute of Technology); Wenjin Du (Beijing Institute of Technology); Yurong Cheng (Beijing institute of technology); Guoren Wang (Beijing Institute of Technology) |
482 | Dynamic Hub Labelling for Road Networks | Mengxuan Zhang (The University of Queensland)*; Lei Li (University of Queensland); Wen Hua (The University of Queensland); Pingfu Chao (University of Queensland); Xiaofang Zhou (University of Queensland) |
136 | An Effective Joint Prediction Model for Travel Demands and Traffic Flows | Haitao Yuan (Tsinghua University); Guoliang Li (Tsinghua University)*; Zhifeng Bao (RMIT University); Ling Feng (Tsinghua university) |
224 | A Learning-based Method for Computing Shortest Path Distances on Road Networks | Shuai Huang (Tsinghua University); Yong Wang (Tsinghua University); Tianyu Zhao (Tsinghua University); Guoliang Li (Tsinghua University)* |
Distributed Data Management 1 (Tuesday 20th April 2021 / 10.10-11.40) TRACK 3 |
25 | Efficient Federated-Learning Model Debugging | Anran Li (University of Science and Technology of China); Lan Zhang (University of Science and Technology of China)*; Xiangyang Li (University of Science and Technology of China); Junhao Wang (University of Science and Technology of China); Juntao Tan (University of Science and Technology of China); Nikolaos Freris (University of Science and Technology of China); Feng Han (University of Science and Technology of China); Yaxuan Qin (University of Science and Technology of China) |
104 | Communication-efficient Decentralized Machine Learning over Heterogeneous Networks | Pan Zhou (University of Electronic Science and Technology of China); Qian Lin (ByteDance); Dumitrel Loghin (National University of Singapore); Beng Chin Ooi (NUS)*; Yuncheng Wu (National University of Singapore); Hongfang Yu (University of Electronic Science and Technology of China) |
258 | Spark-Based Cloud Data Analytics using Multi-objective Optimization | Fei Song (Ecole Polytechnique)*; Khaled Zaouk (Ecole Polytechnique); Chenghao Lyu (University of Massachusetts Amherst); Arnab Singha (Ecole Polytechnique); Qi Fan (Ecole Polytechnique); Yanlei Diao (University of Massachusetts Amherst); Prashant Shenoy (University of Massachusetts Amherst) |
305 | WedgeChain: A Trusted Edge-Cloud Store With Asynchronous (Lazy) Trust | Faisal Nawab (UC Santa Cruz)* |
306 | CooLSM: Distributed and Cooperative Indexing Across Edge and Cloud Machines | Natasha Mittal (UC Santa Cruz); Faisal Nawab (UC Santa Cruz)* |
402 | Interactive Analytic DBMSs: Breaching the Scalability Wall | Pedro Pedreira (Facebook Inc.)*; Amit Dutta (Facebook Inc.); Sergey Pershin (Facebook Inc.); Lin Liu (Facebook Inc.); Sushant Shringarpure (Facebook Inc.); Jialiang Tan (Facebook Inc.); Brian Landers (Facebook Inc.); Ge Gao (Facebook Inc.); Karen Pieper (Facebook Inc.) |
Wednesday 21st April 2021 |
Data Integration and Data Science (Wednesday 21th April 2021 / 08.00-09.30) TRACK 1 |
430 | Relational Header Discovery using Similarity Search in a Table Corpus | Hazar Harmouch (Hasso Plattner Institute)*; Thorsten Papenbrock (Hasso Plattner Institute); Felix Naumann (Hasso Plattner Institute) |
442 | Efficient Joinable Table Discovery in Data Lakes: A High-Dimensional Similarity-Based Approach | Yuyang Dong (NEC corporation)*; Kunihiro Takeoka (NEC Corporation); Chuan Xiao (Osaka University); Masafumi Oyamada (NEC) |
800 | Valentine: Evaluating Matching Techniques for Dataset Discovery | Christos Koutras (TU Delft)*; George Siachamis (TU Delft); Andra Ionescu (TU Delft); Kyriakos Psarakis (TU Delft); Jerry Brons (ING Bank Netherlands); Marios Fragkoulis (TU Delft); Christoph Lofi (TU Delft); Angela Bonifati (Univ. of Lyon); Asterios Katsifodimos (TU Delft) |
715 | Odess: Speeding up Resemblance Detection for Redundancy Elimination by Fast Content-Defined Sampling | Xiangyu Zou (Harbin Institute of Technology,Shenzhen)*; Deng Cai (Harbin Institute of Technology, Shenzhen); Wen Xia (Harbin Institute of Technology,Shenzhen); Philip Shilane (Dell Technologies); Haoliang Tan (Harbin Institute of Technology,Shenzhen); Haijun Zhang (Harbin Institute of Technology (Shenzhen)); Xuan Wang (Harbin Institute of Technology) |
464 | Latent Low-rank Graph Learning for Multimodal Clustering | Guo Zhong (University of Macau); Chi-Man Pun (University of Macau)* |
273 | Hate is the New Infodemic: A Topic-Aware Modeling of Hate Speech Diffusion on Twitter | Subhabrata Dutta (Jadavpur University); Sarah Masud (IIIT Delhi, India); Sakshi Makkar (IIIT Delhi); Amitava Das (IIIT Sri City, India); Chhavi Jain (IIIT Delhi, India); Vikram Goyal (“IIIT Delhi, India”); Tanmoy Chakraborty (Indraprastha Institute of Information Technology Delhi (IIIT-D),India )* |
Graph Data Management 2 (Wednesday 21th April 2021 / 08.00-09.30) TRACK 2 |
12 | UniNet: Scalable Network Representation Learning with Metropolis-Hastings Sampling | Xingyu Yao (BUPT); Yingxia Shao (BUPT)*; Bin Cui (Peking University); Lei Chen (HKUST) |
24 | Towards Efficient Motif-based Graph Partitioning: An Adaptive Sampling Approach | Shixun Huang (RMIT)*; Yuchen Li (Singapore Management University); Zhifeng Bao (RMIT University); Zhao Li (Alibaba Group) |
77 | LineageBA: A Fast, Exact and Scalable Graph Generation for the Barabasi-Albert Model | Himchan Park (KAIST); Min-Soo Kim (KAIST)* |
207 | Search to aggregate neighborhood for graph neural network | Huan Zhao (4Paradigm Inc.)*; Quanming Yao (4th Paradigm); Wei-Wei Tu (4Paradigm Inc.) |
333 | FastSGG: Efficient Social Graph Generation Using a Degree Distribution Generation Model | Binbin Wang (Tsinghua University); Chaokun Wang (Tsinghua University)*; Bingyang Huang (Tsinghua University); Shaoxu Song (Tsinghua University); Zai Li (Kuaishou Inc.) |
796 | Noah: Neural-optimized A* Search Algorithm for Graph Edit Distance Computation | Lei Yang (Peking University); Lei Zou (Peking University)* |
Indexing (Wednesday 21th April 2021 / 08.00-09.30) TRACK 3 |
155 | TS-Benchmark: A Benchmark for Time Series Databases | Yuanzhe Hao (Renmin University of China); Xiongpai Qin (Renmin University of China); Yueguo Chen (Renmin University of China)*; Yaru Li (Renmin University of China); Xiaoguang Sun (Renmin University of China); Xiaoyong Du (Renmin University of China) |
561 | DBA bandits: Self-driving index tuning under ad-hoc, analytical workloads with safety guarantees | Malinga Perera (University of Melbourne)*; Bastian Oetomo (University of Melbourne); Benjamin Rubinstein (University of Melbourne); Renata Borovica-Gajic (University of Melbourne) |
593 | Less is More: De-Amplifying I/Os for Key-Value Stores with a Log-Assisted LSM-Tree | Kecheng Huang (Shandong University)*; Zhiping Jia (Shandong University); Zhaoyan Shen (Shandong University); Zili Shao (The Chinese University of Hong Kong); Feng Chen (Louisiana State University) |
177 | Multidimensional Adaptive & Progressive Indexes | Matheus A Nerone (CWI)*; Pedro Holanda (CWI); Eduardo Cunha de Almeida (UFPR); Stefan Manegold (CWI) |
455 | Hash Adaptive Bloom Filter | Rongbiao Xie (State Key Laboratory for Novel Software Technology, Nanjing University)*; Meng Li (Nanjing University); Zheyu Miao (Alibaba Group); Rong Gu (Nanjing University); He Huang (Soochow University); Haipeng Dai (Nanjing University); Guihai Chen (Nanjing University) |
34 | HST+: An Efficient Index for Embedding Arbitrary Metric Spaces | Yuxiang Zeng (Hong Kong University of Science and Technology)*; Yongxin Tong (Beihang University); Lei Chen (Hong Kong University of Science and Technology) |
Spatial and Temporal Data Management 3 (Wednesday 21th April 2021 / 08.00-09.30) TRACK 4 |
786 | Flow Computation in Temporal Interaction Networks | Chrysanthi Kosyfaki (University of Ioannina); Nikos Mamoulis (University of Ioannina)*; Evaggelia Pitoura (Univ. of Ioannina); Panayiotis Tsaparas (University of Ioannina, Greece) |
649 | Leveraging Temporal and Topological Selectivities in Temporal-Clique Subgraph Query Processing | Kaijie Zhu (Eindhoven University of Technology)*; George Fletcher (Eindhoven University of Technology); Nikolay Yakovets (Eindhoven University of Technology) |
602 | Trajectory Simplification with Reinforcement Learning | ZHENG WANG (Nanyang Technological University)*; Cheng Long (Nanyang Technological University); Gao Cong (Nanyang Technological Univesity) |
635 | E^2DTC: An End to End Deep Trajectory Clustering Framework via Self-Training | Ziquan Fang (Zhejiang University); Yuntao Du (Zhejiang University); Lu Chen (Zhejiang University); Yujia Hu (Zhejiang University); Yunjun Gao (Zhejiang University)*; Gang Chen (Zhejiang University) |
743 | REPOSE: Distributed Top-k Trajectory Similarity Search with Local Reference Point Tries | Bolong Zheng (Huazhong University of Science and Technology)*; Lianggui Weng (Huazhong University of Science and Technology ); Xi Zhao (Huazhong University of Science and Technology ); Kai Zeng (Alibaba Group); Xiaofang Zhou (The Hong Kong University of Science and Technology); Christian S Jensen (Aalborg University) |
884 | Durable Top-K Instant-Stamped Temporal Records with User-Specified Scoring Functions | Junyang Gao (Duke University)*; Stavros Sintos (University of Chicago); Pankaj K Agarwal (Duke University); Jun Yang (Duke University) |
Data Management on New Hardware 1 (Wednesday 21th April 2021 / 09.40-11.10) TRACK 1 |
435 | The Case for In-Memory OLAP on “Wimpy” Nodes | Andrew Crotty (Brown University)*; Alex Galakatos (Brown University); Connor Luckett (Brown University); Ugur Cetintemel (Brown University) |
88 | DyCuckoo: Dynamic Hash Tables on GPUs | Yuchen Li (Singapore Management University)*; Qiwei Zhu (Zhejiang University); Zheng Lyu (Alibaba Group); Zhongdong Huang (Zhejiang University); Jianling Sun (Zhejiang University) |
134 | Programming an SSD Controller to Support Batched Writes for Variable-Size Pages | Jaeyoung Do (Microsoft Research)*; Chen Luo (Snowflake Inc.); David B Lomet (Microsoft Research) |
187 | Predict and Write: Using K-Means Clustering to Extend the Lifetime of NVM Storage | Saeed Kargar (UCSC)*; Heiner Litz (UC Santa Cruz ); Faisal Nawab (UC Santa Cruz) |
257 | Discriminative Admission Control for Shared-everything Database under Mixed OLTP Workloads | Donghui Wang (East China Normal University)*; Peng Cai (East China Normal University) |
447 | Efficiently Reclaiming Space in a Log Structured Store | David B Lomet (Microsoft Research)*; Chen Luo (Snowflake Inc.) |
Stream Data Management 1 (Wednesday 21th April 2021 / 09.40-11.10) TRACK 2 |
490 | LogLog Filter: Filtering Cold Items within a Large Range over High Speed Data Streams | Peng Jia (Xi’an Jiaotong University)*; Pinghui Wang (Xi’an Jiaotong University); Junzhou Zhao (Xi’an Jiaotong University); Ye Yuan ( Beijing Institute of Technology); Jing Tao (Xi’an Jiaotong University); Xiaohong Guan (Xi’an Jiaotong University) |
508 | SliceNStitch: Continuous CP Decomposition of Sparse Tensor Streams | Taehyung Kwon (KAIST); Inkyu Park (KAIST); Dongjin Lee (Korea Advanced Institute of Science and Technology); Kijung Shin (KAIST)* |
579 | DISC: Density-Based Incremental Clustering by Striding over Streaming Data | Bogyeong Kim (Seoul National University); Kyoseung Koo (Seoul National University); Juhun Kim (Seoul National University); Bongki Moon (Seoul National University)* |
902 | Robust Factorization of Real-world Tensor Streams with Patterns, Missing Values and Outliers | Dongjin Lee (Korea Advanced Institute of Science and Technology); Kijung Shin (KAIST)* |
286 | Single Point Incremental Fourier Transform on 2D Data Streams | Muhammad Saad (Univeristy of Zurich)*; Abraham Bernstein (University of Zurich); Michael H Böhlen (University of Zurich); Daniele Dell’Aglio (Universität Zürich) |
449 | SALSA: Self-Adjusting Lean Streaming Analytics | Ran Ben Basat (Harvard University)*; gil Einziger (Nokia Bell Labs); Michael Mitzenmacher (Harvard); Shay Vargaftik (VMware) |
Knowledge Discovery (Wednesday 21th April 2021 / 09.40-11.10) TRACK 3 |
16 | NewsLink: Empowering Intuitive News Search with Knowledge Graphs | Yueji Yang (National University of Singapore)*; Yuchen Li (Singapore Management University); Anthony Tung (NUS) |
53 | On Disambiguating Authors: Collaboration Network Reconstruction in a Bottom-up Manner | Na Li (East China Normal University)*; Renyu Zhu (East China Normal University); Xiaoxu Zhou (East China Normal University); Xiangnan He (University of Science and Technology of China); Ming Gao (East China Normal University); Aoying Zhou (East China Normal University ) |
254 | A Bootstrapping Approach to Optimize Random Walk Based Statistical Estimation over Graphs | Pei Yi (Chongqing University); Hong Xie (College of Computer Science, Chongqing University)*; Yongkun Li (University of Science and Technology of China); John C. S. Lui (The Chinese University of Hong Kong) |
630 | Leveraging Meta-path Contexts for Classification in Heterogeneous Information Networks | Xiang Li (East China Normal University)*; Danhao Ding (The University of Hong Kong); Ben Kao (University of Hong Kong); Yizhou Sun (UCLA); Nikos Mamoulis (University of Ioannina) |
188 | Property Graph Schema Optimization for Domain-Specific Knowledge Graphs | Rana Alotaibi (University of California, San Diego); Chuan Lei (IBM Research – Almaden)*; Abdul H Quamar (IBM Research Almaden); Vasilis Efthymiou (FORTH-ICS); Fatma Ozcan (Google) |
637 | Fast Core-based Top-k Frequent Pattern Discovery in Knowledge Graphs | Jian Zeng (Southern University of Science and Technology); Leong Hou U (University of Macau); Xiao Yan (Southern University of Science and Technology); Mingji Han (Southern University of Science and Technology); Bo Tang (Southern University of Science and Technology)* |
Query Processing and Optimization 1 (Wednesday 21th April 2021 / 09.40-11.10) TRACK 4 |
843 | The Logarithmic Dynamic Cuckoo Filter | Fan Zhang (Huazhong University of Science and Technology); Hanhua Chen (Huazhong University of Science and Technology)*; Hai Jin (Huazhong University of Science and Technology); Pedro Reviriego (Universidad Carlos III de Madrid) |
340 | Continuously Bulk Loading over Range Partitioned Tables for Large Scale Historical Data | XiaoLong He (East China Normal University)*; Peng Cai (East China Normal University); Xuan Zhou (East China Normal University); Aoying Zhou (East China Normal University ) |
113 | Eclipse: Generalizing kNN and Skyline | Jinfei Liu (Emory University/Georgia Institute of Technology)*; Li Xiong (Emory University); Jian Pei (Simon Fraser University); Jun Luo (CAS); Qiuchen Zhang (Emory University) |
115 | Memory-Efficient Key/Foreign-Key Join Size Estimation via Multiplicity and Intersection Size | Magnus Mueller (University of Mannheim)*; Daniel Flachs (University of Mannheim); Guido Moerkotte (University of Mannheim) |
144 | Authenticated Keyword Search in Scalable Hybrid-Storage Blockchains | Ce Zhang (Hong Kong Baptist University); Cheng Xu (Hong Kong Baptist University); Haixin Wang (HKBU); Jianliang Xu (Hong Kong Baptist University)*; Byron Choi (Hong Kong Baptist University) |
163 | NestGPU: Nested Query Processing on GPU | Sofoklis Floratos (The Ohio State University)*; Mengbai Xiao (Shandong University); Hao Wang (the Ohio State University); Yuan Yuan (Google); Chengxin Guo (Renmin University of China); Rubao Lee (The Ohio State University); Xiaodong Zhang (Ohio State U.) |
Data Management on New Hardware 2 (Wednesday 21th April 2021 / 11.20-12.50) TRACK 1 |
469 | Aria: Tolerating Skewed Workloads in Secure In-memory Key-Value Store | Fan Yang (Tsinghua University)*; Youyou Lu (luyouyou@tsinghua.edu.cn); Youmin Chen (Tsinghua University); Qing Wang (Tsinghua University); Jiwu Shu (shujw@tsinghua.edu.cn) |
638 | CruiseDB: An LSM-Tree Key-Value Store with Both Better Tail Throughput and Tail Latency | Junkai Liang (Renmin University of China); yunpeng chai (renmin university of china)* |
702 | FPGA for Aggregate Processing: The Good, The Bad, and The Ugly | Zubeyr Furkan Eryilmaz (University of Wisconsin-Madison)*; Aarati Kakaraparthy (University of Wisconsin, Madison); Jignesh Patel (UW – Madison); Rathijit Sen (Microsoft); Kwanghyun Park (Microsoft Gray Systems Lab) |
Stream Data Management 2 (Wednesday 21th April 2021 / 11.20-12.50) TRACK 2 |
753 | Fingerprinting Concepts in Data Streams with Supervised and Unsupervised Meta-Information | Benjamin S Halstead (The University of Auckland)*; Yun Sing Koh (The University of Auckland, New Zealand); Patricia Riddle (University of Auckland, New Zealand); Mykola Pechenizkiy (TU Eindhoven); Albert Bifet (University of Waikato); Russel Pears (Auckland University of Technology) |
929 | Concept Drift Detection from Multi-Class Imbalanced Data Streams | Lukasz Korycki (Virginia Commonwealth University); Bartosz Krawczyk (Virginia Commonwealth University)* |
195 | DisMASTD: An Efficient Distributed Multi-Aspect Streaming Tensor Decomposition | Keyu Yang (Zhejiang University); Yunjun Gao (Zhejiang University)*; Yifeng Shen (Zhejiang University); Baihua Zheng (Singapore Management University); Lu Chen (Zhejiang University) |
Stream Data Management (Wednesday 21th April 2021 / 11.20-12.50) TRACK 3 |
19 | EDGE: Entity-Diffusion Gaussian Ensemble for Interpretable Tweet Geolocation Prediction | Bo Hui (Auburn University)*; Haiquan Chen (California State University, Sacramento); Da Yan (University of Alabama at Birmingham); Wei-Shinn Ku (Auburn University) |
56 | Efficient Relation-aware Scoring Function Search for Knowledge Graph Embedding | Shimin Di (The Hong Kong University of Science and Technology)*; Quanming Yao (4th Paradigm); Yongqi Zhang (4Paradigm Inc.); Lei Chen (Hong Kong University of Science and Technology) |
341 | InfoShield: Generalizable Information-Theoretic Human-Trafficking Detection | Meng-Chieh Lee (National Chiao Tung University)*; Catalina Vajiac (Carnegie Mellon University); Aayushi Kulshrestha (Mcgill University); Sacha Levy (McGill University); Namyong Park (Carnegie Mellon University); Cara Jones (Marinus Analytics); Christos Faloutsos (CMU); Reihaneh Rabbany (McGill University) |
518 | An Efficient Approach for Cross-Silo Federated Learning to Rank | Yansheng Wang (Beihang University); Yongxin Tong (Beihang University)*; Dingyuan Shi (Beihang University); Ke Xu (Beihang University) |
197 | Efficient Construction of Nonlinear Models Over Normalized Data | Zhaoyue Cheng (University of Toronto); Nick Koudas (University of Toronto); Zhe Zhang (York University)*; Xiaohui Yu (York University) |
773 | Workload-aware materialization for efficient variable elimination on Bayesian networks | Cigdem Aslay (Aarhus University)*; Martino Ciaperoni (Aalto University); Aristides Gionis (KTH Royal Institute of Technology); Michael Mathioudakis (University of Helsinki) |
Spatial and Temporal Data Managemnet 2 (Wednesday 21th April 2021 / 11.20-12.50) TRACK 4 |
632 | A Distance-Based Scheme for Reducing Bandwidth in Distributed Geometric Monitoring | Yuval Alfassi (University of Haifa); Moshe Gabel (University of Toronto ); Gal Yehuda (Technion, Israel Institute of Technology); Danny Keren (University of Haifa)* |
483 | SAKE: Spatial Question Answering over Knowledge Graph based on Embedding Techniques | Huan Li (Aalborg University)*; Hua Lu (Roskilde University); Lidan Shou (Zhejiang University); Ke Chen (Zhejiang University); Gang Chen (Zhejiang University) |
549 | LHist: Towards Learning Multi-dimensional Histogram for Massive Spatial Data | Qiyu LIU (Hong Kong University of Science and Technology)*; Yanyan Shen (Shanghai Jiao Tong University); Lei Chen (Hong Kong University of Science and Technology) |
23 | Data-Driven Fairness-Aware Vehicle Displacement for Large-Scale Electric Taxi Fleets | Guang Wang (Rutgers University)*; Shuxin Zhong (Rutgers University); Shuai Wang (Southeast University); Fei Miao (University of Connecticut); Zheng Dong (Wayne State University); Desheng Zhang (Rutgers University) |
130 | On Efficient and Scalable Time-continuous Spatial Crowdsourcing | Ting Wang (USTC); Xike Xie (University of Science and Technology of China)*; Xin Cao (University of New South Wales); Torben Bach Pedersen (Aalborg University); Yang Wang (University of Science and Technology of China); Mingjun Xiao (University of Science and Technology of China) |
597 | Spatial-Temporal Similarity for Trajectories with Location Noise and Sporadic Sampling | Guanyao Li (The Hong Kong University of Science and Technology)*; Chih-chieh hung (National Chung Hsing university); Mengyun LIU (The Hong Kong University of Science and Technology); Linfei PAN (ETHZ); Wen-Chih Peng (National Chiao Tung University); S.-H. Gary Chan (The Hong Kong University of Science and Technology) |
Thursday 22th April 2021 |
Data Integration and Cleaning 2 (Thursday 22th April 2021 / 08.10-09.40) TRACK 1 |
157 | Learning to Characterize Matching Experts | Roee Shraga (Technion – Israel Institute of Technology)*; Ofra Amir (Technion); Avigdor Gal (Technion — Israel Institute of Technology) |
170 | End-to-end Task Based Parallelization for Entity Resolution on Dynamic Data | Leonardo Gazzarri (University of Stuttgart)*; Melanie Herschel (Universität Stuttgart) |
868 | KDDLog: Performance and Scalability in Knowledge Discovery by Declarative Queries with Aggregates | Youfu Li (UCLA)*; Jin Wang (UCLA); Mingda Li (UCLA); Ariyam Das (UCLA); Jiaqi Gu (UCLA); Carlo Zaniolo (UCLA, USA) |
729 | Cost–effective Variational Active Entity Resolution | Alex Bogatu (University of Manchester)*; Norman Paton (University of Manchester); Mark Douthwaite (Peak AI); Stuart Davie (Peak AI); André Freitas (University of Manchester) |
834 | Structured Object Matching Across Web Page Revisions | Tobias Bleifuß (Hasso Plattner Institute)*; Leon Bornemann (Hasso Plattner Institute); Dmitri V. Kalashnikov (AT&T Labs Research); Felix Naumann (Hasso Plattner Institute); Divesh Srivastava (AT&T Labs Research) |
837 | Automating Entity Matching Model Development | Pei Wang (Simon Fraser University)*; Jiannan Wang (Simon Fraser University); Jian Pei (Simon Fraser University); Weiling Zheng (Simon Fraser University) |
Graph Data Management 3 (Thursday 22th April 2021 / 08.10-09.40) TRACK 2 |
44 | A Framework to Quantify Approximate Simulation on Graph Data | Xiaoshuang Chen (University of New South Wales); Longbin Lai (Alibaba Corporation); Lu Qin (UTS); Xuemin Lin (University of New South Wales)*; BOGE LIU (University of New South Wales) |
89 | PEFP: Efficient k-hop Constrained s-t Simple Path Enumeration on FPGA | Zhengmin Lai (East China Normal University)*; You Peng (University of New South Wales); Shiyu Yang (Guangzhou University); Xuemin Lin (University of New South Wales); Wenjie Zhang (University of New South Wales) |
214 | DPTL+: Efficient Parallel Triangle Listing on Batch-Dynamic Graphs | Michael R Yu (UNSW)*; Lu Qin (UTS); Ying Zhang (University of Technology Sydney); Wenjie Zhang (University of New South Wales); Xuemin Lin (University of New South Wales) |
360 | Finding a Summary for All Maximal Cliques | Xiaofan Li (Swinburne University of Technology); Rui Zhou (Swinburne University of Technology)*; Lu Chen (Swinburne University of Technology); Yong Zhang (” Tsinghua University, China”); Chengfei Liu (Swinburne University of Technology); Qiang He (Swinburne University of Technology); Yun Yang (Swinburne University of Technology) |
512 | Efficient Algorithm for the Anchored k-Core Budget Minimization Problem | Kaixin Liu (Tsinghua University)*; Sibo Wang (The Chinese University of Hong Kong); Yong Zhang (” Tsinghua University, China”); Chunxiao Xing (Tsinghua University) |
543 | Scalable Graph Isomorphism: Combining Pairwise Color Refinement and Backtracking via Compressed Candidate Space | Geonmo Gu (Seoul National University); Yehyun Nam (Seoul National University); Kunsoo Park (Seoul National University); Zvi Galil (Georgia Institute of Technology); Giuseppe F. Italiano (LUISS University); Wook-Shin Han (POSTECH)* |
Distributed Data Management 2 (Thursday 22th April 2021 / 09.50-11.20) TRACK 1 |
153 | Scalable Model-Based Management of Correlated Dimensional Time Series in ModelarDB+ | Søren Kejser Jensen (Aalborg University)*; Torben Bach Pedersen (Aalborg University); Christian Thomsen (Aalborg University) |
324 | RCC: Resilient Concurrent Consensus for High-Throughput Secure Transaction Processing | Suyash Gupta (University of California Davis)*; Jelle Hellings (University of California Davis); Mohammad Sadoghi (University of California, Davis) |
403 | WipDB: A Write-in-place Key-value Store that Mimics Bucket Sort | Xingsheng Zhao (University of Texas at Arlington)*; Song Jiang (University of Texas, Arlington ); Xingbo Wu (University of Illinois at Chicago) |
663 | Lock Violation for Fault-tolerant Distributed Database System | Hua Guo (Renmin University of China); Xuan Zhou (East China Normal University)* |
831 | Efficient Control Flow in Dataflow Systems: When Ease-of-Use Meets High Performance | Gábor E. Gévay (Technische Universität Berlin)*; Tilmann Rabl (HPI, University of Potsdam); Sebastian Bress (TU Berlin); Lorand Madai-Tahy (TU Berlin); Jorge Arnulfo Quiane Ruiz (TU Berlin); Volker Markl (DFKI) |
898 | Samya: A Geo-Distributed Data System for High Contention Aggregate Data | Sujaya A Maiyya (University Of California, Santa Barbara)*; Ishtiyaque Ahmad (University of California, Santa Barbara); Divy Agrawal (University of California, Santa Barbara); Amr El Abbadi (UC Santa Barbara) |
Graph Data Management 4 (Thursday 22th April 2021 / 09.50-11.20) TRACK 2 |
520 | FAST: FPGA-based Subgraph Matching on Massive Graphs | Xin Jin (East China Normal University)*; Zhengyi Yang (University of New South Wales); Xuemin Lin (University of New South Wales); Shiyu Yang (Guangzhou University); Lu Qin (UTS); You Peng (University of New South Wales) |
531 | A+ Indexes: Tunable and Space-Efficient Adjacency Lists in Graph Database Management Systems | Amine Mhedhbi (University of Waterloo)*; Pranjal Gupta (University of Waterloo); Shahid Khaliq (University of Waterloo); Semih Salihoglu (University of Waterloo) |
537 | Explaining Missing Data in Graphs: A Constraint-based Approach | Qi Song (Amazon.com)*; Peng Lin (Washington State University); Hanchao Ma (Case Western Reserve University); Yinghui Wu (Case Western Reserve University) |
591 | Influence Maximization Based on Dynamic Personal Perception in Knowledge Graph | Ya-Wen Teng (Academia Sinica)*; Yishuo Shi (Wenzhou University); Chih-Hua Tai (National Taipei University); De-Nian Yang (Academia Sinica); Wang-Chien Lee (Pennsylvania State University, USA); Ming-Syan Chen (National Taiwan University) |
619 | Privacy Preserving Strong Simulation Queries for Large Graphs | Lyu Xu (Hong Kong Baptist University)*; Jiaxin Jiang (Hong Kong Baptist University); Byron Choi (Hong Kong Baptist University); Jianliang Xu (Hong Kong Baptist University); Sourav S Bhowmick (Nanyang Technological University) |
874 | Trillion-scale Graph Processing Simulation based on Top-Down Graph Upscaling | Himchan Park (KAIST); Jinjun Xiong (IBM Thomas J. Watson Research Center); Min-Soo Kim (KAIST)* |
Recommender Systems (Thursday 22th April 2021 / 09.50-11.20) TRACK 3 |
86 | Multi-Facet Recommender Networks with Spherical Optimization | Yanchao Tan (Zhejiang University)*; Carl Yang (Emory University); Xiangyu Wei (Zhejiang University); Yun Ma (Zhejiang University); Xiaolin Zheng (Zhejiang University) |
251 | Group-Buying Recommendation for Social E-Commerce | Jun Zhang (Tsinghua University); Chen Gao (Tsinghua University)*; Depeng Jin (Tsinghua University); Yong Li (Tsinghua University) |
269 | Reliable Recommendation with Review-level Explanations | Yanzhang Lyu (Xi`an Jiaotong University); Hongzhi Yin (The University of Queensland)*; Shizhuo Deng (Northeastern University); Jun Liu (Xi’an Jiaotong Univerisity); Huan Liu (Xi’an Jiaotong Univerisity); Mengyue Liu (Xi’an Jiaotong University) |
460 | Variational Self-attention Network for Sequential Recommendation | Jing Zhao (Soochow University); Pengpeng Zhao (Soochow University)*; Lei Zhao (Soochow University); Yanchi Liu (Rutgers University); Victor S. Sheng (Texas Tech University); Xiaofang Zhou (The Hong Kong University of Science and Technology) |
534 | Knowledge-Aware Group Representation Learning for Group Recommendation | Zhiyi Deng (University of Electronic Science and Technology of China); Changyu Li (University of Electronic Science and Technology of China); Shujin Liu (University of Electronic Science and Technology of China); Waqar Ali (University of Electronic Science and Technology of China); Jie Shao (University of Electronic Science and Technology of China)* |
119 | Attacking Black-box Recommendations via Copying Cross-Domain User Profiles | Wenqi FAN (The Hong Kong Polytechnic University)*; Tyler Derr (Michigan State University); Xiangyu Zhao (Michigan State University); Yao Ma (Michigan State University); Hui Liu (Michigan State University); Jianping Wang (City University of Hong Kong); Jiliang Tang (Michigan State University); Qing Li (The Hong Kong Polytechnic University ) |
Query Processing and Optimization 2 (Thursday 22th April 2021 / 11.30-13.00) TRACK 1 |
307 | Approximating multidimensional range counts with maximum error guarantees | Michael Shekelyan (University of Warwick)*; Anton Dignös (Free University of Bozen-Bolzano, Italy); Johann Gamper (Free University of Bozen-Bolzano, Italy); Minos Garofalakis (ATHENA Research Centre & Technical University of Crete) |
467 | LATEST: Learning-Assisted Selectivity Estimation Over Spatio-Textual Streams | Mayur M Patil (University of California, Riverside)*; Amr Magdy (University of California Riverside) |
629 | ProMIPS: Efficient High-Dimensional c-Approximate Maximum Inner Product Search with a Lightweight Index | Yang Song (Northeastern University)*; Yu Gu (Northeastern University); Rui Zhang (” University of Melbourne, Australia”); Ge Yu (Northeast University) |
14 | A Fully Dynamic Algorithm for k-Regret Minimizing Sets | Yanhao Wang (University of Helsinki)*; Yuchen Li (Singapore Management University); Raymond Chi-Wing Wong (Hong Kong University of Science and Technology); Kian-Lee Tan (National University of Singapore) |
501 | Optimizing Error-Bounded Lossy Compression for Scientific Data by Dynamic Spline Interpolation | Kai Zhao (University of California, Riverside)*; Sheng Di (Argonne National Laboratory, Lemont, IL); Maxim Dmitriev (Saudi Aramco); Thierry Tonellot (Saudi Aramco); zizhong chen (UC Riverside); Franck Cappello (Argonne National Laboratory, Lemont, IL) |
557 | MLCask: Efficient Management of Component Evolution in Collaborative Data Analytics Pipelines | Zhaojing Luo (National University of Singapore); Sai Ho Yeung (NUS); Meihui Zhang (Beijing Institute of Technology); Kaiping Zheng (National University of Singapore); Gang Chen (Zhejiang University); Feiyi Fan (ICTCAS); Qian Lin (ByteDance); Kee Yuan Ngiam (NUHS); Beng Chin Ooi (NUS)* |
Search and Retrieval (Thursday 22th April 2021 / 11.30-13.00) TRACK 2 |
556 | Improving Constrained Search Results By Data Melioration | Ido Guy (eBay Research); Tova Milo (Tel Aviv University); Slava Novgorodov (eBay Research); Brit Youngmann (Tel Aviv University)* |
513 | G-TADOC: Enabling Efficient GPU-Based Text Analytics without Decompression | Feng Zhang (Renmin University of China)*; Zaifeng Pan (Shanghai Jiao Tong University); Yanliang Zhou (Renmin University of China); Jidong Zhai (Tsinghua University); Xipeng Shen (North Carolina State University); Onur Mutlu (ETH); Xiaoyong Du (Renmin University of China) |
35 | Fast Similarity Computation for t-SNE | Yasuhiro Fujiwara (NTT Communication Science Laboratories)*; Yasutoshi Ida (NTT Software Innovation Center); Sekitoshi Kanai (NTT Software Innovation Center); Atsutoshi Kumagai (NTT Software Innovation Center); Naonori Ueda (NTT Communication Science Labs.) |
185 | Rapid Approximate Aggregation with Distribution-Sensitive Interval Guarantees | Stephen Macke (University of Illinois at Urbana-Champaign)*; Aditya Parameswaran (University of California, Berkeley); Maryam Aliakbarpour (MIT); Ronitt Rubinfeld (MIT, TAU); Ilias Diakonikolas (University of Wisconsin-Madison) |
862 | Optimally Summarizing Data by Small Fact Sets for Concise Answers to Voice Queries | Immanuel Trummer (Cornell)*; Connor Anderson (Cornell University) |
636 | Automatic Webpage Briefing | Yimeng Dai (University of Melbourne)*; Rui Zhang (” University of Melbourne, Australia”); Jianzhong Qi (The University of Melbourne) |
Spatial and Temporal Data Management 4 (Thursday 22th April 2021 / 11.30-13.00) TRACK 3 |
109 | EnhanceNet: Plugin Neural Networks for Enhancing Correlated Time Series Forecasting | Razvan G Cirstea (Aalborg University)*; Tung Kieu (Aalborg University); Chenjuan Guo (Aalborg University); Bin Yang (Aalborg University); Sinno Pan (NTU, Singapore) |
248 | Forecasting Ambulance Demand with Profiled Human Mobility via Heterogeneous Multi-Graph Neural Networks | Zhaonan Wang (The University of Tokyo)*; TIANQI XIA (The University of Tokyo); Renhe Jiang (The University of Tokyo); Xin Liu (National Institute of Advanced Industrial Science and Technology (AIST)); Kyoung-Sook Kim (Artificial Intelligence Research Center); Xuan Song (The University of Tokyo); Ryosuke Shibasaki (University of Tokyo) |
566 | Efficient Constrained Shortest Path Query Answering with Forest Hop Labeling | Ziyi Liu (The University of Queensland)*; Lei Li (University of Queensland); Wen Hua (The University of Queensland); Pingfu Chao (University of Queensland); Xiaofang Zhou (The Hong Kong University of Science and Technology) |
572 | TASM: A Tile-Based Storage Manager for Video Analytics | Maureen Daum (University of Washington)*; Brandon Haynes (Microsoft); Dong He (University of Washington); Amrita Mazumdar (University of Washington); Magdalena Balazinska (UW) |
790 | A Two-Layer Partitioning for Non-Point Spatial Data | Dimitrios Tsitsigkos (University of Ioannina); Konstantinos Lampropoulos (University of Ioannina); Panagiotis Bouros (Johannes Gutenberg University Mainz); Nikos Mamoulis (University of Ioannina)*; Manolis Terrovitis (IMIS, Athena RC) |
818 | Spangle: A Distributed In-Memory Processing System for Large-Scale Arrays | Sangchul Kim (Seoul National University); Bogyeong Kim (Seoul National University); Bongki Moon (Seoul National University)* |