SparkDQ: Efficient generic big data quality management on distributed data-parallel computation

Gu, R; Qi, Y; Wu, TY; Wang, ZK; Xu, XL; Yuan, CF; Huang, YH

Huang, YH (corresponding author), Nanjing Univ, State Key Lab Navel Software Technol, Nanjing, Peoples R China.; Wang, ZK (corresponding author), Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing, Peoples R China.

JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2021; 156 (): 132

Abstract

In the big data era, large amounts of data are under generation and accumulation in various industries. However, users usually feel hindered by the da......

Full Text Link