Ember: No-Code Context Enrichment via Similarity-Based Keyless Joins | 0 | 0.34 | 2022 |
Saga: A Platform for Continuous Construction and Serving of Knowledge At Scale | 0 | 0.34 | 2022 |
PCOR: Private Contextual Outlier Release via Differentially Private Search | 0 | 0.34 | 2021 |
Kamino: constraint-aware differentially private data synthesis | 0 | 0.34 | 2021 |
Properties of Inconsistency Measures for Databases | 0 | 0.34 | 2021 |
Attention-based Learning for Missing Data Imputation in HoloClean | 0 | 0.34 | 2020 |
Technical Report: Optimizing Human Involvement for Entity Matching and Consolidation | 0 | 0.34 | 2019 |
HoloDetect: Few-Shot Learning for Error Detection | 3 | 0.37 | 2019 |
Approximate Inference in Structured Instances with Noisy Categorical Observations | 0 | 0.34 | 2019 |
ExplIQuE: Interactive Databases Exploration with SQL | 0 | 0.34 | 2019 |
Unsupervised String Transformation Learning for Entity Consolidation | 2 | 0.36 | 2019 |
Distributed Implementations of Dependency Discovery Algorithms | 3 | 0.44 | 2019 |
Distributed Discovery of Functional Dependencies | 0 | 0.34 | 2019 |
Building Scalable Machine Learning Solutions for Data Cleaning | 0 | 0.34 | 2019 |
Distributed Dependency Discovery | 0 | 0.34 | 2019 |
APEx: Accuracy-Aware Differentially Private Data Exploration | 2 | 0.38 | 2019 |
Principles of Progress Indicators for Database Repairing | 0 | 0.34 | 2019 |
Secure Multi-Party Functional Dependency Discovery | 0 | 0.34 | 2019 |
A Formal Framework For Probabilistic Unclean Databases | 1 | 0.35 | 2018 |
Data Integration: The Current Status and the Way Forward | 3 | 0.36 | 2018 |
Seeping Semantics: Linking Datasets Using Word Embeddings for Data Discovery | 4 | 0.38 | 2018 |
Entity Consolidation: The Golden Record Problem | 0 | 0.34 | 2017 |
HoloClean: holistic data repairs with probabilistic inference | 34 | 0.90 | 2017 |
Data Quality: The Role of Empiricism | 2 | 0.36 | 2017 |
Smart Meter Data Analytics: Systems, Algorithms, and Benchmarking | 8 | 0.54 | 2017 |
The Data Civilizer System | 18 | 0.70 | 2017 |
A Demo of the Data Civilizer System | 3 | 0.37 | 2017 |
DataXFormer: A Robust Transformation Discovery System | 13 | 0.58 | 2016 |
Dark Data: Are We Solving The Right Problems? | 1 | 0.35 | 2016 |
Effective Data Cleaning with Continuous Evaluation | 0 | 0.34 | 2016 |
Learning to identify relevant studies for systematic reviews using random forest and external information | 7 | 0.73 | 2016 |
Data Cleaning: Overview and Emerging Challenges | 24 | 0.69 | 2016 |
Qualitative Data Cleaning | 0 | 0.34 | 2016 |
LONLIES: Estimating Property Values for Long Tail Entities | 2 | 0.41 | 2016 |
Distributed Data Deduplication | 12 | 0.52 | 2016 |
KATARA: reliable data cleaning with knowledge bases and crowdsourcing | 8 | 0.54 | 2015 |
KATARA: A Data Cleaning System Powered by Knowledge Bases and Crowdsourcing | 64 | 1.44 | 2015 |
DataXFormer: An Interactive Data Transformation Tool | 9 | 0.58 | 2015 |
BigDansing: A System for Big Data Cleansing | 42 | 1.18 | 2015 |
Benchmarking Smart Meter Data Analytics | 0 | 0.34 | 2015 |
SMAS: A smart meter data analytics system | 9 | 0.57 | 2015 |
Trends in Cleaning Relational Data: Consistency and Deduplication | 29 | 1.20 | 2015 |
DataXFormer: Leveraging the Web for Semantic Transformations | 10 | 0.58 | 2015 |
Descriptive and prescriptive data cleaning | 23 | 0.79 | 2014 |
NADEEF/ER: generic and interactive entity resolution | 5 | 0.44 | 2014 |
RuleMiner: Data quality rules discovery | 7 | 0.42 | 2014 |
Top-k nearest neighbor search in uncertain data series | 25 | 0.66 | 2014 |
Sampling from repairs of conditional functional dependency violations | 11 | 0.53 | 2014 |
NADEEF: a commodity data cleaning system | 99 | 2.79 | 2013 |
Discovering denial constraints | 46 | 1.34 | 2013 |