• 2091 Citations
• 24 h-Index

Research Output 1999-2019

2019

ANMAT: Automatic knowledge discovery and error detection through pattern functional dependencies

Qahtan, A., Tang, N., Ouzzani, M., Cao, Y. & Stonebraker, M., 25 Jun 2019, SIGMOD 2019 - Proceedings of the 2019 International Conference on Management of Data. Association for Computing Machinery, p. 1977-1980 4 p. (Proceedings of the ACM SIGMOD International Conference on Management of Data).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Error detection
Finance
Data mining
Demonstrations
Personnel

BioNetApp: An interactive visual data analysis platform for molecular expressions

Roumani, A. M., Madkour, A., Ouzzani, M., McGrew, T., Omran, E. & Zhang, X., 1 Feb 2019, In : PLOS ONE. 14, 2, e0211277.

Research output: Contribution to journal › Article

Open Access
data analysis
Software
metabolomics
Visualization
Metabolomics

Efficient parallel skyline query processing for high-dimensional data

Tang, M., Yu, Y., Aref, W. G., Malluhi, Q. M. & Ouzzani, M., 1 Apr 2019, Proceedings - 2019 IEEE 35th International Conference on Data Engineering, ICDE 2019. IEEE Computer Society, p. 2113-2114 2 p. 8731496. (Proceedings - International Conference on Data Engineering; vol. 2019-April).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Query processing
Set theory
Processing
Merging
Decision making
2 Citations (Scopus)

EXPLAINER: Entity resolution explanations

Ebaid, A., Thirumuruganathan, S., Aref, W. G., Elmagarmid, A. & Ouzzani, M., 1 Apr 2019, Proceedings - 2019 IEEE 35th International Conference on Data Engineering, ICDE 2019. IEEE Computer Society, p. 2000-2003 4 p. 8731597. (Proceedings - International Conference on Data Engineering; vol. 2019-April).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Classifiers
Learning systems
Cleaning
Pipelines

Explaining entity resolution predictions: Where are we and what needs to be done?

Thirumuruganathan, S., Ouzzani, M. & Tang, N., 5 Jul 2019, Proceedings of the Workshop on Human-In-the-Loop Data Analytics, HILDA 2019. Association for Computing Machinery, a10. (Proceedings of the ACM SIGMOD International Conference on Management of Data).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Open Access
Classifiers
Data integration
Insurance
Taxonomies
Learning systems
2 Citations (Scopus)

Raha: A configuration-free error detection system

Mahdavi, M., Madden, S., Abedjan, Z., Ouzzani, M., Tang, N., Fernandez, R. C. & Stonebraker, M., 25 Jun 2019, SIGMOD 2019 - Proceedings of the 2019 International Conference on Management of Data. Association for Computing Machinery, p. 865-882 18 p. (Proceedings of the ACM SIGMOD International Conference on Management of Data).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Error detection
Cleaning
Sampling
Experiments

Towards an end-to-end human-centric data cleaning framework

Rezig, E. K., Ouzzani, M., Elmagarmid, A., Aref, W. G. & Stonebraker, M., 5 Jul 2019, Proceedings of the Workshop on Human-In-the-Loop Data Analytics, HILDA 2019. Association for Computing Machinery, a1. (Proceedings of the ACM SIGMOD International Conference on Management of Data).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Open Access
Cleaning
Repair
Pipelines
1 Citation (Scopus)

Unsupervised string transformation learning for entity consolidation

Deng, D., Tao, W., Abedjan, Z., Elmagarmid, A., Ilyas, I. F., Li, G., Madden, S., Ouzzani, M., Stonebraker, M. & Tang, N., 1 Apr 2019, Proceedings - 2019 IEEE 35th International Conference on Data Engineering, ICDE 2019. IEEE Computer Society, p. 196-207 12 p. 8731550. (Proceedings - International Conference on Data Engineering; vol. 2019-April).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Data integration
Consolidation
Information management
Data fusion
2018

AUDIT: approving and tracking updates with dependencies in collaborative databases

Mershad, K., Malluhi, Q. M., Ouzzani, M., Tang, M., Gribskov, M. & Aref, W. G., 1 Mar 2018, In : Distributed and Parallel Databases. 36, 1, p. 81-119 39 p.

Research output: Contribution to journal › Article

Genes
Data base
Experiments
Scenarios
Authorization
2 Citations (Scopus)

Building data civilizer pipelines with an advanced workflow engine

Mansour, E., Deng, D., Fernandez, R. C., Qahtan, A., Tao, W., Abedjan, Z., Elmagarmid, A., Ilyas, I. F., Madden, S., Ouzzani, M., Stonebraker, M. & Tang, N., 24 Oct 2018, Proceedings - IEEE 34th International Conference on Data Engineering, ICDE 2018. Institute of Electrical and Electronics Engineers Inc., p. 1593-1596 4 p. 8509405

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Cleaning
Pipelines
Engines
Industry
Data warehouses

COACT: a query interface language for collaborative databases

Mershad, K., Malluhi, Q. M., Ouzzani, M., Tang, M., Gribskov, M., Aref, W. G. & Prakash, D., 1 Mar 2018, In : Distributed and Parallel Databases. 36, 1, p. 121-151 31 p.

Research output: Contribution to journal › Article

Query languages
Feedback
Data base
Language
Query
1 Citation (Scopus)

Data civilizer 2.0: A holistic framework for data preparation and analytics

Rezig, E. K., Cao, L., Stonebraker, M., Simonini, G., Tao, W., Madden, S., Ouzzani, M., Tang, N. & Elmagarmid, A. K., 1 Jan 2018, In : Proceedings of the VLDB Endowment. 12, 12, p. 1954-1957 4 p.

Research output: Contribution to journal › Conference article

Learning systems
Cleaning
Brain
Visualization
Tuning
4 Citations (Scopus)

Efficient Parallel Skyline Query Processing for High-Dimensional Data

Tang, M., Yu, Y., Aref, W. G., Malluhi, Q. & Ouzzani, M., 23 Feb 2018, (Accepted/In press) In : IEEE Transactions on Knowledge and Data Engineering.

Research output: Contribution to journal › Article

Query processing
Merging
Processing
Costs
Experiments
2 Citations (Scopus)

FAHES: Detecting disguised missing values

Qahtan, A., Elmagarmid, A., Ouzzani, M. & Tang, N., 24 Oct 2018, Proceedings - IEEE 34th International Conference on Data Engineering, ICDE 2018. Institute of Electrical and Electronics Engineers Inc., p. 1609-1612 4 p. 8509409

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Missing values
Module
Outlier detection
2 Citations (Scopus)

FAHES: A robust disguised missing values detector

Qahtan, A., Elmagarmid, A., Fernandez, R. C., Ouzzani, M. & Tang, N., 19 Jul 2018, KDD 2018 - Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. Association for Computing Machinery, p. 2100-2109 10 p.

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Syntactics
Standardization
Education
Statistics
Detectors
13 Citations (Scopus)

Making progress with the automation of systematic reviews: Principles of the International Collaboration for the Automation of Systematic Reviews (ICASR)

On behalf of the founding members of the ICASR group, 19 May 2018, In : Systematic Reviews. 7, 1, 77.

Research output: Contribution to journal › Comment/debate

Automation
Natural Language Processing
Data Mining
Delivery of Health Care
4 Citations (Scopus)

Seeping semantics: linking datasets using word embeddings for data discovery

Castro Fernandez, R., Mansour, E., Qahtan, A., Elmagarmid, A., Ilyas, I., Madden, S., Ouzzani, M., Stonebraker, M. & Tang, N., 24 Oct 2018, Proceedings - IEEE 34th International Conference on Data Engineering, ICDE 2018. Institute of Electrical and Electronics Engineers Inc., p. 989-1000 12 p. 8509314

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Semantics
Syntactics
Glossaries
Ontology
Industry
2017
2 Citations (Scopus)

A demonstration of Lusail - Querying linked data at scale

Mansour, E., Abdelaziz, I., Ouzzani, M., Aboulnaga, A. & Kalnis, P., 9 May 2017, SIGMOD 2017 - Proceedings of the 2017 ACM International Conference on Management of Data. Association for Computing Machinery, Vol. Part F127746. p. 1603-1606 4 p.

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Demonstrations
Scalability
Engines
Decomposition
Internet of things
7 Citations (Scopus)

A demo of the data civilizer system

Fernandez, R. C., Deng, D., Mansour, E., Qahtan, A., Tao, W., Abedjan, Z., Elmagarmid, A., Ilyas, I. F., Madden, S., Ouzzani, M., Stonebraker, M. & Tang, N., 9 May 2017, SIGMOD 2017 - Proceedings of the 2017 ACM International Conference on Management of Data. Association for Computing Machinery, Vol. Part F127746. p. 1639-1642 4 p.

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Cleaning
Data integration
Information management
Industry
Engines
60 Citations (Scopus)

A service computing manifesto: The next 10 years

Bouguettaya, A., Singh, M., Huhns, M., Sheng, Q. Z., Dong, H., Yu, Q., Neiat, A. G., Mistry, S., Benatallah, B., Medjahed, B., Ouzzani, M., Casati, F., Liu, X., Wang, H., Georgakopoulos, D., Chen, L., Nepal, S., Malik, Z., Erradi, A., Wang, Y., Blake, B., Dustdar, S., Leymann, F. & Papazoglou, M., 1 Apr 2017, In : Communications of the ACM. 60, 4, p. 64-72 9 p.

Research output: Contribution to journal › Review article

Mobile computing
Cloud computing
Industry
Internet of things
Big data
14 Citations (Scopus)

Distributed representations of tuples for entity resolution

Ebraheem, M., Thirumuruganathan, S., Rayhan Joty, S., Ouzzani, M. & Tang, N., 1 Jan 2017, In : Proceedings of the VLDB Endowment. 11, 11, p. 1454-1467 14 p.

Research output: Contribution to journal › Conference article

Recurrent neural networks
Labeling
Tuning
Chemical analysis
Long short-term memory

Erratum: Lightning Fast and Space Efficient Inequality Joins. [PVLDB, 8, 13, (2017) (2074-2085)] DOI: 10.14778/2831360.2831362

Khayyat, Z., Lucia, W., Singh, M., Ouzzani, M., Papotti, P., Quiane Ruiz, J. A., Tang, N. & Kalnis, P., 1 Jan 2017, In : Proceedings of the VLDB Endowment. 10, 9, 1 p.

Research output: Contribution to journal › Comment/debate

Lightning
Feedback
Positive ions
9 Citations (Scopus)

In-memory distributed matrix computation processing & optimization

Yu, Y., Tang, M., Aref, W. G., Malluhi, Q. M., Abbas, M. & Ouzzani, M., 16 May 2017, Proceedings - 2017 IEEE 33rd International Conference on Data Engineering, ICDE 2017. IEEE Computer Society, p. 1047-1058 12 p. 7930046

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Data storage equipment
Processing
Costs
Competitive intelligence
Communication
4 Citations (Scopus)

Query optimizations over decentralized RDF graphs

Abdelaziz, I., Mansour, E., Ouzzani, M., Aboulnaga, A. & Kalnis, P., 16 May 2017, Proceedings - 2017 IEEE 33rd International Conference on Data Engineering, ICDE 2017. IEEE Computer Society, p. 139-142 4 p. 7929955

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Scalability
Query processing
Decomposition
Communication
Internet of things
7 Citations (Scopus)

RHEEM: Enabling cross platform data processing

Agrawal, D., Chawla, S., Contreras-Rojas, B., Elmagarmid, A., Idris, Y., Kaoudi, Z., Kruse, S., Lucas, J., Mansour, E., Ouzzani, M., Papotti, P., Quiané-Ruiz, J. A., Tang, N., Thirumuruganathan, S. & Troudi, A., 1 Jan 2017, In : Proceedings of the VLDB Endowment. 11, 11, p. 1414-1427 14 p.

Research output: Contribution to journal › Conference article

Costs
Industry
Mechanics
32 Citations (Scopus)

The data civilizer system

Deng, D., Castro Fernandez, R., Abedjan, Z., Wang, S., Stonebraker, M., Elmagarmid, A., Ilyas, I. F., Madden, S., Ouzzani, M. & Tang, N., 1 Jan 2017.

Research output: Contribution to conference › Paper

Query processing
Cleaning
Information management
Engines
Chemical analysis
10 Citations (Scopus)

UGuide - User-guided discovery of FD-detectable errors

Thirumuruganathan, S., Berti-Equille, L., Ouzzani, M., Quiane Ruiz, J. A. & Tang, N., 9 May 2017, SIGMOD 2017 - Proceedings of the 2017 ACM International Conference on Management of Data. Association for Computing Machinery, Vol. Part F127746. p. 1385-1397 13 p.

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Error detection
Experiments
2016

Cyber security - Part 1 the need to share

Elmagarmid, A., Cochrane, P. & Ouzzani, M., 2016, In : Journal of the Institute of Telecommunications Professionals. 10, 3, p. 28-32 5 p.

Research output: Contribution to journal › Article

Theaters

Cyber security - Part 2 auto-immunity

Elmagarmid, A., Cochrane, P. & Ouzzani, M., 2016, In : Journal of the Institute of Telecommunications Professionals. 10, 3, p. 33-37 5 p.

Research output: Contribution to journal › Article

Arsenals
14 Citations (Scopus)

DataXFormer: A robust transformation discovery system

Abedjan, Z., Morcos, J., Ilyas, I. F., Ouzzani, M., Papotti, P. & Stonebraker, M., 22 Jun 2016, 2016 IEEE 32nd International Conference on Data Engineering, ICDE 2016. Institute of Electrical and Electronics Engineers Inc., p. 1134-1145 12 p. 7498319

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Data integration
Airports
Feedback
Chemical analysis
Experiments
43 Citations (Scopus)

Detecting data errors: Where are we and what needs to be done?

Abedjan, Z., Chu, X., Deng, D., Fernandez, R. C., Ilyas, I. F., Ouzzani, M., Papotti, P., Stonebraker, M. & Tang, N., 2016, In : Proceedings of the VLDB Endowment. 9, 12, p. 993-1004 12 p.

Research output: Contribution to journal › Article

Cleaning
Error detection
Repair
Industry
7 Citations (Scopus)

Fast and scalable inequality joins

Khayyat, Z., Lucia, W., Singh, M., Ouzzani, M., Papotti, P., Quiane Ruiz, J. A., Tang, N. & Kalnis, P., 7 Sep 2016, (Accepted/In press) In : VLDB Journal. p. 1-26 26 p.

Research output: Contribution to journal › Article

Electric sparks
Cleaning
4 Citations (Scopus)

ORLF: A flexible framework for online record linkage and fusion

Rezig, E. K., Dragut, E. C., Ouzzani, M., Elmagarmid, A. & Aref, W. G., 22 Jun 2016, 2016 IEEE 32nd International Conference on Data Engineering, ICDE 2016. Institute of Electrical and Electronics Engineers Inc., p. 1378-1381 4 p. 7498349

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Fusion reactions
Record linkage
Fusion
Query
World Wide Web
14 Citations (Scopus)

Rheem: Enabling multi-platform task execution

Agrawal, D., Ba, L., Berti-Equille, L., Chawla, S., Elmagarmid, A., Hammady, H., Idris, Y., Kaoudi, Z., Khayyat, Z., Kruse, S., Ouzzani, M., Papotti, P., Quiane Ruiz, J. A., Tang, N. & Zaki, M. J., 26 Jun 2016, SIGMOD 2016 - Proceedings of the 2016 International Conference on Management of Data. Association for Computing Machinery, Vol. 26-June-2016. p. 2069-2072 4 p.

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Data fusion
Learning systems
Cleaning
Gases
Oils
16 Citations (Scopus)

Road to freedom in big data analytics

Agrawal, D., Chawla, S., Elmagarmid, A., Kaoudi, Z., Ouzzani, M., Papotti, P., Quiane Ruiz, J. A., Tang, N. & Zaki, M. J., 1 Jan 2016, Advances in Database Technology - EDBT 2016: 19th International Conference on Extending Database Technology, Proceedings. OpenProceedings.org, Vol. 2016-March. p. 479-484 6 p.

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Interoperability
Cleaning
Big data
10 Citations (Scopus)

Similarity Group-by Operators for Multi-Dimensional Relational Data

Tang, M., Tahboub, R. Y., Aref, W. G., Atallah, M. J., Malluhi, Q. M., Ouzzani, M. & Silva, Y. N., 1 Feb 2016, In : IEEE Transactions on Knowledge and Data Engineering. 28, 2, p. 510-523 14 p., 7289415.

Research output: Contribution to journal › Article

Semantics
Mathematical operators
2 Citations (Scopus)

Similarity Group-By operators for multi-dimensional relational data

Tang, M., Tahboub, R. Y., Aref, W. G., Atallah, M. J., Malluhi, Q. M., Ouzzani, M. & Silva, Y. N., 22 Jun 2016, 2016 IEEE 32nd International Conference on Data Engineering, ICDE 2016. Institute of Electrical and Electronics Engineers Inc., p. 1448-1449 2 p. 7498368

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Semantics
Mathematical operators
Operator
21 Citations (Scopus)

Temporal rules discovery for web data cleaning

Abedjan, Z., Akcora, C. G., Ouzzani, M., Papotti, P. & Stonebraker, M., 2016, Proceedings of the VLDB Endowment. 4 ed. Association for Computing Machinery, Vol. 9. p. 336-347 12 p.

Research output: Chapter in Book/Report/Conference proceeding › Chapter

Cleaning
Learning systems
Repair
Industry
2015
3 Citations (Scopus)

A demonstration of AQWA: Adaptive query-workload aware partitioning of big spatial data

Aly, A. M., Abdelhamid, A. S., Mahmood, A. R., Aref, W. G., Hassan, M. S., Elmeleegy, H. & Ouzzani, M., 2015, Proceedings of the VLDB Endowment. 12 ed. Association for Computing Machinery, Vol. 8. p. 1968-1971 4 p.

Research output: Chapter in Book/Report/Conference proceeding › Chapter

Cluster computing
Location based services
Smartphones
Electric sparks
Global positioning system
1 Citation (Scopus)

Approving updates in collaborative databases

Mershad, K., Malluhi, Q. M., Ouzzani, M., Tang, M. & Aref, W. G., 2015, Proceedings - 2015 IEEE International Conference on Cloud Engineering, IC2E 2015. Institute of Electrical and Electronics Engineers Inc., p. 42-47 6 p. 7092897

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

22 Citations (Scopus)

AQWA: Adaptive query-workload-aware partitioning of big spatial data

Aly, A. M., Mahmood, A. R., Hassan, M. S., Aref, W. G., Ouzzani, M., Elmeleegy, H. & Qadah, T., 2015, Proceedings of the VLDB Endowment. 13 ed. Association for Computing Machinery, Vol. 8. p. 2062-2073 12 p.

Research output: Chapter in Book/Report/Conference proceeding › Chapter

Cluster computing
Location based services
Processing
Experiments
46 Citations (Scopus)

BigDansing: A system for big data cleansing

Khayyat, Z., Ilyas, I. F., Jindal, A., Madden, S., Ouzzani, M., Papotti, P., Quiane Ruiz, J. A., Tang, N. & Yin, S., 27 May 2015, Proceedings of the ACM SIGMOD International Conference on Management of Data. Association for Computing Machinery, Vol. 2015-May. p. 1215-1230 16 p.

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

User interfaces
Scalability
Repair
Big data
4 Citations (Scopus)

Cost estimation of spatial k-Nearest-neighbor operators

Aly, A. M., Aref, W. G. & Ouzzani, M., 2015, EDBT 2015 - 18th International Conference on Extending Database Technology, Proceedings. OpenProceedings.org, University of Konstanz, University Library, p. 457-468 12 p.

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Mathematical operators
Costs
Location based services
Query processing
Data storage equipment
9 Citations (Scopus)

DataXFormer: Leveraging the web for semantic transformations

Abedjan, Z., Morcos, J., Gubanov, M., Ilyas, I. F., Stonebraker, M., Papotti, P. & Ouzzani, M., 1 Jan 2015.

Research output: Contribution to conference › Paper

Semantics
Data integration
Engines
World Wide Web
Experiments
10 Citations (Scopus)

DataXFormer: An interactive data transformation tool

Morcos, J., Abedjan, Z., Ilyas, I. F., Ouzzani, M., Papotti, P. & Stonebraker, M., 27 May 2015, Proceedings of the ACM SIGMOD International Conference on Management of Data. Association for Computing Machinery, Vol. 2015-May. p. 883-888 6 p.

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Syntactics
Demonstrations
Semantics
7 Citations (Scopus)

Efficient processing of hamming-distance-based similarity-search queries over MapReduce

Tang, M., Yu, Y., Aref, W. G., Malluhi, Q. M. & Ouzzani, M., 2015, EDBT 2015 - 18th International Conference on Extending Database Technology, Proceedings. OpenProceedings.org, University of Konstanz, University Library, p. 361-372 12 p.

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Hamming distance
Processing
Binary codes
Flavors
Redundancy
86 Citations (Scopus)

Katara: A data cleaning system powered by knowledge bases and crowdsourcing

Chu, X., Morcos, J., Ilyas, I. F., Ouzzani, M., Papotti, P., Tang, N. & Ye, Y., 27 May 2015, Proceedings of the ACM SIGMOD International Conference on Management of Data. Association for Computing Machinery, Vol. 2015-May. p. 1247-1261 15 p.

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Cleaning
Repair
Learning systems
Semantics
Statistics
9 Citations (Scopus)

KATARA: Reliable data cleaning with knowledge bases and crowdsourcing

Chu, X., Morcos, J., Ilyas, I. F., Ouzzani, M., Papotti, P., Tang, N. & Ye, Y., 2015, Proceedings of the VLDB Endowment. 12 ed. Association for Computing Machinery, Vol. 8. p. 1952-1955 4 p.

Research output: Chapter in Book/Report/Conference proceeding › Chapter

Cleaning
Repair
Semantics
Specifications
20 Citations (Scopus)

Learning to identify relevant studies for systematic reviews using random forest and external information

Khabsa, M., Elmagarmid, A., Ilyas, I., Hammady, H. & Ouzzani, M., 23 Oct 2015, (Accepted/In press) In : Machine Learning.

Research output: Contribution to journal › Article

Classifiers
Heuristic methods
Health care
Costs
Experiments
15 Citations (Scopus)

Lightning fast and space efficient inequality joins

Khayyat, Z., Lucia, W., Singh, M., Ouzzani, M., Papotti, P., Quiane Ruiz, J. A., Tang, N. & Kalnis, P., 2015, Proceedings of the VLDB Endowment. 13 ed. Association for Computing Machinery, Vol. 8. p. 2074-2085 12 p.

Research output: Chapter in Book/Report/Conference proceeding › Chapter

Lightning
Electric sparks