Research Output 2002-2019

2019

Farspeech: Arabic natural language processing for live Arabic speech

Eldesouki, M., Gopee, N., Ali, A. & Darwish, K., 1 Jan 2019, In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. 2019-September, p. 2372-2373 2 p.

Research output: Contribution to journal › Conference article

Natural Language
Processing
Neural Networks
Multimodality
Transcription
1 Citation (Scopus)

Multi-dialect Arabic POS tagging: A CRF approach

Darwish, K., Mubarak, H., Eldesouki, M., Abdelali, A., Samih, Y., Alharbi, R., Attia, M., Magdy, W. & Kallmeyer, L., 1 Jan 2019, LREC 2018 - 11th International Conference on Language Resources and Evaluation. Isahara, H., Maegaard, B., Piperidis, S., Cieri, C., Declerck, T., Hasida, K., Mazo, H., Choukri, K., Goggi, S., Mariani, J., Moreno, A., Calzolari, N., Odijk, J. & Tokunaga, T. (eds.). European Language Resources Association (ELRA), p. 93-98 6 p.

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

dialect
Tagging
Arabic Dialects
Train
Clitics

Part-of-speech tagging for Arabic Gulf dialect using Bi-LSTM

Alharbi, R., Magdy, W., Darwish, K., Abdelali, A. & Mubarak, H., 1 Jan 2019, LREC 2018 - 11th International Conference on Language Resources and Evaluation. Isahara, H., Maegaard, B., Piperidis, S., Cieri, C., Declerck, T., Hasida, K., Mazo, H., Choukri, K., Goggi, S., Mariani, J., Moreno, A., Calzolari, N., Odijk, J. & Tokunaga, T. (eds.). European Language Resources Association (ELRA), p. 3925-3932 8 p.

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

dialect
language
Short-term Memory
Part of Speech
Arabic Dialects
2017
5 Citations (Scopus)

Improved stance prediction in a user similarity feature space

Darwish, K., Magdy, W. & Zanouda, T., 31 Jul 2017, Proceedings of the 2017 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, ASONAM 2017. Association for Computing Machinery, Inc, p. 145-148 4 p.

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Joint topic modeling for event summarization across news and social media streams

Gao, W., Li, P. & Darwish, K., 1 Jan 2017, Social Media Content Analysis: Natural Language Processing and Beyond. World Scientific Publishing Co. Pte Ltd, p. 321-346 26 p.

Research output: Chapter in Book/Report/Conference proceeding › Chapter

Experiments
3 Citations (Scopus)

Language processing and learning models for community question answering in Arabic

Romeo, S., Martino, G., Belinkov, Y., Barron, A., Eldesouki, M., Darwish, K., Mubarak, H., Glass, J. & Moschitti, A., 1 Jan 2017, (Accepted/In press) In: Information Processing and Management.

Research output: Contribution to journal › Article

neural network
Processing
language
learning
community
4 Citations (Scopus)

Learning from relatives: Unified dialectal Arabic segmentation

Samih, Y., Eldesouki, M., Attia, M., Darwish, K., Abdelali, A., Mubarak, H. & Kallmeyer, L., 1 Jan 2017, CoNLL 2017 - 21st Conference on Computational Natural Language Learning, Proceedings. Association for Computational Linguistics (ACL), p. 432-441 10 p. (CoNLL 2017 - 21st Conference on Computational Natural Language Learning, Proceedings).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

dialect
learning
Linguistics
Labeling
10 Citations (Scopus)

Seminar users in the Arabic Twitter sphere

Darwish, K., Alexandrov, D., Nakov, P. & Mejova, Y., 1 Jan 2017, Social Informatics - 9th International Conference, SocInfo 2017, Proceedings. Springer Verlag, p. 91-108 18 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 10539 LNCS).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Technical presentations
Social Media
Minor
8 Citations (Scopus)

Trump vs. Hillary: What went viral during the 2016 US presidential election

Darwish, K., Magdy, W. & Zanouda, T., 1 Jan 2017, Social Informatics - 9th International Conference, SocInfo 2017, Proceedings. Springer Verlag, p. 143-161 19 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 10539 LNCS).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Elections
Statistics
Social Media
Qualitative Analysis
Quantitative Analysis
2016
20 Citations (Scopus)

#FailedRevolutions: Using Twitter to study the antecedents of ISIS support

Magdy, W., Darwish, K. & Weber, I., 2016, In: First Monday. 21, 2

Research output: Contribution to journal › Article

twitter
Syria
Iraq
Classifiers
opposition
22 Citations (Scopus)

#ISISisNotIslam or #DeportAllMuslims? Predicting unspoken views

Magdy, W., Darwish, K., Abokhodair, N., Rahimi, A. & Baldwin, T., 22 May 2016, WebSci 2016 - Proceedings of the 2016 ACM Web Science Conference. Association for Computing Machinery, Inc, p. 95-106 12 p.

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Macros
Classifiers
Sampling
14 Citations (Scopus)

Farasa: A new fast and accurate Arabic word segmenter

Darwish, K. & Mubarak, H., 1 Jan 2016, Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016. European Language Resources Association (ELRA), p. 1070-1074 5 p.

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

segmentation
Kernel
Java
Template
Open Source
2015
41 Citations (Scopus)

Content and network dynamics behind Egyptian political polarization on Twitter

Borge-Holthoefer, J., Magdy, W., Darwish, K. & Weber, I., 28 Feb 2015, CSCW 2015 - Proceedings of the 2015 ACM International Conference on Computer-Supported Cooperative Work and Social Computing. Association for Computing Machinery, Inc, p. 700-711 12 p.

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Polarization
3 Citations (Scopus)

"I like ISIS, but I want to watch Chris Nolan's new movie": Exploring ISIS supporters on Twitter

Magdy, W., Darwish, K. & Weber, I., 24 Aug 2015, HT 2015 - Proceedings of the 26th ACM Conference on Hypertext and Social Media. Association for Computing Machinery, Inc, p. 321-322 2 p.

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Navigation
13 Citations (Scopus)

Overview of the AraPlagDet PAN@FIRE2015 shared task on Arabic plagiarism detection

Bensalem, I., Boukhalfa, I., Rosso, P., Abouenour, L., Darwish, K. & Chikhi, S., 2015, In: Unknown Journal. 1587, p. 111-122 12 p.

Research output: Contribution to journal › Article

detection method
detection
evaluation
method
14 Citations (Scopus)

Randomized greedy inference for joint segmentation, POS tagging and dependency parsing

Zhang, Y., Li, C., Barzilay, R. & Darwish, K., 2015, NAACL HLT 2015 - 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference. Association for Computational Linguistics (ACL), p. 42-52 11 p.

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Pipelines
Parsing
Segmentation
Inference
Tagging
2014

Query term expansion by automatic learning of morphological equivalence patterns from Wikipedia

Darwish, K., Ali, A. & Abdelali, A., 2014, CEUR Workshop Proceedings. CEUR-WS, Vol. 1204. p. 24-29 6 p.

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Processing
Linguistics
Degradation
6 Citations (Scopus)

Simple effective microblog named entity recognition: Arabic as an example

Darwish, K. & Gao, W., 1 Jan 2014, Proceedings of the 9th International Conference on Language Resources and Evaluation, LREC 2014. European Language Resources Association (ELRA), p. 2513-2517 5 p.

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

language
news
Entity
19 Citations (Scopus)

Using stem-templates to improve Arabic POS and gender/number tagging

Darwish, K., Abdelali, A. & Mubarak, H., 1 Jan 2014, Proceedings of the 9th International Conference on Language Resources and Evaluation, LREC 2014. European Language Resources Association (ELRA), p. 2926-2931 6 p.

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

gender
Template
Tagging
Nouns
Adjective
17 Citations (Scopus)

Verifiably effective Arabic dialect identification

Darwish, K., Sajjad, H. & Mubarak, H., 2014, EMNLP 2014 - 2014 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference. Association for Computational Linguistics (ACL), p. 1465-1468 4 p.

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Identification (control systems)
2013
39 Citations (Scopus)

Arabic information retrieval

Darwish, K. & Magdy, W., 1 Dec 2013, In: Foundations and Trends in Information Retrieval. 7, 4, p. 239-342 104 p.

Research output: Contribution to journal › Article

Information retrieval
Image retrieval
Optical character recognition
Formal languages
Processing
22 Citations (Scopus)

Detecting comments on news articles in microblogs

Kothari, A., Magdy, W., Darwish, K., Mourad, A. & Taei, A., 1 Jan 2013, Proceedings of the 7th International Conference on Weblogs and Social Media, ICWSM 2013. AAAI press, p. 293-302 10 p.

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Learning systems
20 Citations (Scopus)

Named entity recognition using cross-lingual resources: Arabic as an example

Darwish, K., 1 Jan 2013, ACL 2013 - 51st Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference. Association for Computational Linguistics (ACL), Vol. 1. p. 1558-1567 10 p.

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

news
Wikipedia
lack
language
resources
16 Citations (Scopus)

Translating dialectal Arabic to English

Sajjad, H., Darwish, K. & Belinkov, Y., 1 Jan 2013, ACL 2013 - 51st Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference. Association for Computational Linguistics (ACL), Vol. 2. p. 1-6 6 p.

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

vocabulary
coverage
Translating
Egyptians
Spelling
2012
8 Citations (Scopus)

Arabic retrieval revisited: Morphological hole filling

Darwish, K. & Ali, A., 1 Dec 2012, 50th Annual Meeting of the Association for Computational Linguistics, ACL 2012 - Proceedings of the Conference. Vol. 2. p. 218-222 5 p.

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Linguistics
4 Citations (Scopus)

A summarization tool for time-sensitive social media

Magdy, W., Ali, A. & Darwish, K., 19 Dec 2012, ACM International Conference Proceeding Series. p. 2695-2697 3 p.

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Monitoring
Time series
50 Citations (Scopus)

Joint topic modeling for event summarization across news and social media streams

Gao, W., Li, P. & Darwish, K., 19 Dec 2012, ACM International Conference Proceeding Series. p. 1173-1182 10 p.

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Experiments
31 Citations (Scopus)

Language processing for Arabic microblog retrieval

Darwish, K., Magdy, W. & Mourad, A., 19 Dec 2012, ACM International Conference Proceeding Series. p. 2427-2430 4 p.

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Speech analysis
Processing
4 Citations (Scopus)

Statistical denormalization for Arabic text

Moussa, M., Fakhr, M. W. & Darwish, K., 1 Dec 2012, 11th Conference on Natural Language Processing, KONVENS 2012: Empirical Methods in Natural Language Processing - Proceedings of the Conference on Natural Language Processing 2012. Vol. 5. p. 228-232 5 p.

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Error correction
Labeling
Processing
4 Citations (Scopus)

Transliteration mining using large training and test sets

Kahki, A. E., Darwish, K., Din, A. S. E. & El-Wahab, M. A., 2012, NAACL HLT 2012 - 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference. Association for Computational Linguistics (ACL), p. 243-252 10 p.

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

reinforcement
training method
ranking
penalty
2011
4 Citations (Scopus)

ICE-TEA: In-context expansion and translation of English abbreviations

Ammar, W., Darwish, K., El Kahki, A. & Hafez, K., 9 Mar 2011, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). PART 2 ed. Vol. 6609 LNCS. p. 41-54 14 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 6609 LNCS, no. PART 2).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abbreviation
Hybrid systems
Context
Test Set
12 Citations (Scopus)

Improved transliteration mining using graph reinforcement

El-Kahky, A., Darwish, K., Aldein, A. S., El-Wahab, M. A., Hefny, A. & Ammar, W., 3 Oct 2011, EMNLP 2011 - Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference. p. 1384-1393 10 p.

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Reinforcement
Query languages
Processing
2 Citations (Scopus)

Is a query worth translating: Ask the users!

Hefny, A., Darwish, K. & Alkahky, A., 1 Jan 2011, Advances in Information Retrieval - 33rd European Conference on IR Research, ECIR 2011, Proceedings. Springer Verlag, p. 238-250 13 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 6611 LNCS).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Query languages
Query
Merging
Query Language
Baseline

QCRI @ TREC 2011: Microblog track

El-Kahki, A. & Darwish, K., 2011, NIST Special Publication.

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

2010

Improved relevance feedback using density-based clustering

Darwish, K., El Deeb, A., Yousri, N. & Kamel, M. S., 1 Dec 2010, Proceedings of the 2010 10th International Conference on Intelligent Systems Design and Applications, ISDA'10. p. 580-585 6 p. 5687203

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Feedback
Clustering algorithms
2 Citations (Scopus)

Omni font OCR error correction with effect on retrieval

Magdy, W. & Darwish, K., 1 Dec 2010, Proceedings of the 2010 10th International Conference on Intelligent Systems Design and Applications, ISDA'10. p. 415-420 6 p. 5687228

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Optical character recognition
Error correction
Analog to digital conversion
Degradation
2009

CMIC@INEX 2008: Link-the-wiki track

Darwish, K., 4 Nov 2009, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 5631 LNCS. p. 337-342 6 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 5631 LNCS).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Anchors
Wikipedia
Engine
Query
Engines

CMIC@TREC-2009: Relevance feedback track

Darwish, K. & El-Deeb, A., 2009, NIST Special Publication.

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Feedback
Weighing
1 Citation (Scopus)

Efficient language-independent retrieval of printed documents without OCR

Magdy, W., Darwish, K. & El-Saban, M., 9 Nov 2009, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 5721 LNCS. p. 334-343 10 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 5721 LNCS).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Optical character recognition
Retrieval
Engine
Subword
Digitization
2008
3 Citations (Scopus)

Automatic extraction of textual elements from news web pages

Ibrahim, H., Darwish, K. & Abdel-Sabor, A. R., 1 Jan 2008, Proceedings of the 6th International Conference on Language Resources and Evaluation, LREC 2008. European Language Resources Association (ELRA), p. 1600-1603 4 p.

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

news
hypertext
World Wide Web
News
Classifier
5 Citations (Scopus)

Book search: Indexing the valuable parts

Magdy, W. & Darwish, K., 1 Dec 2008, International Conference on Information and Knowledge Management, Proceedings. p. 53-56 4 p.

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Indexing
Isolation
Web search
Hypertext

CMIC at INEX 2007: Book Search track

Magdy, W. & Darwish, K., 22 Sep 2008, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 4862 LNCS. p. 175-182 8 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 4862 LNCS).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Analog to digital conversion
Retrieval
Digitization
Indexing
9 Citations (Scopus)

Effect of OCR error correction on Arabic retrieval

Magdy, W. & Darwish, K., 1 Oct 2008, In: Information Retrieval. 11, 5, p. 405-425 21 p.

Research output: Contribution to journal › Article

Optical character recognition
Error correction
language
ability
Degradation
2007
11 Citations (Scopus)

BioNoculars: Extracting protein-protein interactions from biomedical text

Madkour, A., Darwish, K., Hassan, H., Hassan, A. & Emam, O., 1 Jan 2007, p. 89-96. 8 p.

Research output: Contribution to conference › Paper

Proteins
MEDLINE
Protein Databases
Protein
Interaction
3 Citations (Scopus)

Error correction vs. query garbling for Arabic OCR document retrieval

Darwish, K. & Magdy, W., 1 Nov 2007, In: ACM Transactions on Information Systems. 26, 1, 5.

Research output: Contribution to journal › Article

Optical character recognition
Error correction
Query
5 Citations (Scopus)

Providing multilingual access to FLICKR for Arabic users

Clough, P., Al-Maskari, A. & Darwish, K., 1 Dec 2007, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 4730 LNCS. p. 205-216 12 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 4730 LNCS).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Language
User Studies
Glossaries
Experiments
Availability
2006
23 Citations (Scopus)

Arabic OCR error correction using character segment correction, language modeling, and shallow morphology

Magdy, W. & Darwish, K., 1 Dec 2006, COLING/ACL 2006 - EMNLP 2006: 2006 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference. p. 408-414 7 p.

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Optical character recognition
Error correction
Glossaries

Building a heterogeneous information retrieval test collection of Arabic document images

Darwish, K., Magdy, W., Emam, O., Abdelsapor, A., Adly, N. & Nagi, M., 1 Jan 2006, p. 657-662. 6 p.

Research output: Contribution to conference › Paper

information retrieval
experiment
Information Retrieval

Providing multilingual access to FLICKR for Arabic users

Clough, P., Al-Maskari, A. & Darwish, K., 2006, CEUR Workshop Proceedings. CEUR-WS, Vol. 1172.

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Glossaries
Experiments
Availability

Word-based correction for retrieval of Arabic OCR degraded documents

Magdy, W. & Darwish, K., 31 Oct 2006, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 4209 LNCS. p. 205-216 12 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 4209 LNCS).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Optical character recognition
Retrieval
Language
N-gram
Channel Model