LRE-Rel Language Resources and Evaluation for Religious Texts
Total Page:16
File Type:pdf, Size:1020Kb
-Rel Language Resources and Evaluation for Religious Texts LRE-Rel Workshop Programme Tuesday 22 May 2012 09:00 10:30 Session 1 Papers 09:00 Eric Atwell, Claire Brierley, and Majdi Sawalha (Workshop Chairs) Introduction to Language Resources and Evaluation for Religious Texts 09.10 Harry Erwin and Michael Oakes Correspondence Analysis of the New Testament 09.30 Mohammad Hossein Elahimanesh, Behrouz Minaei-Bidgoli and Hossein Malekinezhad Automatic classification of Islamic Jurisprudence Categories 09.50 Nathan Ellis Rasmussen and Deryle Lonsdale Lexical Correspondences Between the Masoretic Text and the Septuagint 10.10 Hossein Juzi, Ahmed Rabiei Zadeh, Ehsan Baraty and Behrouz Minaei-Bidgoli A new framework for detecting similar texts in Islamic Hadith Corpora 10:30 11:20 Coffee break and Session 2 Posters Majid Asgari Bidhendi, Behrouz Minaei-Bidgoli and Hosein Jouzi Extracting person names from ancient Islamic Arabic texts Assem Chelli, Amar Balla and Taha Zerrouki Advanced Search in Quran: Classification and Proposition of All Possible Features Akbar Dastani, Behrouz Minaei-Bidgoli, Mohammad Reza Vafaei and Hossein Juzi An Introduction to Noor Diacritized Corpus Karlheinz Mörth, Claudia Resch, Thierry Declerck and Ulrike Czeitschner Linguistic and Semantic Annotation in Religious Memento Mori Literature Aida Mustapha, Zulkifli Mohd. Yusoff and Raja Jamilah Raja Yusof Mohsen Shahmohammadi, Toktam Alizadeh, Mohammad Habibzadeh Bijani and Behrouz Minaei A framework for detecting Holy Quran inside Arabic and Persian texts Gurpreet Singh Letter-to-Sound Rules for Gurmukhi Panjabi (Pa): First step towards Text-to-Speech for Gurmukhi Sanja Stajner and Ruslan Mitkov Style of Religious Texts in 20th Century Daniel Stein Multi-Word Expressions in the Spanish Bhagavad Gita, Extracted with Local Grammars Based on Semantic Classes Nagwa Younis Dictionaries? Taha Zerrouki, Ammar Balla Reusability of Quranic document using XML 11:20 13:00 Session 3 Papers 11.20 Halim Sayoud Authorship Classification of two Old Arabic Religious Books Based on a Hierarchical Clustering 11.40 Liviu P. Dinu, Ion Resceanu, Anca Dinu and Alina Resceanu Some issues on the authorship identification in the Apostles' Epistles 12.00 John Lee, Simon S. M. Wong, Pui Ki Tang and Jonathan Webster A Greek-Chinese Interlinear of the New Testament Gospels 12.20 Soyara Zaidi, Ahmed Abdelali, Fatiha Sadat and Mohamed-Tayeb Laskri Hybrid Approach for Extracting Collocations from Arabic Quran Texts 12.40 Eric Atwell, Claire Brierley, and Majdi Sawalha (Workshop Chairs) Plenary Discussion 13:00 End of Workshop Editors Eric Atwell: University of Leeds, UK Claire Brierley: University of Leeds, UK Majdi Sawalha: University of Jordan, Jordan Workshop Organizers AbdulMalik Al-Salman: King Saud University, Saudi Arabia Eric Atwell: University of Leeds, UK Claire Brierley: University of Leeds, UK Azzeddine Mazroui: Mohammed First University, Morocco Majdi Sawalha: University of Jordan, Jordan Abdul-Baquee Muhammad Sharaf: University of Leeds, UK Bayan Abu Shawar: Arab Open University, Jordan ii Workshop Programme Committee Nawal Alhelwal: Arabic Department, Princess Nora bint Abdulrahman University, Saudi Arabia Qasem Al-Radaideh: Computer Information Systems, Yarmouk University, Jordan AbdulMalik Al-Salman: Computer and Information Sciences, King Saud University, Saudi Arabia Eric Atwell: School of Computing, University of Leeds, UK Amna Basharat: Foundation for Advancement of Science and Technology, FAST-NU, Pakistan James Dickins: Arabic and Middle Eastern Studies, University of Leeds, UK Kais Dukes: School of Computing, University of Leeds, UK Mahmoud El-Haj: Computer Science and Electronic Engineering, University of Essex, UK Nizar Habash: Center for Computational Learning Systems, Columbia University, USA Salwa Hamada: Electronics Research Institute, Egypt Bassam Hasan Hammo: Information Systems, King Saud University, Saudi Arabia Dag Haug: Philosophy, Classics, History of Art and Ideas, University of Oslo, Norway Moshe Koppel: Department of Computer Science, Bar-Ilan University, Israel Rohana Mahmud: Computer Science & Information Technology, University of Malaya, Malaysia Azzeddine Mazroui: Mathematics and Computer Science, Mohammed 1st University, Morocco Tony McEnery: English Language and Linguistics, University of Lancaster, UK Aida Mustapha: Computer Science and Information Technology, University of Putra, Malaysia Mohamadou Nassourou: Computer Philology & Modern German Literature, Uni of Würzburg, Germany Nils Reiter: Department of Computational Linguistics, Heidelberg University, Germany Abdul-Baquee M. Sharaf: School of Computing, University of Leeds, UK Bayan Abu Shawar: Information Technology and Computing, Arab Open University, Jordan Andrew Wilson: Linguistics and English Language, University of Lancaster, UK Nagwa Younis: English Department, Ain Shams University, Egypt Wajdi Zaghouani: Linguistic Data Consortium, University of Pennsylvania, USA iii Table of contents Introduction to LRE for Religious Texts Eric Atwell, Claire Brierley, Majdi Sawalha.................vi Extracting person names from ancient Islamic Arabic texts Majid Asgari Bidhendi, Behrouz Minaei-Bidgoli and Hosein Jouzi....................................................1 Advanced Search in Quran: Classification and Proposition of All Possible Features Assem Chelli, Amar Balla and Taha Zerrouki.....................................................................................7 An Introduction to Noor Diacritized Corpus Akbar Dastani, Behrouz Minaei-Bidgoli, Mohammad Reza Vafaei and Hossein Juzi......................13 Some issues on the authorship identification in the Apostles' Epistles Liviu P. Dinu, Ion Resceanu, Anca Dinu and Alina Resceanu...........................................................18 Automatic classification of Islamic Jurisprudence Categories Mohammad Hossein Elahimanesh, Behrouz Minaei-Bidgoli and Hossein Malekinezhad................25 Correspondence Analysis of the New Testament Harry Erwin and Michael Oakes.......................................................................................................30 A new framework for detecting similar texts in Islamic Hadith Corpora Hossein Juzi, Ahmed Rabiei Zadeh, Ehsan Baraty and Behrouz Minaei-Bidgoli..............................38 A Greek-Chinese Interlinear of the New Testament Gospels John Lee, Simon S. M. Wong, Pui Ki Tang and Jonathan Webster...................................................42 Linguistic and Semantic Annotation in Religious Memento Mori Literature Karlheinz Mörth, Claudia Resch, Thierry Declerck and Ulrike Czeitschner....................................49 for Juzuk Amma Aida Mustapha, Zulkifli Mohd. Yusoff and Raja Jamilah Raja Yusof................................................54 Lexical Correspondences Between the Masoretic Text and the Septuagint Nathan Ellis Rasmussen and Deryle Lonsdale...................................................................................58 Authorship Classification of Two Old Arabic Religious Books Based on a Hierarchical Clustering Halim Sayoud.................................................................................................................65 A framework for detecting Holy Quran inside Arabic and Persian texts Mohsen Shahmohammadi, Toktam Alizadeh, Mohammad Habibzadeh Bijani, Behrouz Minaei......71 Letter-to-Sound Rules for Gurmukhi Panjabi (Pa): First step towards Text-to-Speech for Gurmukhi Gurpreet Singh...............................................................................................................77 Style of Religious Texts in 20th Century Sanja Stajner and Ruslan Mitkov.......................................................................................................81 Multi-Word Expressions in the Spanish Bhagavad Gita, Extracted with Local Grammars Based on Semantic Classes Daniel Stein.........................................................................................88 Dictionaries? Nagwa Younis.............................................................................................................94 Hybrid Approach for Extracting Collocations from Arabic Quran Texts Soyara Zaidi, Ahmed Abdelali, Fatiha Sadat and Mohamed-Tayeb Laskri.....................................102 Reusability of Quranic document using XML Taha Zerrouki, Ammar Balla .108 iv Author Index Abdelali, Ahmed ...........................................102 Alizadeh, Toktam ............................................71 Atwell, Eric ......................................................vi Balla, Amar ...............................................7, 108 Baraty, Ehsan ..................................................38 Bidhendi, Majid Asgari .....................................1 Bijani, Mohammad Habibzadeh .....................71 Brierley, Claire ................................................vi Chelli, Assem ....................................................7 Czeitschner, Ulrike .........................................49 Dastani, Akbar ................................................13 Declerck, Thierry ............................................49 Dinu, Anca ......................................................18 Dinu, Liviu P ...................................................18 Elahimanesh, Mohammad Hossein .................25 Erwin, Harry ...................................................30 Jouzi, Hosein .....................................................1 Juzi, Hossein .............................................13,