-
Notifications
You must be signed in to change notification settings - Fork 29
Future Work Ideas
karimouda edited this page Feb 6, 2016
·
6 revisions
Some ideas for researchers and developers, some can be based on QA
- Quranic Wordnet
- Generation of full sentiment analysis corpus for the Quran
- Quranic Language Models (Helps in auto completion and visualization)
- Corpus of all derivations of Quranic words
- There is a need for a corpus for "All roots of Arabic words" and "All derivations of Arabic words" to help in query enrichment
- Better Question Answering performance
- Ontolology Enrichment from Wikipedia
- Google-like auto-complete suggestion functionality for search
- Enrich user queries using wordnet synsets
- Group words by lemma in “Word Frequency” page and all word clouds in the website
- Use Arabic PoS tagger to tag Arabic queries
- Quran memorization tool: a tool to help people trying to memorize the Quran by showing related verses and variations of the same word in different locations in the Quran
- Verse PoS Tagger: the user will enter verse text and get it PoS-tagged using QAC annotations
- Statistics about PoS tags distribution in the Quran
- Word-by-word translation and transliteration mapping page
- Verse similarity (similar to the one in TextMiningTheQuran.com) + Bipartite graph between similar verses and chapters
- Showing and searching verses by PoS Particles (http://corpus.quran.com/documentation/tagset.jsp) such as INTG or VOC where you can groups verses by dialogue types (ex: list all orders of Allah - the DO's and DON'Ts)
- Soundex for the Quran (can help in query spelling suggestion)
- Show context for each word in different locations (ex: حظ can mean "fortune" or "share" depending on the verse), this can be done using the following corpora: Word-by-word translation, tafseer and/or general verse translation
- Search, show or group verses by dialogues (ex: show all verses that are dialogues between Allah and a specific prophet)
- Experimenting the result of using Machine Learning & Topic Modelling (from Hadith text) for ontology extraction and enrichment
- A tools/algorithm to convert verses into Flowcharts, Info-graphics and Trees where applicable (ex: inheritance verses)
- Adding more charts in the charts section (ex: revelation order + animated time-series - if applicable)
- Creating a corpus by resolving references in the Quran that are not pronouns (ex: أُو۟لَٰٓئِكَ/ Those, also "they" and "who" )
- Search by words with exact tashkeel "ٱلْجِنَّةِ" VS ٱلْجَنَّةِ"
More details can be found in section 9.2 Future Work in the Thesis