We aggregate information from all open source repositories to provide the most of useful open source products, from Enterprise product to small libraries across all platforms. Please search and find what you want.

PADIC

PADIC (Parallel Arabic DIalectal Corpus) is a multi-dialectal corpus built in the framework of the National Research Project “TORJMAN”, led by Scientific and Technical Research Center for the Development of Arabic Language and funded by the Algerian Ministry of Higher Education and Scientific Research.
PADIC is composed of 6 dialects: two Algerian dialects (Algiers and Annaba cities), Palestinian, Syrian, Tunisian, Moroccan) and MSA.

Mourad Abbas
Computational Linguistics Department, crstdla
https://sites.google.com/site/mouradabbas9

Publications
—————–
S. Harrat, K. Meftouh, M. Abbas, K. Smaili. Building Resources for Algerian Arabic Dialects, INTERSPEECH 2014,

S. Harrat, M. Abbas, K. Meftouh, K. Smaili. Diacritics restoration for Arabic dialect texts. INTERSPEECH 2013, France.

TORJMAN website:
————————-
https://sites.google.com/site/torjmanepnr/6-corpus

You’re probably paying too much for cell phone service. Wirefly compares hundreds of plans to help you save. Enter what you need (minutes, data, texts) into Wirefly’s innovative plan comparison tools and see your savings instantly.

Website https://padic.sourceforge.io
Tags
License
Platform
Features
  • XML
  • Buckwalter
  • 5 Arabic dialects + Modern Standard Arabic
  • More than 6000 sentences