Saturday, July 24, 2010

Tanzil Project From Tanzil قرآن




Tanzil Project

From Tanzil

Jump to:navigation, search

Tanzil is a quranic project launched in early 2007 to produce a highly verified precise quran text to be used in quranic websites and applications. Our mission in the Tanzil project is to produce a standard quran text and serve as a reliable source for this standard text on the web.

Contents

Why Tanzil?

Google search result for aya 68:38, as of January 21, 2008

Since the appearance of the first digital copy of the holy quran, there had been substantial effort to produce an accurate quran text, but due to some difficulties (some of which are listed below), these efforts were unsuccessful in many of the cases, and unfortunately, the quran texts appeared in a majority of quranic websites and applications were suffering from lots of errors and typos.

These erroneous quran texts were so rapidly spread over the Internet that finding the correct form of quranic verses had became almost impossible without referring to a verified printed manuscript. For example, in early 2008, when somebody was searching Google for the verse 38 of Sura Al-Qalam which reads إِنَّ لَكُمْ فِيهِ لَمَا تَخَيَّرُونَ, the first result that he or she was receiving was a wrong form ending with the word يَتَخَيَّرُونَ as shown in the picture to the right.

This terrible situation made us to launch the Tanzil project to produce a highly verified error-free quran text, and make this text available to the quranic websites and applications to prevent further spreading of the erroneous quran texts.

Typos in several existing quran texts

Many of the digital quran texts publicly available on the web suffer from lots of typos, mainly due to the following facts:

Missing Diacritics

A popular quran text which is widely used in websites and applications is based on an old diacritic-less copy of the quran into which diacritics are manually added over time by some volunteers. Unfortunately, the diacritics added to this text are incomplete and have tons of typos.

Text Conversion

Printed copies of the holy quran are usually published in Uthmani script, while electronic copies of the quran are commonly prepared in the simple (Imla'ei) script. The conversion between these two text types has been the source of many inconsistencies and typos in the digital texts.

Technical Difficulties

The common Arabic character sets used in web-pages and applications to encode quran texts lack some of the essential diacritics and symbols needed. For example, the widely used Arabic encoding called "Windows-1256" does not have any code-point for small-alef which is needed to represent a complete quran text.

How precise is Tanzil quran text?

In the Tanzil project, we have integrated several previous works done in a number of quranic projects to obtain a unified quran text, and brought the text to a high level of accuracy by passing it through several manual and automatic verification phases. The lengthy work that leads to the final Tanzil quran text is summarized in the following three steps:

Automatic Text Extraction

At this step, a collection of quran texts from various authentic resources is collected, and a program is developed to convert each text to a canonical format. The canonical texts are then analyzed and carefully compared to extract a base quran text.

Rule-Based Verification

In this step, a program is developed to verify the base quran text against a set of grammatical and recitation rules. The program has been also able to produce quran texts in Imlaei and Uthmani formats, all derived automatically from the base text.

Manual Verification

In this step, the quran texts produced are examined thoroughly against the Medina Mushaf, and a checksum for each letter and diacritic is calculated to verify the accuracy of the texts.

We have extensively used the help of a group of experts including Quran specialists, Hafizes an Aalims in order to assure the correctness and preciseness of the obtained quran texts. The quran text produced is now under a careful versioning system to enable keeping track of updates made to the texts. Since the release of the first version of Tanzil quran text, no typo has been found in the text by millions of the users that have used Tanzil Quran Navigator. The accuracy of the text has been also reconfirmed by several projects that are actively using Tanzil quran text.

Do you need to replace your text?

To see if the quran text you have already obtained from another resource is erroneous, simply do the following checks:

  1. If your text has some of the typos listed here or here, then your text is highly inaccurate. You need to replace your text immediately by one of the Tanzil quran texts.
  2. If your text has one of the typos or missing diacritics listed here, then you need to replace it with Tanzil simple text as soon as possible to remove any deficiencies.
  3. If your text doesn't have any of the typos listed above, but it has no small-alef (e.g., in the words الرَّحْمَن and إِلَه), then you'd better replace your text with Tanzil text to prevent any errors in reading quran.

You can download Tanzil quran texts at the project

Who is using Tanzil?

1 comment: