Persian alphabet
Persian alphabet |
---|
ا ب پ ت ث ج چ ح خ د ذ ر ز ژ س ش ص ض ط ظ ع غ ف ق ک گ ل م ن و ه ی |
Perso Arabic script |
|
The Persian alphabet (Persian: الفبای فارسی, alefbā-ye fârsi), or Perso-Arabic alphabet, is a writing system used for the Persian language.
The Persian script is a modified version of the Arabic script. It is an abjad, meaning vowels are underrepresented in writing. The writing direction is mostly but not exclusively right-to-left; mathematical expressions, numeric dates and numbers bearing units are embedded from left to right. The script is cursive, meaning most letters in a word connect to each other; when they are typed, contemporary word processors automatically join adjacent letterforms. However, some Persian compounds do not join, and Persian adds four letters to the basic set for a total of 32 characters.
The replacement of the Pahlavi scripts with the Persian alphabet to write the Persian language was done by the Tahirid dynasty in 9th-century Greater Khorasan.[1][2]
Contents
1 Letters
1.1 Overview table
1.2 Letters that do not link to a following letter
1.3 Diacritics
1.3.1 Short vowels
1.3.2 Tanvin (nunation)
1.3.3 Tashdid
1.4 Other characters
1.5 Novel letters
1.6 Deviations from the Arabic script
2 Word boundaries
3 See also
4 References
5 External links
Letters
Below are the 32 letters of the modern Persian alphabet. Since the script is cursive, the appearance of a letter changes depending on its position: isolated, initial (joined on the left), medial (joined on both sides) and final (joined on the right) of a word.[3]
The names of the letter are mostly the ones used in Arabic except for the Persian pronunciation. The only ambiguous name is he, which is used for both ح and ه. For clarification, they are often called ḥâ-ye ḥotti or ḥä-ye jimi (literally "jim-like ḥe" after jim, the name for the letter ج that uses the same base form) and hâ-ye havvaz or hâ-ye do-češm (literally "two-eyed he", after the contextual middle letterform ـهـ), respectively.
Overview table
# | Name (in Persian) | Name (transliterated) | DIN 31635 | IPA | Unicode | Contextual forms | |||
---|---|---|---|---|---|---|---|---|---|
Final | Medial | Initial | Isolated | ||||||
0 | همزه | hamzeh[4] | ʾ | [ʔ] | U+0621 | N/A | N/A | N/A | ء |
U+0623 | ـأ | أ | |||||||
U+0626 | ـئ | ـئـ | ئـ | ئ | |||||
U+0624 | ـؤ | ؤ | |||||||
1 | الف | ʾalef | â | [ɒ] | U+0627 | ـا | ا | ||
2 | به | be | b | [b] | U+0628 | ـب | ـبـ | بـ | ب |
3 | په | pe | p | [p] | U+067E | ـپ | ـپـ | پـ | پ |
4 | ته | te | t | [t] | U+062A | ـت | ـتـ | تـ | ت |
5 | ثه | s̱e | s̱ | [s] | U+062B | ـث | ـثـ | ثـ | ث |
6 | جیم | jim | j | [d͡ʒ] | U+062C | ـج | ـجـ | جـ | ج |
7 | چه | che | č | [t͡ʃ] | U+0686 | ـچ | ـچـ | چـ | چ |
8 | حه | ḥe (ḥâ-ye ḥotti, ḥâ-ye jimi) | ḥ | [h] | U+062D | ـح | ـحـ | حـ | ح |
9 | خه | khe | kh | [x] | U+062E | ـخ | ـخـ | خـ | خ |
10 | دال | dâl | d | [d] | U+062F | ـد | د | ||
11 | ذال | ẕâl | ẕ | [z] | U+0630 | ـذ | ذ | ||
12 | ره | re | r | [ɾ] | U+0631 | ـر | ر | ||
13 | زه | ze | z | [z] | U+0632 | ـز | ز | ||
14 | ژه | že | ž | [ʒ] | U+0698 | ـژ | ژ | ||
15 | سین | sin | s | [s] | U+0633 | ـس | ـسـ | سـ | س |
16 | شین | šin | š | [ʃ] | U+0634 | ـش | ـشـ | شـ | ش |
17 | صاد | ṣäd | ṣ | [s] | U+0635 | ـص | ـصـ | صـ | ص |
18 | ضاد | zâd | z | [z] | U+0636 | ـض | ـضـ | ضـ | ض |
19 | طی، طا | tâ, toy (in Dari) | ts | [t] | U+0637 | ـط | ـطـ | طـ | ط |
20 | ظی، ظا | ẓâ, ẓoy (in Dari) | ẓ | [z] | U+0638 | ـظ | ـظـ | ظـ | ظ |
21 | عین | ʿayn | ʿ | [ʔ] | U+0639 | ـع | ـعـ | عـ | ع |
22 | غین | ġayn | ġ | [ɣ] | U+063A | ـغ | ـغـ | غـ | غ |
23 | فه | fe | f | [f] | U+0641 | ـف | ـفـ | فـ | ف |
24 | قاف | q̈âf | q̈ | [ɣ] | U+0642 | ـق | ـقـ | قـ | ق |
25 | کاف | kâf | k | [k] | U+06A9 | ـک | ـکـ | کـ | ک |
26 | گاف | gâf | g | [ɡ] | U+06AF | ـگ | ـگـ | گـ | گ |
27 | لام | lâm | l | [l] | U+0644 | ـل | ـلـ | لـ | ل |
28 | میم | mim | m | [m] | U+0645 | ـم | ـمـ | مـ | م |
29 | نون | nun | n | [n] | U+0646 | ـن | ـنـ | نـ | ن |
30 | واو | vâv | v / ū / ow / (w / aw / ō in Dari) | [v] / [uː] / [o] / [ow] / ([w] / [aw] / [oː] in Dari) | U+0648 | ـو | و | ||
31 | هه | he (hā-ye havvaz, hā-ye do-češm) | h | [h] | U+0647 | ـه | ـهـ | هـ | ه |
32 | یه | ye | y / ī / á / (ay / ē in Dari) | [j] / [i] / [ɒː] / ([aj] / [eː] in Dari) | U+06CC | ـی | ـیـ | یـ | ی |
Letters that do not link to a following letter
Seven letters (و, ژ, ز, ر, ذ, د, ا) do not connect to a following letter, unlike the rest of the letters of the alphabet. The seven letters have the same form in isolated and initial position and a second form in medial and final position. For example, when the letter ا alef is at the beginning of a word such as اینجا injâ ("here"), the same form is used as in an isolated alef. In the case of امروز emruz ("today"), the letter ر re takes the final form and the letter و vâv takes the isolated form, but they are in the middle of the word, and ز also has its isolated form, but it occurs at the end of the word.
Diacritics
Persian script has adopted a subset of Arabic diacritics: zebar /æ/ (fatḥah in Arabic), zir /e/ (kasrah in Arabic), and pish /ou̯/ or /o/ (ḍammah in Arabic, pronounced zamme in Western Persian), tanwīn e nasb /æn/ and shadda (gemination). Other Arabic diacritics may be seen in Arabic loanwords.
Short vowels
Of the four Arabic short vowels, the Persian language has adopted the following three. The last one, sukūn, is not adopted.
Short vowels (fully vocalized text) | Name (in Persian) | Name (transliterated) | Trans. | Value |
---|---|---|---|---|
064E .mw-parser-output .script-arabic,.mw-parser-output .script-Arab{font-family:Scheherazade,Lateef,LateefGR,Amiri,"Noto Naskh Arabic","Droid Arabic Naskh",Harmattan,"Arabic Typesetting","Traditional Arabic","Simplified Arabic","Times New Roman",Arial,"Sakkal Majalla","Microsoft Uighur",Calibri,"Microsoft Sans Serif","Segoe UI",serif,sans-serif;font-weight:normal} ◌َ | زبر (فتحه) | zebar/zibar | a | Ir. /æ/; D. /a/ |
0650 ◌ِ | زیر (کسره) | zer/zir | e | /e/ |
064F ◌ُ | پیش (ضمّه) | pesh/pish | o | /o/ |
In Iranian Persian, none of these short vowels may be the initial or final grapheme in an isolated word, although they may appear in the final position as an inflection, when the word is part of a noun group. In a word that starts with a vowel, the first grapheme is a silent alef which carries the short vowel, e.g. اُمید (omid, meaning "hope"). In a word that ends with a vowel, letters ع, ه and و respectively become the proxy letters for zebar, zir and pish, e.g. نو (no, meaning "new") or بسته (bas-teh, meaning "package").
Tanvin (nunation)
Nunation (fully vocalized text) | Name (in Persian) | Name (transliterated) | Notes |
---|---|---|---|
064B َاً، ـاً، ءً | تنوین نصب | Tanvin e nasb | |
064D ٍِ | تنوین جرّ | Tanvin e jarr | Never used in the Persian language. Taught in Islamic nations to complement Quran education. |
064C ٌ | تنوین رفع | Tanvin e rafe |
Tashdid
Nunation (fully vocalized text) | Name (in Persian) | Name (transliteration) |
---|---|---|
0651 ّ | تشدید | tashdid |
Other characters
The following are not actual letters but different orthographical shapes for letters, a ligature in the case of the lâm alef. As to ﺀ (hamza), it has only one graphic since it is never tied to a preceding or following letter. However, it is sometimes 'seated' on a vâv, ye or alef, and in that case, the seat behaves like an ordinary vâv, ye or alef respectively. Technically, hamza is not a letter but a diacritic.
Name | Pronunciation | IPA | Unicode | Final | Medial | Initial | Stand-alone | Notes |
---|---|---|---|---|---|---|---|---|
alef madde | â | [ɒ] | U+0622 | ـآ | — | آ | آ | The final form is very rare and is freely replaced with ordinary alef. |
he ye | -eye or -eyeh | [eje] | U+06C0 | ـۀ | — | — | ۀ | Validity of this form depends on region and dialect. Some may use the three-letter ـهی combination instead. |
lām alef | lā | [lɒ] | U+0644 (lām) and U+0627 (alef) | ـلا | — | — | لا | |
kashida | U+0640 | — | ـ | — | — | This is the medial character which connects other characters |
Although at first glance, they may seem similar, there are many differences in the way the different languages use the alphabets. For example, similar words are written differently in Persian and Arabic, as they are used differently.
Novel letters
The Persian alphabet adds four letters to the Arabic alphabet: /p/, /ɡ/, /t͡ʃ/ (ch in chair), /ʒ/ (s in measure).
Sound | Shape | Unicode name | Unicode code point |
---|---|---|---|
/p/ | پ | peh | U+067E |
/t͡ʃ/ (ch) | چ | tcheh | U+0686 |
/ʒ/ (zh) | ژ | jeh | U+0698 |
/ɡ/ | گ | gāf | U+06AF |
Deviations from the Arabic script
The shapes of the Persian digits four (۴), five (۵), and six (۶) are different from the shapes used in Arabic and the other numbers have different codepoints.[5]
Name | Persian | Unicode | Arabic | Unicode |
---|---|---|---|---|
0 | ۰ | U+06F0 | ٠ | U+0660 |
1 | ۱ | U+06F1 | ١ | U+0661 |
2 | ۲ | U+06F2 | ٢ | U+0662 |
3 | ۳ | U+06F3 | ٣ | U+0663 |
4 | ۴ | U+06F4 | ٤ | U+0664 |
5 | ۵ | U+06F5 | ٥ | U+0665 |
6 | ۶ | U+06F6 | ٦ | U+0666 |
7 | ۷ | U+06F7 | ٧ | U+0667 |
8 | ۸ | U+06F8 | ٨ | U+0668 |
9 | ۹ | U+06F9 | ٩ | U+0669 |
ye | ی | U+06CC | ي | U+064A |
kāf | ک | U+06A9 | ك | U+0643 |
Word boundaries
Typically, words are separated from each other by a space. Certain morphemes (such as the plural ending '-hâ'), however, are written without a space. On a computer, they are separated from the word using the zero-width non-joiner.
See also
- Scripts used for Persian
- Persian braille
Nastaʿlīq, used to write Persian before the 20th century- Abjad numerals
- Fingilish
References
^ Ira M. Lapidus (2012). Islamic Societies to the Nineteenth Century: A Global History. Cambridge University Press. pp. 256–. ISBN 978-0-521-51441-5..mw-parser-output cite.citation{font-style:inherit}.mw-parser-output .citation q{quotes:"""""""'""'"}.mw-parser-output .citation .cs1-lock-free a{background:url("//upload.wikimedia.org/wikipedia/commons/thumb/6/65/Lock-green.svg/9px-Lock-green.svg.png")no-repeat;background-position:right .1em center}.mw-parser-output .citation .cs1-lock-limited a,.mw-parser-output .citation .cs1-lock-registration a{background:url("//upload.wikimedia.org/wikipedia/commons/thumb/d/d6/Lock-gray-alt-2.svg/9px-Lock-gray-alt-2.svg.png")no-repeat;background-position:right .1em center}.mw-parser-output .citation .cs1-lock-subscription a{background:url("//upload.wikimedia.org/wikipedia/commons/thumb/a/aa/Lock-red-alt-2.svg/9px-Lock-red-alt-2.svg.png")no-repeat;background-position:right .1em center}.mw-parser-output .cs1-subscription,.mw-parser-output .cs1-registration{color:#555}.mw-parser-output .cs1-subscription span,.mw-parser-output .cs1-registration span{border-bottom:1px dotted;cursor:help}.mw-parser-output .cs1-ws-icon a{background:url("//upload.wikimedia.org/wikipedia/commons/thumb/4/4c/Wikisource-logo.svg/12px-Wikisource-logo.svg.png")no-repeat;background-position:right .1em center}.mw-parser-output code.cs1-code{color:inherit;background:inherit;border:inherit;padding:inherit}.mw-parser-output .cs1-hidden-error{display:none;font-size:100%}.mw-parser-output .cs1-visible-error{font-size:100%}.mw-parser-output .cs1-maint{display:none;color:#33aa33;margin-left:0.3em}.mw-parser-output .cs1-subscription,.mw-parser-output .cs1-registration,.mw-parser-output .cs1-format{font-size:95%}.mw-parser-output .cs1-kern-left,.mw-parser-output .cs1-kern-wl-left{padding-left:0.2em}.mw-parser-output .cs1-kern-right,.mw-parser-output .cs1-kern-wl-right{padding-right:0.2em}
^ Ira M. Lapidus (2002). A History of Islamic Societies. Cambridge University Press. pp. 127–. ISBN 978-0-521-77933-3.
^ "ویژگىهاى خطّ فارسى". Academy of Persian Language and Literature.
^ "??" (PDF). Persianacademy.ir. Retrieved 2015-09-05.
^ "Unicode Characters in the 'Number, Decimal Digit' Category".
External links
Wikimedia Commons has media related to Persian alphabet. |
Dastoor e khat - The Official document in Persian by Academy of Persian Language and Literature