Type Alphasyllabary
Languages Gandhari
Time period 4th century BCE – 3rd century CE
Parent systems Proto-Sinaitic alphabet
Sister systems Brāhmī
ISO 15924 ,
Unicode alias
Unicode range U+10A00–U+10A5F
The Kharoṣṭhī script is an ancient Indic script used by the Gandhara culture of ancient Northwest India (primarily modern-day Afghanistan, Pakistan and North India) to write the Gāndhārī language (a dialect of Prakrit) and Sanskrit. An alphasyllabary, it was in use from the middle of the 3rd century BCE until it died out in its homeland around the 3rd century CE. It was also used in Bactria, Gandhara (particularly in the period of the Kushan Empire), Sogdiana (see Issyk kurgan) and along the Silk Road, where there is some evidence that it may have survived until the 7th century in the remote way stations of Khotan and Niya. Kharoṣṭhī has been encoded in the Unicode range U+10A00–U+10A5F since version 4.1.0.


Kharoṣṭhī is mostly written right to left (type A), but some inscriptions (type B) already show the left-to-right direction that was to become universal for the later South Asian scripts.
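
In Unicode, the dominant right-to-left direction is recorded as a character property. The following is a minimal sketch, assuming Python's standard unicodedata module; the helper name direction is hypothetical, the check is simplified to two bidirectional classes, and the Devanagari letter is only an illustrative stand-in for a later left-to-right South Asian script.

```python
import unicodedata

def direction(char_name: str) -> str:
    """Classify a named character as right-to-left or left-to-right.

    Simplification: only the strong classes 'R' and 'AL' count as
    right-to-left; every other bidi class is reported as left-to-right.
    """
    bidi = unicodedata.bidirectional(unicodedata.lookup(char_name))
    return "right-to-left" if bidi in ("R", "AL") else "left-to-right"

print(direction("KHAROSHTHI LETTER KA"))
print(direction("DEVANAGARI LETTER KA"))
```

The property describes the encoded script as a whole, so it does not distinguish type A from type B inscriptions.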

Each syllable includes the short a sound by default, with other vowels being indicated by diacritic marks. Recent epigraphical evidence highlighted by Professor Richard Salomon of the University of Washington has shown that the order of letters in the Kharoṣṭhī script follows what has become known as the Arapacana Alphabet. As preserved in Sanskrit documents the alphabet runs:

a ra pa ca na la da ba ḍa ṣa va ta ya ṣṭa ka sa ma ga stha ja śva dha śa kha kṣa sta jñā rtha (or ha) bha cha sma hva tsa gha ṭha ṇa pha ska ysa śca ṭa ḍha

Some variations in both the number and order of syllables occur in extant texts.

Kharoṣṭhī includes only one standalone vowel character, a, which is used for initial vowels in words. Other initial vowels are written with the a character modified by diacritics. Using epigraphic evidence, Salomon has established that the vowel order is a e i o u, rather than the usual vowel order for Indic scripts, a i u e o. This is the same as the Semitic vowel order. There is also no differentiation between long and short vowels in Kharoṣṭhī: both are marked using the same vowel markers.
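
The way a base character combines with a vowel diacritic can be sketched with the encoded characters. This is a minimal sketch, assuming Python's standard unicodedata module and the Unicode character names for the block; the names VOWEL_SIGNS and syllable are hypothetical, and the dictionary simply follows the a e i o u order described above.

```python
import unicodedata

# Vowel diacritics in the a e i o u order; the inherent vowel "a"
# needs no sign at all.
VOWEL_SIGNS = {
    "a": "",
    "e": unicodedata.lookup("KHAROSHTHI VOWEL SIGN E"),
    "i": unicodedata.lookup("KHAROSHTHI VOWEL SIGN I"),
    "o": unicodedata.lookup("KHAROSHTHI VOWEL SIGN O"),
    "u": unicodedata.lookup("KHAROSHTHI VOWEL SIGN U"),
}

def syllable(base_name: str, vowel: str) -> str:
    """Return a base letter followed by the diacritic for `vowel`."""
    base = unicodedata.lookup(f"KHAROSHTHI LETTER {base_name}")
    return base + VOWEL_SIGNS[vowel]

# Consonant syllables: "ka" is the bare letter, "ki" adds a diacritic.
ka = syllable("KA", "a")
ki = syllable("KA", "i")

# Initial vowels: the single standalone character "a" plus diacritics.
initial_vowels = [syllable("A", v) for v in VOWEL_SIGNS]
print([len(s) for s in initial_vowels])  # [1, 2, 2, 2, 2]
```

Because long and short vowels are not distinguished, the sketch has no separate entries for long vowels.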

The alphabet was used in Gandharan Buddhism as a mnemonic for remembering a series of verses relating to the nature of phenomena. In Tantric Buddhism this list was incorporated into ritual practices, and later became enshrined in mantras.


a i u e o
k kh g gh
c ch j ñ
ṭh ḍh
t th d dh n
p ph b bh m
y r l v
ś s h


Kharoṣṭhī numerals
Signs:  ۱ ۲ ۳ ㄨ ۱ㄨ ۲ㄨ ۳ㄨ ㄨㄨ ۱ㄨㄨ
Values: 1 2 3 4 5 6 7 8 9
Values: 10 20 30 40 50 60 70
Signs:  ʎ۱ ʎ۲
Values: 100 200

Kharoṣṭhī included a set of numerals that are reminiscent of Roman numerals. The symbols were I for the unit, X for four (perhaps representative of four lines or directions), a separate sign for ten (doubled for twenty), and ʎ for the hundreds multiplier. The system is based on additive and multiplicative principles, but does not have the subtractive feature used in the Roman numeral system.[1]
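
The additive and multiplicative principles can be illustrated with a short Python sketch. It rests on several assumptions that the text does not spell out: the function names are hypothetical, signs are listed in reading order with larger values first, the hundreds sign is preceded by its multiplier, tens are built from twenties plus an optional ten, and only 1 to 999 is handled because the way thousands combine is not described here.

```python
def units(n: int) -> list[int]:
    """Additive units: as many fours as fit, then a remainder of 1-3."""
    parts = [4] * (n // 4)
    if n % 4:
        parts.append(n % 4)
    return parts

def tens(t: int) -> list[int]:
    """Additive tens: twenties first, then a single ten if t is odd."""
    parts = [20] * (t // 2)
    if t % 2:
        parts.append(10)
    return parts

def decompose(n: int) -> list[int]:
    """Split n (1-999) into sign values in assumed reading order."""
    if not 1 <= n <= 999:
        raise ValueError("this sketch only covers 1 to 999")
    hundreds, rest = divmod(n, 100)
    parts = []
    if hundreds:
        # Multiplicative step: the multiplier precedes the hundred sign.
        parts += units(hundreds) + [100]
    t, u = divmod(rest, 10)
    return parts + tens(t) + units(u)

def value(parts: list[int]) -> int:
    """Inverse of decompose: add signs, multiplying at each hundred sign."""
    total, pending = 0, 0
    for p in parts:
        if p == 100:
            total, pending = total + pending * 100, 0
        else:
            pending += p
    return total + pending

print(decompose(9))    # [4, 4, 1]
print(decompose(274))  # [2, 100, 20, 20, 20, 10, 4]
```

Unlike Roman numerals, no sign is ever subtracted: 9 is 4 + 4 + 1 rather than "one before ten".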

1 2 3 4 10 20 100 1000

Note that the numeral table reads right-to-left, just like the Kharoṣṭhī abugida itself and the displayed numbers.

The numerals are encoded by Unicode at codepoints U+10A40 to U+10A47:

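U+10A40 𐩀 One
U+10A41 𐩁 Two
U+10A42 𐩂 Three
U+10A43 𐩃 Four
U+10A44 𐩄 Ten
U+10A45 𐩅 Twenty
U+10A46 𐩆 One Hundred
U+10A47 𐩇 One Thousand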


The Kharoṣṭhī script was deciphered by James Prinsep (1799–1840), using the bilingual coins of the Indo-Greeks (obverse in Greek, reverse in Pāli, using the Kharoṣṭhī script). This in turn led to the reading of the Edicts of Ashoka, some of which, from the northwest of the Indian subcontinent, were written in the Kharoṣṭhī script.

Scholars are not in agreement as to whether the Kharoṣṭhī script evolved gradually or was the deliberate work of a single inventor. An analysis of the script forms shows a clear dependency on the Aramaic alphabet, but with extensive modifications to support the sounds found in Indic languages. One model is that the Aramaic script arrived with the Achaemenid conquest of the region of northwest India in 500 BCE and evolved over the next 200+ years to reach its final form by the 3rd century BCE, when it appears in some of the Edicts of Ashoka found in the northwestern part of the Indian subcontinent. However, no intermediate forms have yet been found to confirm this evolutionary model, and rock and coin inscriptions from the 3rd century BCE onward show a unified and standard form.

The study of the Kharoṣṭhī script was recently invigorated by the discovery of the Gandharan Buddhist Texts, a set of birch-bark manuscripts written in Kharoṣṭhī and found near the Afghan city of Hadda, just west of the Khyber Pass in modern Pakistan. The manuscripts were donated to the British Library in 1994. The entire set of manuscripts is dated to the 1st century CE, making them the oldest Buddhist manuscripts yet discovered.


Kharoṣṭhī was added to the Unicode Standard in March 2005 with the release of version 4.1.

The Unicode block for Kharoṣṭhī is U+10A00–U+10A5F:
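         0  1  2  3  4  5  6  7  8  9  A  B  C  D  E  F
U+10A0x  𐨀  𐨁  𐨂  𐨃  -  𐨅  𐨆  -  -  -  -  -  𐨌  𐨍  𐨎  𐨏
U+10A1x  𐨐  𐨑  𐨒  𐨓  -  𐨕  𐨖  𐨗  -  𐨙  𐨚  𐨛  𐨜  𐨝  𐨞  𐨟
U+10A2x  𐨠  𐨡  𐨢  𐨣  𐨤  𐨥  𐨦  𐨧  𐨨  𐨩  𐨪  𐨫  𐨬  𐨭  𐨮  𐨯
U+10A3x  𐨰  𐨱  𐨲  𐨳  -  -  -  -  𐨸  𐨹  𐨺  -  -  -  -  𐨿
U+10A4x  𐩀  𐩁  𐩂  𐩃  𐩄  𐩅  𐩆  𐩇  -  -  -  -  -  -  -  -
U+10A5x  𐩐  𐩑  𐩒  𐩓  𐩔  𐩕  𐩖  𐩗  𐩘  -  -  -  -  -  -  -
A hyphen marks a code point with no character in this chart.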
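
The block boundaries can be checked programmatically. This is a minimal sketch, assuming Python's standard unicodedata module; the helper names are hypothetical, and the number of assigned code points it reports depends on the Unicode version bundled with the interpreter, which may be later than 4.1.

```python
import unicodedata

BLOCK_START, BLOCK_END = 0x10A00, 0x10A5F  # range given in the text

def in_kharoshthi_block(ch: str) -> bool:
    """True if the single character ch lies inside the block range."""
    return BLOCK_START <= ord(ch) <= BLOCK_END

def assigned_code_points() -> list[int]:
    """Code points in the block that carry a character name.

    unicodedata.name() returns the default "" for unassigned code
    points, so those are filtered out.
    """
    return [
        cp for cp in range(BLOCK_START, BLOCK_END + 1)
        if unicodedata.name(chr(cp), "")
    ]

for cp in assigned_code_points()[:4]:
    print(f"U+{cp:04X} {unicodedata.name(chr(cp))}")
```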
