L2/15-234R
2015-11-04

Proposal to encode the Dogra script in Unicode

Anshuman Pandey
Department of Linguistics
University of California, Berkeley
Berkeley, California, U.S.A.

anshumanpandeyberkeleyedu

November 4, 2015

1 Introduction

This is a proposal to encode the Dogra script in Unicode. Dogra was briefly described in “Proposal to Encode the Takri Script in ISO/IEC 10646” (L2/09-424) as one of several scripts to be unified in the ‘Takri’ block. The block was intended for representing several related scripts of Himachal Pradesh, Jammu, Kashmir, and Panjab by means of a generic character repertoire until additional research on individual scripts could be conducted. As described in L2/09-424, the representative glyphs for characters in the ‘Takri’ block are based upon the standard, printed form of the Chamba or Chambeali form of Takri. The Chamba form was selected because materials in the script were readily available. Although Dogra has its own standardized form, it was stated in L2/09-424 that there were insufficient materials available for developing a formal encoding. Dogra was, therefore, grouped with ‘Takri’ on the basis of historical typology and as a matter of practicality. My recent research upon various ‘Takri’ scripts has yielded additional sources and information that support the encoding of Dogra as an independent script,¹ especially because of its official status in the former State of Jammu and Kashmir and on account of the differences between it and the standard Chamba script (see tables 2 and 3). A step towards encoding Dogra was taken through the submission of a preliminary proposal (L2/15-213). The present document supersedes L2/15-213 and provides additional information and a number of new sources that support the encoding of Dogra in Unicode.

2 Background

The Dogra script is historically associated with the Dogri language (ISO 639-3: doi), but it is not actively used at present. In 1916, George A. Grierson wrote in the Linguistic Survey of India that “Ḍōgrā has an alphabet of its own, which is allied to the Ṭakrī alphabet current in the Punjab Himalayas” (1916: 638). Although it was used mostly for informal communication and commercial activities, Dogra was standardized in the 1860s. Grierson noted that “[s]ome thirty or forty years ago the then Mahārājā of Jammu and Kashmir caused to be invented a modified form of the current Ṭakrī so as to bring it more into line with Dēvanāgarī and Gurmukhī” and “[t]his improved Ḍōgrī is used for official documents, but it has not generally displaced the old Ṭakrī form of script” (ibid.). This official form of Dogra is known as ‘Name Dogra Akkhar’ or the ‘New Dogra Script’. It was the official script of the State of Jammu and Kashmir during the reign of Maharaja Ranbir Singh (r. 1857–1885), who is the ruler referred to by Grierson. The New Dogra script was used for administrative purposes and upon currency, judicial and non-judicial stamp papers, postcards, and postage stamps. It was also used for literary activities. The first book published in the standard Dogra was a translation of a Sanskrit mathematical treatise by Bhāskarācārya titled Līlavatī. The work was commissioned by Ranbir Singh and printed in 1872 at Vidya Vilas Press, the first printing press in Jammu.

¹ The Jaunsari or Sirmauri script was also briefly described in L2/09-424 and unified with the Takri block. It is another potential candidate for separate encoding in Unicode.

The old and standard forms of Dogra differ primarily in the shapes of letterforms and the representation of consonant clusters. Specific differences are described throughout this proposal. As mentioned by Grierson, standard Dogra was influenced by Devanagari and Gurmukhi. The motivation behind such a move is not clear, but it could be related to the idea of providing the Dogra script with an air of ‘authority’ by giving it a form that resembled scripts which were already endowed with ‘official’ and ‘literary’ attributes. The standardization of Dogra in the 1860s established two distinctive forms of the script. Throughout this document, the common written form is called ‘Old Dogra’ and the standard form is referred to as ‘New Dogra’.

The Dogra script is no longer used actively, and the Dogri language is now generally written in Devanagari. Nonetheless, the script continues to attract the interest of philatelists who collect ephemera from Jammu and Kashmir. Although there is very little scholarship on the script, it was described briefly by the prominent Sanskrit scholar Fritz Staal in his Stamps of Jammu & Kashmir (New York: The Collectors Club, 1984). Information on Dogra is also curated on a philately website called “Collecting Kashmir”, maintained by Carol von der Lin.

3 Proposed Encoding

3.1 Approach

The justification for encoding Dogra as a separate script in Unicode is based upon the following criteria: 1) graphical differences between Dogra, Chamba, and other forms of Takri; 2) usage of the script in an official capacity; 3) cultural and political recognition of it as a distinctive script; 4) differentiation into ‘old’ and ‘standard’ forms. Of these, the fourth point requires a means for handling the internal distinctions of the script. A separate block for Dogra would provide a means to maintain the script’s external identity from other related scripts and to also maintain the identity of styles internally.

3.2 Structure

The Dogra script is a Brahmi-based alphasyllabary that is written from left to right. Independent and initial vowels are written using letters, while dependent vowels are expressed using combining signs. Consonant letters possess the inherent vowel a, which is changed by the attachment of a vowel sign. Consonant clusters are written in various ways, such as with visible halanta, as an atomic ligature, and with half-forms; repha is attested, but rarely used. There are no formal conventions regarding representation of conjuncts.

3.3 Script name

The designation ‘Dogra’ has been assigned to the script block in Unicode. The script is referred to as both ‘Dogra’ and ‘Dogri’ in English secondary sources. The name ‘Dogra’ is preferred over ‘Dogri’ because it links the script with the Dogra dynasty of Jammu and Kashmir, under whose government it was established as the official script. Secondly, ‘Dogri’ suggests a connection with the language of the same name; however, the language is no longer written in the script. The script was officially referred to as dvigarta akṣara or “Dogra letters”, where ‘Dvigarta’ is presented as the Sanskritic root of the name ‘Dogra’. Another indigenous name for the script is name dogra akkhar or “New Dogra letters”.

3.4 Representative glyphs

The representative glyphs for Dogra characters are based primarily upon forms used in New Dogra. Characters from Old Dogra have been used when necessary, sourced primarily from figures 1 and 2. The font has been designed by the proposal author.

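3.5 Vowel letters

There are 10 vowel letters:

U+11800 DOGRA LETTER A
U+11801 DOGRA LETTER AA
U+11802 DOGRA LETTER I
U+11803 DOGRA LETTER II
U+11804 DOGRA LETTER U
U+11805 DOGRA LETTER UU
U+11806 DOGRA LETTER E
U+11807 DOGRA LETTER AI
U+11808 DOGRA LETTER O
U+11809 DOGRA LETTER AU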

In Old Dogra the values of the vowel letters U+11803, U+11802, and [glyph] are variable. As shown in the accompanying sources, the letter U+11803 was used for both i and ī; U+11802 for u and ū; and [glyph] for u, ū, o, and au. In New Dogra the values of these three letters became fixed, and new letters were introduced in order to provide distinctive representations for all basic vowels. It is apparent that the attempt to stabilize the vowel repertoire resulted in overlap and reassignment of several letters. The Old Dogra U+11803 (i, ī) was retained solely for ī, and U+11802 (u, ū) was reassigned for i. A modified form of [glyph] (u, o) came to represent u, while the form U+11805 was introduced for ū, which is certainly an adaptation of the letter ū from Devanagari or Sharada. Distinct letters for U+11808 o and U+11809 au were provided by placing the corresponding dependent vowel signs upon the modified u.

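Old        New
U+11803    U+11802 DOGRA LETTER I
(none)     U+11803 DOGRA LETTER II
U+11802    U+11804 DOGRA LETTER U
(none)     U+11805 DOGRA LETTER UU
[glyph]    U+11808 DOGRA LETTER O
[glyph]    U+11809 DOGRA LETTER AU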

The proposed vowel letters are based upon the repertoire of New Dogra. This approach completely accommodates vowels in the old and new forms of the script. Although the names of New Dogra vowel letters do not strictly correlate with values of Old Dogra vowels, the proposed letters can be used for representing the old script. The U+11802 DOGRA LETTER I can be used for u, U+11803 DOGRA LETTER II can be used for i, and U+11804 DOGRA LETTER U can be used for o and au; the height and curvature of the left stroke may be considered a stylistic attribute.

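3.6 Vowel Signs

There are 10 dependent vowel signs:

U+1182C DOGRA VOWEL SIGN AA
U+1182D DOGRA VOWEL SIGN I
U+1182E DOGRA VOWEL SIGN II
U+1182F DOGRA VOWEL SIGN U
U+11830 DOGRA VOWEL SIGN UU
U+11831 DOGRA VOWEL SIGN VOCALIC R
U+11832 DOGRA VOWEL SIGN E
U+11833 DOGRA VOWEL SIGN AI
U+11834 DOGRA VOWEL SIGN O
U+11835 DOGRA VOWEL SIGN AU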

The U+11831 DOGRA VOWEL SIGN VOCALIC R appears to have been introduced as part of New Dogra. It is used for representing words of Sanskrit origin that contain the vocalic sound r. Its shape is derived from either U+11190 or U+0943. There is no corresponding independent vowel letter.

There is an alphabetic aspect to the representation of consonant-vowel syllables in written Dogra. The dependent vowel may be represented using the independent vowel letter as well as the vowel sign, e.g. <U+1180A DOGRA LETTER KA, U+11806 DOGRA LETTER E> may be used in place of <U+1180A DOGRA LETTER KA, U+11832 DOGRA VOWEL SIGN E> for the syllable ke.

There are differences in the shape of certain vowel letters between the old and new script. The typical Takri forms of the vowel signs -u and -ū, which also resemble corresponding forms in Gurmukhi, were replaced with forms resembling Devanagari signs, U+1182F and U+11830 respectively. The typical Takri style of the vowel sign o was applied for writing U+11835 au, while a simplified form of o, U+11834, was used for o. The duplication of o as [glyph] for writing au was eliminated. A comparison of vowel letters in the old and new scripts, as they align with the proposed characters, is as follows:

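Old        New
[glyph]    U+1182D DOGRA VOWEL SIGN I
[glyph]    U+1182E DOGRA VOWEL SIGN II
[glyph]    U+1182F DOGRA VOWEL SIGN U
[glyph]    U+11830 DOGRA VOWEL SIGN UU
[glyph]    U+11834 DOGRA VOWEL SIGN O
[glyph]    U+11835 DOGRA VOWEL SIGN AU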

3.7 Consonants

There are 34 consonant letters, each named DOGRA LETTER followed by the value shown:

U+1180A–U+1180E    KA, KHA, GA, GHA, NGA
U+1180F–U+11813    CA, CHA, JA, JHA, NYA
U+11814–U+11818    TTA, TTHA, DDA, DDHA, NNA
U+11819–U+1181D    TA, THA, DA, DHA, NA
U+1181E–U+11822    PA, PHA, BA, BHA, MA
U+11823–U+11826    YA, RA, LA, VA
U+11827–U+1182A    SHA, SSA, SA, HA
U+1182B            RRA

The U+11828 DOGRA LETTER SSA was introduced into New Dogra for representing the retroflex sibilant ṣa, which occurs in words of Sanskrit origin. It resembles U+111B0 SHARADA LETTER SSA.

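There are glyphic variants of several consonant letters:

Regular                         Variant
U+1180C DOGRA LETTER GA         [glyph]
U+1180D DOGRA LETTER GHA        [glyph]
U+11810 DOGRA LETTER CHA        [glyph]
U+11812 DOGRA LETTER JHA        [glyph]
U+1181A DOGRA LETTER THA        [glyph]
U+1181C DOGRA LETTER DHA        [glyph]
U+11824 DOGRA LETTER RA         [glyph]

There is evidence of usage of different forms of the same letter in the available sources. For instance, U+11810 DOGRA LETTER CHA and its variant form [glyph] are both used for the same letter.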

3.8 Anusvara

The U+11836 DOGRA SIGN ANUSVARA is used for marking nasalization.

3.9 Visarga

The U+11837 DOGRA SIGN VISARGA is used for indicating post-vocalic aspiration in words of Sanskrit origin. It is shown below in the word <U+1181D, U+11822, U+11837> namaḥ “homage to”.

3.10 Nukta

The U+11839 DOGRA SIGN NUKTA is used for representing sounds that are not native to Dogri and related languages.

3.11 Virama

The U+11838 DOGRA SIGN VIRAMA has two functions, similar to the corresponding character of Devanagari. It is used as a halanta for marking the absence of the inherent vowel of a consonant letter. It is also a control character that is used for producing conjuncts.

3.12 Consonant Conjuncts

Consonant clusters are rendered in three ways: 1) with visible halanta beneath the full form of each bare consonant; 2) with a half-form of the initial consonant followed by the full form of the following letter; and 3) as ligatures. There are no formal conventions for the written expression of conjuncts. The U+11838 DOGRA SIGN VIRAMA is used for the encoded representation of conjuncts.

3.12.1 Types of conjuncts

Some representations of Dogra conjuncts, as found in the available sources, are illustrated below.

• Visible halanta: In both Old and New Dogra, the most common method of representing consonant clusters is by placing a halanta beneath each bare consonant.

• Half-forms: Another method is to use half-forms of letters, if the graphical structure provides such an opportunity.

The above shows the conjunct śra in the word śrī (a Sanskrit honorific). The half-form is produced by dropping the right vertical bar of U+11827 DOGRA LETTER SHA. This conjunct is used consistently in New Dogra for representing the cluster śra; the form <U+11827, U+11838, U+11824> with visible halanta beneath a full form of U+11827 does not occur in the available sources.

• Ligatures: There are two types of ligature. In the first, the shapes of individual letters are visible. In the excerpt below, the boxed text contains the conjunct sṭa from the word sṭāṃp, a transcription of the English “stamp”.

The second type is an atomic (akhaṇḍa) ligature, whose form may be considered a distinctive letter. Such a ligature occurs in the excerpt below.

The boxed text above contains the atomic ligature kṣa, which occurs in the word akṣaroṃ “in letters” (oblique form of akṣara “letter”). The conjunct kṣa is used consistently in New Dogra for representing the cluster kṣa. Its shape may be influenced by Devanagari kṣa.

• Forms of RA: Special conjunct forms of U+11824 DOGRA LETTER RA, e.g. repha or vattu, do not generally occur. Cluster-initial RA is represented as a full form with halanta, as in rṇa in the word saṃpūrṇa below, and non-initial RA is treated similarly, as in pra in the word prabhu below.

However, there is one instance in the available sources in which cluster-initial RA is represented as a repha. The occurrence is curious because the word caturbhuja “square”, in which repha occurs, appears three times in proximity but is spelled differently in all three instances.

In the boxed text at the top, the RA in caturbhuja is written with halanta: <U+1180F, U+11819, U+1182F, U+11824, U+11838, U+11821, U+1182F, U+11811> caturbhuja. In the text at bottom left, RA occurs with its inherent vowel intact, without halanta: <U+1180F, U+11819, U+1182F, U+11824, U+11821, U+1182F, U+11811> caturabhuja. At bottom right, RA is represented as repha in caturbhuja.

The usage of repha in Dogra is to be considered anomalous. There is no present need to support this form of RA in encoded representation.

3.12.2 Encoded representation of conjuncts

The Devanagari model for the representation of conjuncts is appropriate for Dogra. In encoded text, conjuncts are produced by placing the control character U+11838 DOGRA SIGN VIRAMA after each non-final consonant in a cluster, i.e. <(C, VIRAMA), C>.

Representations of conjuncts rendered as atomic ligatures are:

kṣa    <U+1180A KA, U+11838 VIRAMA, U+11828 SSA>
sṭa    <U+11829 SA, U+11838 VIRAMA, U+11814 TTA>

Examples of conjuncts rendered using half-forms are:

śra    <U+11827 SHA, U+11838 VIRAMA, U+11824 RA>

Conjuncts displayed using visible halanta are:

pra    <U+1181E PA, U+11838 VIRAMA, U+11824 RA>
rṇa    <U+11824 RA, U+11838 VIRAMA, U+11818 NNA>
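The encoding rule for conjuncts can be sketched in Python as follows. This is an illustration only and not part of the proposal: the helper names `encode_conjunct` and `show` are hypothetical, and the code points are those proposed in this document, which may differ from the values in the published standard.

```python
# Code points as proposed in this document (assumption: the published
# standard may assign different values).
VIRAMA = 0x11838
KA, TTA, NNA, PA = 0x1180A, 0x11814, 0x11818, 0x1181E
RA, SHA, SSA, SA = 0x11824, 0x11827, 0x11828, 0x11829


def encode_conjunct(consonants):
    """Build <(C, VIRAMA), ..., C>: a virama after every non-final consonant."""
    if not consonants:
        raise ValueError("a cluster needs at least one consonant")
    sequence = []
    for consonant in consonants[:-1]:
        sequence.extend([consonant, VIRAMA])
    sequence.append(consonants[-1])
    return sequence


def show(sequence):
    """Format a code point sequence in U+XXXX notation."""
    return " ".join(f"U+{cp:04X}" for cp in sequence)


if __name__ == "__main__":
    clusters = [("ksa", [KA, SSA]), ("sta", [SA, TTA]), ("sra", [SHA, RA]),
                ("pra", [PA, RA]), ("rna", [RA, NNA])]
    for label, cluster in clusters:
        print(label, show(encode_conjunct(cluster)))
```

Whether a given sequence is displayed as a ligature, with a half-form, or with visible halanta is decided by the font, so the function returns the same kind of sequence for all three renderings.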

3.12.3 Modifying conjunct formation

As is the case in Devanagari and other Indic scripts, a ligature may be broken using the generic control character U+200C ZERO WIDTH NON-JOINER (ZWNJ):

kṣa (ligature)           <U+1180A KA, U+11838 VIRAMA, U+11828 SSA>
kṣa (visible halanta)    <U+1180A KA, U+11838 VIRAMA, U+200C ZWNJ, U+11828 SSA>

The rendering of a conjunct as a ligature or with visible virama is a matter of font design.
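A minimal sketch of this mechanism in Python is given here. The function name `break_ligatures` is hypothetical, and the virama code point is the one proposed in this document, so it is an assumption with respect to the published standard.

```python
ZWNJ = 0x200C     # ZERO WIDTH NON-JOINER
VIRAMA = 0x11838  # proposed code point (assumption; may differ when published)


def break_ligatures(sequence):
    """Insert ZWNJ after each virama that is followed by another character.

    A virama at the end of the sequence is left alone, and a virama that
    is already followed by ZWNJ is not given a second one.
    """
    result = []
    for index, cp in enumerate(sequence):
        result.append(cp)
        has_next = index + 1 < len(sequence)
        if cp == VIRAMA and has_next and sequence[index + 1] != ZWNJ:
            result.append(ZWNJ)
    return result


if __name__ == "__main__":
    ksa = [0x1180A, VIRAMA, 0x11828]
    print(" ".join(f"U+{cp:04X}" for cp in break_ligatures(ksa)))
```

Applying the function twice gives the same result as applying it once, so it is safe to run over text that already contains some non-joiners.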

3.13 Punctuation

Abbreviation sign: The U+1183A DOGRA ABBREVIATION SIGN is used for indicating truncation of text. In the excerpt below, the highlighted text <U+11825, U+1182E, U+1183A> <U+11826, U+1183A> <U+11821, U+1182C, U+1183A> (lī va bhā 23) is an abbreviation for līla vatī bhāga 23.

Danda-s: The daṇḍā and double daṇḍā are commonly used in Dogra documents for marking the ends of sentences and paragraphs. The excerpt below shows usage of the daṇḍā, and the following shows use of the double daṇḍā.

These daṇḍā-s are not proposed for encoding as separate characters in the ‘Dogra’ block. They resemble U+0964 DEVANAGARI DANDA and U+0965 DEVANAGARI DOUBLE DANDA and may be represented using these characters. The Devanagari daṇḍā-s are to be specified as extensions for Dogra.

Other signs: Other marks of punctuation, such as dashes, are also used. A hyphen is used in some cases for indicating breaks in a word at end of line (see section 3.16).

3.14 Digits

Digits are commonly used in Dogra documents (see figure 11). The style of digits varies across written and printed sources. Some digits in Old Dogra sources resemble those of Takri (see figure 4), while those in New Dogra sources are similar to Devanagari. The excerpt below shows digits 1–9 and the number 10 as represented in the Līlavatī, written in New Dogra.

The majority of the digits shown here resemble Devanagari forms; the exception is ‘four’. The form of Dogra ‘four’ does not resemble U+116C4 TAKRI DIGIT FOUR, which is nearly identical to the Devanagari digit four. Information about its origins is not known at this time, but it may be related to U+111D4 SHARADA DIGIT FOUR. The forms of ‘eight’ and ‘nine’ are well-known variant forms of the regular Devanagari digits.

Additional research into Dogra digits is required before they may be considered for encoding. The available sources suggest that there may have been a cultural understanding that Dogra possessed unique digits. For instance, on the bank notes shown in figure 20, the digits written in Devanagari are rather similar, but are nonetheless displayed twice in order to align the display of the currency value with each script used for the inscriptions.

At present, script-specific digits are not proposed for inclusion in the Dogra repertoire. Devanagari digits should be used normatively and should be extended for usage with Dogra. If future research yields distinctive digits for Dogra, there is sufficient space in the block to accommodate their encoding.

3.15 Number forms

Fraction signs and currency marks are commonly used in Dogra sources (see figure 12). These may be represented using the fraction and currency signs already encoded in the ‘Common Indic Number Forms’ block (U+A830). These characters are specified as extensions for Dogra. The ‘Common Indic Number Forms’ are described briefly in The Unicode Standard (chapter 22.3, ‘Numerals’, pp. 770–771) and in more detail in the original encoding proposal (L2/07-354).

3.16 Linebreaking

Linebreaking occurs after an orthographic syllable. Breaks are generally not indicated in informal sources, but are marked in literary materials using hyphens.
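The notion of breaking after an orthographic syllable can be approximated with a short Python sketch. The segmentation rule used here is an assumption made for illustration and is not specified by this proposal: a new syllable starts at any vowel or consonant letter unless the preceding character is the virama. The function name `split_syllables` is hypothetical and the code points are those proposed in this document.

```python
LETTERS = range(0x11800, 0x1182B + 1)  # proposed vowel and consonant letters
VIRAMA = 0x11838                       # proposed virama (assumption)


def split_syllables(text):
    """Split text into approximate orthographic syllables.

    A letter opens a new syllable unless it directly follows a virama,
    in which case it continues the conjunct. Every other character
    (vowel signs, anusvara, visarga, nukta, virama) attaches to the
    syllable before it.
    """
    syllables = []
    previous = None
    for ch in text:
        cp = ord(ch)
        starts_new = cp in LETTERS and previous != VIRAMA
        if starts_new or not syllables:
            syllables.append(ch)
        else:
            syllables[-1] += ch
        previous = cp
    return syllables


if __name__ == "__main__":
    namah = chr(0x1181D) + chr(0x11822) + chr(0x11837)  # NA, MA, VISARGA
    print(len(split_syllables(namah)))
```

Line break opportunities would fall between the returned elements. The sketch does not handle a non-joiner placed after a virama, nor the alphabetic spellings in which an independent vowel letter follows a consonant.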

3.17 Collation

The primary collating order for Dogra is as follows:

U+11800 < U+11801 < U+11802 < U+11803 < U+11804 < U+11805 < U+11806 < U+11807 < U+11808 < U+11809 <
U+1180A < U+1180B < U+1180C < U+1180D < U+1180E < U+1180F < U+11810 < U+11811 <
U+11812 < U+11813 < U+11814 < U+11815 < U+11816 < U+11817 < U+11818 < U+11819 <
U+1181A < U+1181B < U+1181C < U+1181D < U+1181E < U+1181F < U+11820 < U+11821 <
U+11822 < U+11823 < U+11824 < U+11825 < U+11826 < U+11827 < U+11828 < U+11829 < U+1182A <
U+1182B < U+1182C < U+1182D < U+1182E <
U+1182F < U+11830 < U+11831 < U+11832 <
U+11833 < U+11834 < U+11835

The following characters have secondary weights: U+11836 DOGRA SIGN ANUSVARA, U+11837 DOGRA SIGN VISARGA, U+11838 DOGRA SIGN VIRAMA, U+11839 DOGRA SIGN NUKTA.
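A simplified sort key that follows this order can be written in Python. The sketch rests on two assumptions that go beyond the proposal: the primary order coincides with code point order (which holds for the list above), and ties at the primary level are broken by the full code point sequence. It is not an implementation of the Unicode Collation Algorithm, and the name `sort_key` is hypothetical.

```python
# Characters that the proposal assigns secondary weights
# (code points as proposed in this document).
SECONDARY_MARKS = {0x11836, 0x11837, 0x11838, 0x11839}


def sort_key(word):
    """Return (primary, full) so that secondary marks only break ties."""
    code_points = tuple(ord(ch) for ch in word)
    primary = tuple(cp for cp in code_points if cp not in SECONDARY_MARKS)
    return (primary, code_points)


if __name__ == "__main__":
    ka, kha, anusvara = chr(0x1180A), chr(0x1180B), chr(0x11836)
    words = [ka + kha, ka + anusvara + ka]
    ordered = sorted(words, key=sort_key)
    print([" ".join(f"U+{ord(c):04X}" for c in w) for w in ordered])
```

With this key, a word spelled KA, ANUSVARA, KA sorts before KA, KHA because the anusvara is ignored at the primary level, whereas a plain code point comparison would place it after.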

4 Character data

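4.1 Character properties

The properties for Dogra in the Unicode Character Database format are:

11800;DOGRA LETTER A;Lo;0;L;;;;;N;;;;;
11801;DOGRA LETTER AA;Lo;0;L;;;;;N;;;;;
11802;DOGRA LETTER I;Lo;0;L;;;;;N;;;;;
11803;DOGRA LETTER II;Lo;0;L;;;;;N;;;;;
11804;DOGRA LETTER U;Lo;0;L;;;;;N;;;;;
11805;DOGRA LETTER UU;Lo;0;L;;;;;N;;;;;
11806;DOGRA LETTER E;Lo;0;L;;;;;N;;;;;
11807;DOGRA LETTER AI;Lo;0;L;;;;;N;;;;;
11808;DOGRA LETTER O;Lo;0;L;;;;;N;;;;;
11809;DOGRA LETTER AU;Lo;0;L;;;;;N;;;;;
1180A;DOGRA LETTER KA;Lo;0;L;;;;;N;;;;;
1180B;DOGRA LETTER KHA;Lo;0;L;;;;;N;;;;;
1180C;DOGRA LETTER GA;Lo;0;L;;;;;N;;;;;
1180D;DOGRA LETTER GHA;Lo;0;L;;;;;N;;;;;
1180E;DOGRA LETTER NGA;Lo;0;L;;;;;N;;;;;
1180F;DOGRA LETTER CA;Lo;0;L;;;;;N;;;;;
11810;DOGRA LETTER CHA;Lo;0;L;;;;;N;;;;;
11811;DOGRA LETTER JA;Lo;0;L;;;;;N;;;;;
11812;DOGRA LETTER JHA;Lo;0;L;;;;;N;;;;;
11813;DOGRA LETTER NYA;Lo;0;L;;;;;N;;;;;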

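11814;DOGRA LETTER TTA;Lo;0;L;;;;;N;;;;;
11815;DOGRA LETTER TTHA;Lo;0;L;;;;;N;;;;;
11816;DOGRA LETTER DDA;Lo;0;L;;;;;N;;;;;
11817;DOGRA LETTER DDHA;Lo;0;L;;;;;N;;;;;
11818;DOGRA LETTER NNA;Lo;0;L;;;;;N;;;;;
11819;DOGRA LETTER TA;Lo;0;L;;;;;N;;;;;
1181A;DOGRA LETTER THA;Lo;0;L;;;;;N;;;;;
1181B;DOGRA LETTER DA;Lo;0;L;;;;;N;;;;;
1181C;DOGRA LETTER DHA;Lo;0;L;;;;;N;;;;;
1181D;DOGRA LETTER NA;Lo;0;L;;;;;N;;;;;
1181E;DOGRA LETTER PA;Lo;0;L;;;;;N;;;;;
1181F;DOGRA LETTER PHA;Lo;0;L;;;;;N;;;;;
11820;DOGRA LETTER BA;Lo;0;L;;;;;N;;;;;
11821;DOGRA LETTER BHA;Lo;0;L;;;;;N;;;;;
11822;DOGRA LETTER MA;Lo;0;L;;;;;N;;;;;
11823;DOGRA LETTER YA;Lo;0;L;;;;;N;;;;;
11824;DOGRA LETTER RA;Lo;0;L;;;;;N;;;;;
11825;DOGRA LETTER LA;Lo;0;L;;;;;N;;;;;
11826;DOGRA LETTER VA;Lo;0;L;;;;;N;;;;;
11827;DOGRA LETTER SHA;Lo;0;L;;;;;N;;;;;
11828;DOGRA LETTER SSA;Lo;0;L;;;;;N;;;;;
11829;DOGRA LETTER SA;Lo;0;L;;;;;N;;;;;
1182A;DOGRA LETTER HA;Lo;0;L;;;;;N;;;;;
1182B;DOGRA LETTER RRA;Lo;0;L;;;;;N;;;;;
1182C;DOGRA VOWEL SIGN AA;Mc;0;L;;;;;N;;;;;
1182D;DOGRA VOWEL SIGN I;Mc;0;L;;;;;N;;;;;
1182E;DOGRA VOWEL SIGN II;Mc;0;L;;;;;N;;;;;
1182F;DOGRA VOWEL SIGN U;Mn;0;NSM;;;;;N;;;;;
11830;DOGRA VOWEL SIGN UU;Mn;0;NSM;;;;;N;;;;;
11831;DOGRA VOWEL SIGN VOCALIC R;Mn;0;NSM;;;;;N;;;;;
11832;DOGRA VOWEL SIGN E;Mn;0;NSM;;;;;N;;;;;
11833;DOGRA VOWEL SIGN AI;Mn;0;NSM;;;;;N;;;;;
11834;DOGRA VOWEL SIGN O;Mn;0;NSM;;;;;N;;;;;
11835;DOGRA VOWEL SIGN AU;Mn;0;NSM;;;;;N;;;;;
11836;DOGRA SIGN ANUSVARA;Mn;0;NSM;;;;;N;;;;;
11837;DOGRA SIGN VISARGA;Mc;0;L;;;;;N;;;;;
11838;DOGRA SIGN VIRAMA;Mn;9;NSM;;;;;N;;;;;
11839;DOGRA SIGN NUKTA;Mn;7;NSM;;;;;N;;;;;
1183A;DOGRA ABBREVIATION SIGN;Po;0;L;;;;;N;;;;;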
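Records of this kind can be read with a few lines of Python. The sketch assumes the semicolon-separated field layout of UnicodeData.txt (code point, name, general category, canonical combining class, bidirectional class, and so on); the sample lines use the values proposed in this document, and the function name `parse_unicode_data` is hypothetical.

```python
SAMPLE = """\
1180A;DOGRA LETTER KA;Lo;0;L;;;;;N;;;;;
1182D;DOGRA VOWEL SIGN I;Mc;0;L;;;;;N;;;;;
1182F;DOGRA VOWEL SIGN U;Mn;0;NSM;;;;;N;;;;;
11838;DOGRA SIGN VIRAMA;Mn;9;NSM;;;;;N;;;;;
11839;DOGRA SIGN NUKTA;Mn;7;NSM;;;;;N;;;;;
"""


def parse_unicode_data(text):
    """Map each code point to the properties given in this section."""
    records = {}
    for line in text.splitlines():
        if not line.strip():
            continue
        fields = line.split(";")
        records[int(fields[0], 16)] = {
            "name": fields[1],
            "general_category": fields[2],
            "combining_class": int(fields[3]),
            "bidi_class": fields[4],
            "mirrored": fields[9] == "Y",
        }
    return records


if __name__ == "__main__":
    for cp, props in parse_unicode_data(SAMPLE).items():
        print(f"U+{cp:04X}", props["general_category"], props["combining_class"])
```

The combining classes of 9 for the virama and 7 for the nukta are the only non-zero values in the proposed data.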

4.2 Linebreaking

Linebreaking properties, given in the data format of LineBreak.txt:

11800..1182B;AL # Lo [44] DOGRA LETTER A..DOGRA LETTER RRA
1182C..1182E;CM # Mc  [3] DOGRA VOWEL SIGN AA..DOGRA VOWEL SIGN II
1182F..11835;CM # Mn  [7] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN AU
11836;CM        # Mn      DOGRA SIGN ANUSVARA
11837;CM        # Mc      DOGRA SIGN VISARGA
11838..11839;CM # Mn  [2] DOGRA SIGN VIRAMA..DOGRA SIGN NUKTA
1183A;AL        # Po      DOGRA ABBREVIATION SIGN

4.3 Syllabic categories

Syllabic categories, given in the format of IndicSyllabicCategory.txt:

# Indic_Syllabic_Category=Bindu
11836        ; Bindu # Mn DOGRA SIGN ANUSVARA

# Indic_Syllabic_Category=Visarga
11837        ; Visarga # Mc DOGRA SIGN VISARGA

# Indic_Syllabic_Category=Nukta
11839        ; Nukta # Mn DOGRA SIGN NUKTA

# Indic_Syllabic_Category=Virama
11838        ; Virama # Mn DOGRA SIGN VIRAMA

# Indic_Syllabic_Category=Vowel_Independent
11800..11809 ; Vowel_Independent # Lo [10] DOGRA LETTER A..DOGRA LETTER AU

# Indic_Syllabic_Category=Vowel_Dependent
1182C..1182E ; Vowel_Dependent # Mc [3] DOGRA VOWEL SIGN AA..DOGRA VOWEL SIGN II
1182F..11835 ; Vowel_Dependent # Mn [7] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN AU

# Indic_Syllabic_Category=Consonant
1180A..1182B ; Consonant # Lo [34] DOGRA LETTER KA..DOGRA LETTER RRA

4.4 Positional categories

Positional data for characters, in the format of IndicPositionalCategory.txt:

# Indic_Positional_Category=Right
1182C        ; Right # Mc DOGRA VOWEL SIGN AA
1182E        ; Right # Mc DOGRA VOWEL SIGN II
11837        ; Right # Mc DOGRA SIGN VISARGA

# Indic_Positional_Category=Left
1182D        ; Left # Mc DOGRA VOWEL SIGN I

# Indic_Positional_Category=Top
11836        ; Top # Mn DOGRA SIGN ANUSVARA
11832..11835 ; Top # Mn [4] DOGRA VOWEL SIGN E..DOGRA VOWEL SIGN AU

# Indic_Positional_Category=Bottom
1182F..11831 ; Bottom # Mn [3] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN VOCALIC R
11838        ; Bottom # Mn DOGRA SIGN VIRAMA
11839        ; Bottom # Mn DOGRA SIGN NUKTA

4.5 Script extensions

The following characters should be extended for usage with the Dogra script in ScriptExtensions.txt:

0964         # Po      DEVANAGARI DANDA
0965         # Po      DEVANAGARI DOUBLE DANDA
0966..096F   # Nd [10] DEVANAGARI DIGIT ZERO..DEVANAGARI DIGIT NINE
A830..A835   # No  [6] NORTH INDIC FRACTION ONE QUARTER..NORTH INDIC FRACTION THREE SIXTEENTHS
A836..A837   # So  [2] NORTH INDIC QUARTER MARK..NORTH INDIC PLACEHOLDER MARK
A838         # Sc      NORTH INDIC RUPEE MARK
A839         # So      NORTH INDIC QUANTITY MARK
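As an illustration of how the extension list might be consumed, the following Python sketch checks whether a character belongs to the proposed Dogra repertoire or to one of the shared ranges listed above. The function name `usable_in_dogra_text` is hypothetical, and the block range reflects the code points proposed in this document rather than a published assignment.

```python
# Characters proposed in this document (assumption: final values may differ).
DOGRA_PROPOSED = range(0x11800, 0x1183A + 1)

# Shared characters listed in this section as extensions for Dogra.
SHARED_RANGES = [
    range(0x0964, 0x0965 + 1),  # DEVANAGARI DANDA, DOUBLE DANDA
    range(0x0966, 0x096F + 1),  # DEVANAGARI DIGIT ZERO..NINE
    range(0xA830, 0xA839 + 1),  # North Indic fractions and marks
]


def usable_in_dogra_text(ch):
    """True if ch is a proposed Dogra character or a listed extension."""
    cp = ord(ch)
    return cp in DOGRA_PROPOSED or any(cp in r for r in SHARED_RANGES)


if __name__ == "__main__":
    for sample in ["\u0964", "\u0967", "\ua838", "\u0915", "a"]:
        print(f"U+{ord(sample):04X}", usable_in_dogra_text(sample))
```

A check of this kind treats Devanagari digits and daṇḍā-s as compatible with Dogra text while still rejecting ordinary Devanagari letters.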

4.6 ‘Confusable’ characters

Some Dogra characters that resemble characters encoded in other script blocks are:

DOGRA LETTER U             11187  SHARADA LETTER U
DOGRA LETTER UU            11188  SHARADA LETTER UU
DOGRA LETTER O             11684  TAKRI LETTER U
DOGRA LETTER KA            0A15   GURMUKHI LETTER KA
DOGRA LETTER CA            11196  SHARADA LETTER CA
DOGRA LETTER TTHA          1119C  SHARADA LETTER TTHA
DOGRA LETTER TA            11699  TAKRI LETTER TA
DOGRA LETTER DHA           11680  TAKRI LETTER A
DOGRA LETTER PHA           1169F  TAKRI LETTER PHA
DOGRA LETTER BHA           116A1  TAKRI LETTER BHA
DOGRA LETTER YA            116A3  TAKRI LETTER YA
DOGRA LETTER SHA           0936   DEVANAGARI LETTER SHA
DOGRA LETTER SSA           111B0  SHARADA LETTER SSA
DOGRA VOWEL SIGN U         0941   DEVANAGARI VOWEL SIGN U
DOGRA VOWEL SIGN UU        0942   DEVANAGARI VOWEL SIGN UU
DOGRA VOWEL SIGN E         116B2  TAKRI VOWEL SIGN E
DOGRA VOWEL SIGN AI        116B3  TAKRI VOWEL SIGN AI
DOGRA VOWEL SIGN O         116B4  TAKRI VOWEL SIGN O
DOGRA VOWEL SIGN AU        116B5  TAKRI VOWEL SIGN AU
DOGRA SIGN ANUSVARA        0902   DEVANAGARI SIGN ANUSVARA
DOGRA SIGN VIRAMA          094D   DEVANAGARI SIGN VIRAMA
DOGRA SIGN NUKTA           093C   DEVANAGARI SIGN NUKTA
DOGRA ABBREVIATION SIGN    0970   DEVANAGARI ABBREVIATION SIGN
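One possible use of such a list is a look-alike check, sketched here in Python. The mapping covers only a few of the pairs given above, the Dogra code points are those proposed in this document, and the names `skeleton` and `looks_alike` are hypothetical; the sketch is not the confusables mechanism of any published Unicode data file.

```python
# Proposed Dogra code point -> resembling character, from the list above.
CONFUSABLES = {
    0x1180A: 0x0A15,   # DOGRA LETTER KA  ~ GURMUKHI LETTER KA
    0x11827: 0x0936,   # DOGRA LETTER SHA ~ DEVANAGARI LETTER SHA
    0x11828: 0x111B0,  # DOGRA LETTER SSA ~ SHARADA LETTER SSA
    0x11836: 0x0902,   # DOGRA SIGN ANUSVARA ~ DEVANAGARI SIGN ANUSVARA
    0x11838: 0x094D,   # DOGRA SIGN VIRAMA ~ DEVANAGARI SIGN VIRAMA
    0x11839: 0x093C,   # DOGRA SIGN NUKTA ~ DEVANAGARI SIGN NUKTA
    0x1183A: 0x0970,   # DOGRA ABBREVIATION SIGN ~ DEVANAGARI ABBREVIATION SIGN
}


def skeleton(text):
    """Replace each listed Dogra character with its look-alike."""
    return "".join(chr(CONFUSABLES.get(ord(ch), ord(ch))) for ch in text)


def looks_alike(first, second):
    """True if the two strings match after look-alike replacement."""
    return skeleton(first) == skeleton(second)


if __name__ == "__main__":
    print(looks_alike(chr(0x1180A), "\u0a15"))
```

Because the table records visual resemblance only, a match says nothing about whether the two strings are linguistically equivalent.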

5 References

Bhāskarācārya. 1872. Līlavatī. Translation into Dogri. Jammu: Vidya Vilas Press.

Grierson, George A. 1916. The Linguistic Survey of India. Vol. IX. Indo-Aryan Family. Central Group. Part I. Specimens of Western Hindī and Pañjābī. Calcutta: Office of the Superintendent of Government Printing, India.

Leitner, Gottlieb William. 1882. “A Collection of Specimens of Commercial and Other Alphabets and Handwritings as also of Multiplication Tables Current in Various Parts of the Panjab, Sind and the North West Provinces”. Published as Appendix V of History of Indigenous Education in the Punjab. Lahore: Anjuman-i-Punjab Press.

Pandey, Anshuman. 2007. “Proposal to Encode North Indic Number Forms in ISO/IEC 10646” (L2/07-354). http://www.unicode.org/L2/L2007/07354-north-indic.pdf

Pandey, Anshuman. 2009. “Proposal to Encode the Takri Script in ISO/IEC 10646” (L2/09-424). http://www.unicode.org/L2/L2009/09424-takri.pdf

Pandey, Anshuman. 2015. “Preliminary Proposal to Encode the Dogra Script in Unicode” (L2/15-213). http://www.unicode.org/L2/L2015/15213-dogra.pdf

Staal, Fritz. 1984. Stamps of Jammu & Kashmir. New York: The Collectors Club.

von der Lin, Carol A. “Collection Kashmir”. “Philatelic Glossary: Dōgrī-Takrī Scripts”. http://www.kashmirstamps.com/DogriGloss.html

6 Acknowledgments

I would like to thank Christopher Shackle (School of Oriental and African Studies, London) for reviewing this proposal.

This project was made possible in part through a Google Research Award, granted to Deborah Anderson for the Script Encoding Initiative, and a grant from the United States National Endowment for the Humanities (PR-50205-15), which funds the Universal Scripts Project (part of the Script Encoding Initiative at the University of California, Berkeley). Any views, findings, conclusions or recommendations expressed in this publication do not necessarily reflect those of Google or the National Endowment for the Humanities.

[Code chart: Dogra, range 11800–1184F. Printed using UniBook™ (http://www.unicode.org/unibook/), 04-Nov-2015.]

Dogra names list (11800–1183A)

Punctuation
1183A DOGRA ABBREVIATION SIGN

Independent vowels
11800 DOGRA LETTER A
11801 DOGRA LETTER AA
11802 DOGRA LETTER I
11803 DOGRA LETTER II
11804 DOGRA LETTER U
11805 DOGRA LETTER UU
11806 DOGRA LETTER E
11807 DOGRA LETTER AI
11808 DOGRA LETTER O
11809 DOGRA LETTER AU

Consonants
1180A DOGRA LETTER KA
1180B DOGRA LETTER KHA
1180C DOGRA LETTER GA
1180D DOGRA LETTER GHA
1180E DOGRA LETTER NGA
1180F DOGRA LETTER CA
11810 DOGRA LETTER CHA
11811 DOGRA LETTER JA
11812 DOGRA LETTER JHA
11813 DOGRA LETTER NYA
11814 DOGRA LETTER TTA
11815 DOGRA LETTER TTHA
11816 DOGRA LETTER DDA
11817 DOGRA LETTER DDHA
11818 DOGRA LETTER NNA
11819 DOGRA LETTER TA
1181A DOGRA LETTER THA
1181B DOGRA LETTER DA
1181C DOGRA LETTER DHA
1181D DOGRA LETTER NA
1181E DOGRA LETTER PA
1181F DOGRA LETTER PHA
11820 DOGRA LETTER BA
11821 DOGRA LETTER BHA
11822 DOGRA LETTER MA
11823 DOGRA LETTER YA
11824 DOGRA LETTER RA
11825 DOGRA LETTER LA
11826 DOGRA LETTER VA
11827 DOGRA LETTER SHA
11828 DOGRA LETTER SSA
11829 DOGRA LETTER SA
1182A DOGRA LETTER HA
1182B DOGRA LETTER RRA

Dependent vowel signs
1182C DOGRA VOWEL SIGN AA
1182D DOGRA VOWEL SIGN I
1182E DOGRA VOWEL SIGN II
1182F DOGRA VOWEL SIGN U
11830 DOGRA VOWEL SIGN UU
11831 DOGRA VOWEL SIGN VOCALIC R
11832 DOGRA VOWEL SIGN E
11833 DOGRA VOWEL SIGN AI
11834 DOGRA VOWEL SIGN O
11835 DOGRA VOWEL SIGN AU

Various signs
11836 DOGRA SIGN ANUSVARA
11837 DOGRA SIGN VISARGA
11838 DOGRA SIGN VIRAMA
11839 DOGRA SIGN NUKTA

Figure 1: Comparison of Old Dogra with Takri and other scripts (from Leitner 1882: Set 2).

Figure 2: Comparison of Old Dogra with Takri and other scripts (from Leitner 1882: Set 2).

Figure 3: Specimen of the Dogri language written in Old Dogra (from Grierson 1916: 760).

Figure 4: Another specimen of the Dogri language written in Old Dogra (from Grierson 1916: 772).

Figure 5: Chart of the Old Dogra script (from Grierson 1916: 641).

Figure 6: Comparison of Gurmukhi, Kangra, and Old Dogra scripts (from Grierson 1916: 642).

Figure 7: Comparison of New Dogra (‘Dogri’) and Devanagari vowel and consonant letters (from Staal 1984: 33).

Figure 8: Comparison of New Dogra (‘Dogri’) and Devanagari vowel signs (from Staal 1984: 34).

Figure 9: The first page of a Dogri translation of the Līlavatī by Bhāskarācārya, a Sanskrit treatise on mathematics, printed in the New Dogra script in 1872 (p. 1). Figure 10 contains the second page of this book.

Figure 10: Another page from the Līlavatī in New Dogra (p. 2).

Figure 11: Page from the Līlavatī showing the use of printed New Dogra digits (p. 3).

Figure 12: Usage of fractions for denoting currency in New Dogra (from Staal 1984: 75). The text appears to be typeset using metal fonts. The fraction signs and currency marks may be represented using characters from the ‘Common Indic Number Forms’ block (U+A830) in Unicode.

Figure 13: Two specimens of a trilingual stamp for expedited mail from Jammu State, in Old Dogra, Persian, and Devanagari scripts. The Dogra text reads <U+1180A, U+1180A, U+11825> <U+11811, U+11824, U+1182F, U+11824, U+1182E> kakal jarurī. The Persian reads خط ضروری ḵẖat zarūrī; the Devanagari reads आवश्यक पत्र āvaśyaka patra.

Figure 14: Four bilingual regular postage stamps from Jammu State with text in the Old Dogra and Persian scripts. Stamps […] are half ‘anna’ stamps and […] is a one ‘anna’ stamp. In all stamps the Dogra text at the top reads <U+11811, U+11822, U+11804> jamu ‘Jammu’ and <U+1180A, U+11810, U+11822, U+1182E, U+11824> kachamīr (= kaśmīr) ‘Kashmir’. In stamps […] the first line of the center is Persian, ۱۹۲۳ نم انه 1923 [samvat] nim ānah ‘half ānnā’; the second line of the center is Dogra, <U+11800, U+1181B, U+11800, U+1181D> ada ana ‘half ānnā’. In […] the first line of the center text is Persian, یك انه yak ānah ‘one ānnā’; the second line of the center is Dogra, <U+11806, U+1180A, U+11800, U+1181D> ekana ‘one ānnā’. In all four stamps the third line of the center is the date 1923 [samvat] written in Devanagari-like digits, १९२३. The Persian text at the bottom of the stamp reads قلمرو سرکار جمون و کشمیر qalamrau sarkār jammūn o kaśmīr ‘dominions of the ruler of Jammu and Kashmir’.

Detail of top left (Devanagari)

Detail of top right (New Dogra)

Detail of center (New Dogra, Devanagari, and Urdu)

Figure 15: The mast of a two rupee non-judicial stamp paper from Kashmir State.

Figure 16: The mast of a five rupee non-judicial stamp paper from Kashmir state containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.

Figure 17: A one anna stamp from Jammu and Kashmir state with text in New Dogra, Devanagari, and Arabic (Urdu).

Figure 18: Telegraph stamps from Jammu and Kashmir state with text in Dogra, Devanagari, Urdu, and English. The Dogra denominations show usage of Common Indic fractions and the rupee mark.

postakarda kalamrau jammu kasmir va tibbataha: “Postcard of the realm of Jammu, Kashmir, and Tibet”

isa tarapha siva pate ke kucha na likho: “Do not write anything on this side except for the address”

jamu, sinagar: “Jammu”, “Srinagar”

bhaisakh (4 ) 4 vaisakh: “Vaisakh 4”, “4 Vaisakh”

Figure 19: A state postcard from Jammu and Kashmir with a postmark of 19 April 1891. The inscription at top is in New Dogra and the content is in Persian. The postmarks are in English and Dogra.

One rupee banknote

Twenty rupee banknote

Figure 20: Bank notes from Jammu and Kashmir State containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.

Table 2: Comparison of Chamba and Dogra vowel letters and signs

Vowel letters

Chamba (Takri block)    Dogra
U+11680                 U+11800
U+11681                 U+11801
U+11682                 U+11802
U+11683                 (none)
U+11684                 U+11804
U+11685                 (none)
U+11686                 U+11806
U+11687                 U+11807
U+11688                 U+11808
U+11689                 (none)

Vowel signs

Chamba (Takri block)    Dogra
no dependent form
U+116AD                 U+1182C
U+116AE                 U+1182D
U+116AF                 U+1182E
U+116B0                 U+1182F
U+116B1                 U+11830
U+116B2                 U+11832
U+116B3                 U+11833
U+116B4                 U+11834
U+116B5                 U+11835

Table 3: Comparison of Chamba and Dogra consonant letters

Chamba (Takri block)    Dogra
U+1168A                 U+1180A
U+1168B                 U+1180B
U+1168C                 U+1180C
U+1168D                 U+1180D
U+1168E                 U+1180E
U+1168F                 U+1180F
U+11690                 U+11810
U+11691                 U+11811
U+11692                 U+11812
U+11693                 U+11813
U+11694                 U+11814
U+11695                 U+11815
U+11696                 U+11816
U+11697                 U+11817
U+11698                 U+11818
U+11699                 U+11819
U+1169A                 U+1181A
U+1169B                 U+1181B
U+1169C                 U+1181C
U+1169D                 U+1181D
U+1169E                 U+1181E
U+1169F                 U+1181F
U+116A0                 U+11820
U+116A1                 U+11821
U+116A2                 U+11822
U+116A3                 U+11823
U+116A4                 U+11824
U+116A5                 U+11825
U+116A6                 U+11826
U+116A7                 U+11827
(none)                  U+11828
U+116A8                 U+11829
U+116A9                 U+1182A
U+116AA                 U+1182B

ISO/IEC JTC 1/SC 2/WG 2
PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646¹

Please fill all the sections A, B and C below. Please read Principles and Procedures Document (P & P) from http://std.dkuug.dk/JTC1/SC2/WG2/docs/principles.html for guidelines and details before filling this form. Please ensure you are using the latest Form from http://std.dkuug.dk/JTC1/SC2/WG2/docs/summaryform.html. See also http://std.dkuug.dk/JTC1/SC2/WG2/docs/roadmaps.html for latest Roadmaps.

A. Administrative

1. Title: Proposal to encode the Dogra script in Unicode
2. Requester's name: Anshuman Pandey (Script Encoding Initiative)
3. Requester type (Member body/Liaison/Individual contribution): Liaison contribution
4. Submission date: 4 November 2015
5. Requester's reference (if applicable):
6. Choose one of the following:
   This is a complete proposal: Yes
   (or) More information will be provided later:

B. Technical – General

1. Choose one of the following:
   a. This proposal is for a new script (set of characters): Yes
      Proposed name of script: Dogra
   b. The proposal is for addition of character(s) to an existing block:
      Name of the existing block:
2. Number of characters in proposal: 59
3. Proposed category (select one from below; see section 2.2 of P&P document):
   A-Contemporary; B.1-Specialized (small collection); B.2-Specialized (large collection); C-Major extinct; D-Attested extinct; E-Minor extinct [X]; F-Archaic Hieroglyphic or Ideographic; G-Obscure or questionable usage symbols
4. Is a repertoire including character names provided? Yes
   a. If YES, are the names in accordance with the “character naming guidelines” in Annex L of P&P document?
   b. Are the character shapes attached in a legible form suitable for review?
5. Fonts related:
   a. Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the standard? Anshuman Pandey
   b. Identify the party granting a license for use of the font by the editors (include address, e-mail, ftp-site, etc.): Anshuman Pandey
6. References:
   a. Are references (to other character sets, dictionaries, descriptive texts etc.) provided? Yes
   b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? Yes
7. Special encoding issues: Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration etc. (if yes please enclose information)? Yes
8. Additional Information: Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org for such information on other scripts. Also see Unicode Character Database (http://www.unicode.org/reports/tr44/) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard.

¹ Form number: N4502-F (Original 1994-10-14; Revised 1995-01, 1995-04, 1996-04, 1996-08, 1999-03, 2001-05, 2001-09, 2003-11, 2005-01, 2005-09, 2005-10, 2007-03, 2008-05, 2009-11, 2011-03, 2012-01)

C. Technical – Justification

1. Has this proposal for addition of character(s) been submitted before? No
   If YES, explain:
2. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? Yes
   If YES, with whom? Christopher Shackle (SOAS, London)
   If YES, available relevant documents:
3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? No
   Reference: Historical script, currently not used
4. The context of use for the proposed characters (type of use; common or rare): Common
   Reference:
5. Are the proposed characters in current use by the user community? No
   If YES, where? Reference:
6. After giving due considerations to the principles in the P&P document, must the proposed characters be entirely in the BMP? N/A
   If YES, is a rationale provided? If YES, reference:
7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? Yes
8. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? No
   If YES, is a rationale for its inclusion provided? If YES, reference:
9. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? No
   If YES, is a rationale for its inclusion provided? If YES, reference:
10. Can any of the proposed character(s) be considered to be similar (in appearance or function) to, or could be confused with, an existing character? No
    If YES, is a rationale for its inclusion provided? If YES, reference:
11. Does the proposal include use of combining characters and/or use of composite sequences? Yes
    If YES, is a rationale for such use provided? Yes
    If YES, reference: See text of proposal
    Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided? If YES, reference:
12. Does the proposal contain characters with any special properties such as control function or similar semantics? Yes
    If YES, describe in detail (include attachment if necessary): Virama; see text of proposal
13. Does the proposal contain any Ideographic compatibility characters? No
    If YES, are the equivalent corresponding unified ideographic characters identified? If YES, reference:

the ‘New Dogra Script’. It was the official script of the State of Jammu and Kashmir during the reign of Maharaja Ranbir Singh (r. 1857–1885), who is the ruler referred to by Grierson. The New Dogra script was used for administrative purposes and upon currency, judicial and non-judicial stamp papers, postcards, and postage stamps. It was also used for literary activities. The first book published in the standard Dogra was a translation of a Sanskrit mathematical treatise by Bhāskarācārya titled Līlavatī. The work was commissioned by Ranbir Singh and printed in 1872 at Vidya Vilas Press, the first printing press in Jammu.

The old and standard forms of Dogra differ primarily in the shapes of letterforms and the representation of consonant clusters. Specific differences are described throughout this proposal. As mentioned by Grierson, standard Dogra was influenced by Devanagari and Gurmukhi. The motivation behind such a move is not clear, but it could be related to the idea of providing the Dogra script with an air of ‘authority’ by giving it a form that resembled scripts which were already endowed with ‘official’ and ‘literary’ attributes. The standardization of Dogra in the 1860s established two distinctive forms of the script. Throughout this document the common written form is called ‘Old Dogra’ and the standard form is referred to as ‘New Dogra’.

The Dogra script is no longer used actively, and the Dogri language is now generally written in Devanagari. Nonetheless, the script continues to attract the interest of philatelists who collect ephemera from Jammu and Kashmir. Although there is very little scholarship on the script, it was described briefly by the prominent Sanskrit scholar Fritz Staal in his Stamps of Jammu & Kashmir (New York: The Collectors Club, 1984). Information on Dogra is also curated on a philately website called “Collecting Kashmir”, maintained by Carol von der Lin

3 Proposed Encoding

3.1 Approach

The justification for encoding Dogra as a separate script in Unicode is based upon the following criteria: 1) graphical differences between Dogra, Chamba, and other forms of Takri; 2) usage of the script in an official capacity; 3) cultural and political recognition of it as a distinctive script; 4) differentiation into ‘old’ and ‘standard’ forms. Of these, the fourth point requires a means for handling the internal distinctions of the script. A separate block for Dogra would keep the script distinct from other related scripts while also maintaining the identity of its styles internally.

3.2 Structure

The Dogra script is a Brahmi-based alphasyllabary that is written from left to right. Independent and initial vowels are written using letters, while dependent vowels are expressed using combining signs. Consonant letters possess the inherent vowel a, which is changed by the attachment of a vowel sign. Consonant clusters are written in various ways, such as with visible halanta, as an atomic ligature, and with half-forms; repha is attested but rarely used. There are no formal conventions regarding representation of conjuncts.

3.3 Script name

The designation ‘Dogra’ has been assigned to the script block in Unicode. The script is referred to as both ‘Dogra’ and ‘Dogri’ in English secondary sources. The name ‘Dogra’ is preferred over ‘Dogri’ because it links the script with the Dogra dynasty of Jammu and Kashmir, under whose government it was established as the official script. Secondly, ‘Dogri’ suggests a connection with the language of the same name; however, the language is no longer written in the script. The script was officially referred to as dvigarta akṣara, or “Dogra letters”, where ‘Dvigarta’ is presented as the Sanskritic root of the name ‘Dogra’. Another indigenous name for the script is name dogra akkhar, or “New Dogra letters”.

3.4 Representative glyphs

The representative glyphs for Dogra characters are based primarily upon forms used in New Dogra. Characters from Old Dogra have been used when necessary, sourced primarily from figures 1 and 2. The font has been designed by the proposal author.

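3.5 Vowel letters

There are 10 vowel letters:

U+11800  DOGRA LETTER A
U+11801  DOGRA LETTER AA
U+11802  DOGRA LETTER I
U+11803  DOGRA LETTER II
U+11804  DOGRA LETTER U
U+11805  DOGRA LETTER UU
U+11806  DOGRA LETTER E
U+11807  DOGRA LETTER AI
U+11808  DOGRA LETTER O
U+11809  DOGRA LETTER AU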

In Old Dogra the values of three vowel letters are variable: the letterforms of U+11803 and U+11802, and a third letter. As shown in the accompanying sources, the letterform of U+11803 was used for both i and ī; that of U+11802 for u and ū; and the third letter for u, ū, o, and au. In New Dogra the values of these three letters became fixed, and new letters were introduced in order to provide distinctive representations for all basic vowels. It is apparent that the attempt to stabilize the vowel repertoire resulted in overlap and reassignment of several letters. The Old Dogra letterform of U+11803 (i, ī) was retained solely for ī, and that of U+11802 (u, ū) was reassigned for i. A modified form of the third letter (u, o) came to represent u, while the form U+11805 was introduced for ū, which is certainly an adaptation of the letter ū from Devanagari or Sharada. Distinct letters for U+11808 o and U+11809 au were provided by placing the corresponding dependent vowel signs upon the modified u.

Old        New
U+11803    U+11802
(none)     U+11803
U+11802    U+11804
(none)     U+11805
[…]        U+11808
[…]        U+11809

The proposed vowel letters are based upon the repertoire of New Dogra. This approach completely accommodates vowels in the old and new forms of the script. Although the names of New Dogra vowel letters do not strictly correlate with values of Old Dogra vowels, the proposed letters can be used for representing the old script. U+11802 DOGRA LETTER I can be used for u; U+11803 DOGRA LETTER II can be used for i; and U+11804 DOGRA LETTER U can be used for o and au. The height and curvature of the left stroke may be considered a stylistic attribute.

3.6 Vowel Signs

There are 10 dependent vowel signs:

U+1182C  DOGRA VOWEL SIGN AA
U+1182D  DOGRA VOWEL SIGN I
U+1182E  DOGRA VOWEL SIGN II
U+1182F  DOGRA VOWEL SIGN U
U+11830  DOGRA VOWEL SIGN UU
U+11831  DOGRA VOWEL SIGN VOCALIC R
U+11832  DOGRA VOWEL SIGN E
U+11833  DOGRA VOWEL SIGN AI
U+11834  DOGRA VOWEL SIGN O
U+11835  DOGRA VOWEL SIGN AU

U+11831 DOGRA VOWEL SIGN VOCALIC R appears to have been introduced as part of New Dogra. It is used for representing words of Sanskrit origin that contain the vocalic sound r. Its shape is derived from either U+11190 or U+0943. There is no corresponding independent vowel letter.

There is an alphabetic aspect to the representation of consonant-vowel syllables in written Dogra. The dependent vowel may be represented using the independent vowel letter as well as the vowel sign, e.g. <U+1180A, U+11806> may be used in place of <U+1180A, U+11832> for the syllable ke.
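The alternation between a vowel sign and the corresponding independent letter can be sketched as a simple substitution. This is a minimal illustration, not part of the proposal: the helper name `alphabetic_spelling` is hypothetical, the code points are those of this proposal's chart (a published standard may assign different values), and extending the attested ke example to every vowel that has both a sign and a letter is an assumption.

```python
# Hypothetical helper. Code points follow this proposal's chart; only the
# E pairing (ke) is attested in the text, the other pairings are assumed.
SIGN_TO_LETTER = {
    0x1182C: 0x11801,  # VOWEL SIGN AA -> LETTER AA
    0x1182D: 0x11802,  # VOWEL SIGN I  -> LETTER I
    0x1182E: 0x11803,  # VOWEL SIGN II -> LETTER II
    0x1182F: 0x11804,  # VOWEL SIGN U  -> LETTER U
    0x11830: 0x11805,  # VOWEL SIGN UU -> LETTER UU
    0x11832: 0x11806,  # VOWEL SIGN E  -> LETTER E
    0x11833: 0x11807,  # VOWEL SIGN AI -> LETTER AI
    0x11834: 0x11808,  # VOWEL SIGN O  -> LETTER O
    0x11835: 0x11809,  # VOWEL SIGN AU -> LETTER AU
}


def alphabetic_spelling(code_points):
    """Replace each dependent vowel sign with its independent letter.

    Code points without a mapped letter (for example VOWEL SIGN VOCALIC R,
    which has no independent counterpart) are passed through unchanged.
    """
    return [SIGN_TO_LETTER.get(cp, cp) for cp in code_points]


KA = 0x1180A
ke_with_sign = [KA, 0x11832]
ke_with_letter = alphabetic_spelling(ke_with_sign)
print(" ".join(f"U+{cp:04X}" for cp in ke_with_letter))
```

A search or matching routine could apply such a mapping to both sides of a comparison so that the two spellings of a syllable are treated alike.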

There are differences in the shape of certain vowel signs between the old and new script. The typical Takri forms of the vowel signs -u and -ū, which also resemble corresponding forms in Gurmukhi, were replaced with forms resembling Devanagari signs, U+1182F and U+11830 respectively. The typical Takri style of the vowel sign o was applied for writing U+11835 au, while a simplified form of o, U+11834, was used for o. The practice of duplicating o for writing au was eliminated. A comparison of vowel signs in the old and new scripts, as they align with the proposed characters, is as follows:

Old    New
[…]    U+1182D
[…]    U+1182E
[…]    U+1182F
[…]    U+11830
[…]    U+11834
[…]    U+11835

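3.7 Consonants

There are 34 consonant letters:

U+1180A  DOGRA LETTER KA
U+1180B  DOGRA LETTER KHA
U+1180C  DOGRA LETTER GA
U+1180D  DOGRA LETTER GHA
U+1180E  DOGRA LETTER NGA
U+1180F  DOGRA LETTER CA
U+11810  DOGRA LETTER CHA
U+11811  DOGRA LETTER JA
U+11812  DOGRA LETTER JHA
U+11813  DOGRA LETTER NYA
U+11814  DOGRA LETTER TTA
U+11815  DOGRA LETTER TTHA
U+11816  DOGRA LETTER DDA
U+11817  DOGRA LETTER DDHA
U+11818  DOGRA LETTER NNA
U+11819  DOGRA LETTER TA
U+1181A  DOGRA LETTER THA
U+1181B  DOGRA LETTER DA
U+1181C  DOGRA LETTER DHA
U+1181D  DOGRA LETTER NA
U+1181E  DOGRA LETTER PA
U+1181F  DOGRA LETTER PHA
U+11820  DOGRA LETTER BA
U+11821  DOGRA LETTER BHA
U+11822  DOGRA LETTER MA
U+11823  DOGRA LETTER YA
U+11824  DOGRA LETTER RA
U+11825  DOGRA LETTER LA
U+11826  DOGRA LETTER VA
U+11827  DOGRA LETTER SHA
U+11828  DOGRA LETTER SSA
U+11829  DOGRA LETTER SA
U+1182A  DOGRA LETTER HA
U+1182B  DOGRA LETTER RRA

U+11828 DOGRA LETTER SSA was introduced into New Dogra for representing the retroflex sibilant ṣa, which occurs in words of Sanskrit origin. It resembles U+111B0 SHARADA LETTER SSA.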

There are glyphic variants of several consonant letters:

Regular    Variant
U+1180C    […]
U+1180D    […]
U+11810    […]
U+11812    […]
U+1181A    […]
U+1181C    […]
U+11824    […]

There is evidence of usage of different forms of the same letter in the available sources. For instance, both U+11810 and […] are used for

3.8 Anusvara

U+11836 DOGRA SIGN ANUSVARA is used for marking nasalization.

3.9 Visarga

U+11837 DOGRA SIGN VISARGA is used for indicating post-vocalic aspiration in words of Sanskrit origin. It is shown below in the word <U+1181D, U+11822, U+11837> namaḥ “homage to”.

3.10 Nukta

U+11839 DOGRA SIGN NUKTA is used for representing sounds that are not native to Dogri and related languages.

3.11 Virama

U+11838 DOGRA SIGN VIRAMA has two functions, similar to the corresponding character of Devanagari. It is used as a halanta for marking the absence of the inherent vowel of a consonant letter. It is also a control character that is used for producing conjuncts.

3.12 Consonant Conjuncts

Consonant clusters are rendered in three ways: 1) with visible halanta beneath the full form of each bare consonant; 2) with a half-form of the initial consonant followed by the full form of the following letter; and 3) as ligatures. There are no formal conventions for the written expression of conjuncts. U+11838 DOGRA SIGN VIRAMA is used for the encoded representation of conjuncts.

3.12.1 Types of conjuncts

Some representations of Dogra conjuncts, as found in the available sources, are illustrated below:

• Visible halanta: In both Old and New Dogra the most common method of representing consonant clusters is by placing a halanta beneath each bare consonant.

• Half-forms: Another method is to use half-forms of letters, if the graphical structure provides such an opportunity.

The above shows the conjunct śra in the word śrī (a Sanskrit honorific). The half-form is produced by dropping the right vertical bar of U+11827 DOGRA LETTER SHA. The conjunct is used consistently in New Dogra for representing the cluster śra; the form <U+11827, U+11838, U+11824>, with visible halanta beneath a full form of U+11827, does not occur in the available sources.

• Ligatures: There are two types of ligature. In the first, the shapes of individual letters are visible. In the below, the boxed text contains the conjunct sṭa from the word sṭāṃp, a transcription of the English “stamp”.

The second type is an atomic (akhaṇḍa) ligature, whose form may be considered a distinctive letter. Such a ligature occurs in the excerpt below.

The boxed text above contains the atomic ligature kṣa, which occurs in the word akṣaroṃ “in letters” (oblique form of akṣara “letter”). The conjunct kṣa is used consistently in New Dogra for representing the cluster kṣa. Its shape may be influenced by Devanagari kṣa.

• Forms of ra: Special conjunct forms of U+11824 DOGRA LETTER RA, e.g. repha or vattu, do not generally occur. Cluster-initial ra is represented as a full form with halanta, as in rṇa in the word saṃpūrṇa below,

and non-initial ra is treated similarly, as in pra in the word prabhu below.

However, there is one instance in the available sources in which cluster-initial ra is represented as a repha. The occurrence is curious because the word caturbhuja “square”, in which repha occurs, appears three times in proximity, but is spelled differently in all three instances.

In the boxed text at the top, the ra in caturbhuja is written with halanta: <U+1180F, U+11819, U+1182F, U+11824, U+11838, U+11821, U+1182F, U+11811> caturbhuja. In the text at bottom left, ra occurs with its inherent vowel intact, without halanta: <U+1180F, U+11819, U+1182F, U+11824, U+11821, U+1182F, U+11811> caturabhuja. At bottom right, ra is represented as repha: caturbhuja.

The usage of repha in Dogra is to be considered anomalous. There is no present need to support this form of ra in encoded representation.

3.12.2 Encoded representation of conjuncts

The Devanagari model for the representation of conjuncts is appropriate for Dogra. In encoded text, conjuncts are produced by placing the control character U+11838 DOGRA SIGN VIRAMA after each non-final consonant in a cluster, i.e. <(C, VIRAMA)+, C>.

Representations of conjuncts rendered as atomic ligatures are:

kṣa    <U+1180A KA, U+11838 VIRAMA, U+11828 SSA>

sṭa    <U+11829 SA, U+11838 VIRAMA, U+11814 TTA>

Examples of conjuncts rendered using half-forms are:

śra    <U+11827 SHA, U+11838 VIRAMA, U+11824 RA>

Conjuncts displayed using visible halanta are:

pra    <U+1181E PA, U+11838 VIRAMA, U+11824 RA>

rṇa    <U+11824 RA, U+11838 VIRAMA, U+11818 NNA>
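The encoded form of a cluster can be sketched in a few lines: the virama follows every consonant except the last. This is a minimal illustration, assuming the code points of this proposal's chart (a published standard may assign different values); the names `encode_conjunct` and `show` are hypothetical helpers, not part of any library.

```python
# Code points as assigned in this proposal's chart (an assumption of the
# sketch; they are plain integers and no Unicode database lookup is used).
VIRAMA = 0x11838
KA, SSA, SA, TTA = 0x1180A, 0x11828, 0x11829, 0x11814
SHA, RA, PA, NNA = 0x11827, 0x11824, 0x1181E, 0x11818


def encode_conjunct(consonants):
    """Return the code point sequence for a consonant cluster.

    VIRAMA is placed after every consonant except the last one.
    """
    if not consonants:
        raise ValueError("a cluster needs at least one consonant")
    sequence = []
    for cp in consonants[:-1]:
        sequence.extend((cp, VIRAMA))
    sequence.append(consonants[-1])
    return sequence


def show(code_points):
    """Format a sequence in angle-bracket notation for inspection."""
    return "<" + ", ".join(f"U+{cp:04X}" for cp in code_points) + ">"


for label, cluster in (("kṣa", [KA, SSA]), ("sṭa", [SA, TTA]),
                       ("śra", [SHA, RA]), ("pra", [PA, RA]),
                       ("rṇa", [RA, NNA])):
    print(label, show(encode_conjunct(cluster)))
```

The same encoded sequence is used whichever visual form (ligature, half-form, or visible halanta) a font chooses to display.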

3.12.3 Modifying conjunct formation

As is the case in Devanagari and other Indic scripts, a ligature may be broken using the generic control character U+200C ZERO WIDTH NON-JOINER:

kṣa (ligature)           <U+1180A KA, U+11838 VIRAMA, U+11828 SSA>

kṣa (visible halanta)    <U+1180A KA, U+11838 VIRAMA, U+200C ZWNJ, U+11828 SSA>

The rendering of a conjunct as a ligature or with visible halanta is a matter of font design.
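A sketch of how text might request the non-ligated form: U+200C is inserted after each virama that is followed by another character. The function name `break_ligatures` is hypothetical, and the virama value is the one assigned in this proposal's chart, which is an assumption of the example.

```python
ZWNJ = 0x200C      # ZERO WIDTH NON-JOINER
VIRAMA = 0x11838   # as assigned in this proposal's chart (assumption)


def break_ligatures(code_points, virama=VIRAMA):
    """Insert ZWNJ after every virama that is not the last code point."""
    result = []
    last = len(code_points) - 1
    for index, cp in enumerate(code_points):
        result.append(cp)
        if cp == virama and index < last:
            result.append(ZWNJ)
    return result


ksa = [0x1180A, 0x11838, 0x11828]  # KA, VIRAMA, SSA
print(" ".join(f"U+{cp:04X}" for cp in break_ligatures(ksa)))
```

A word-final virama is left untouched, since there is no following consonant to join with.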

3.13 Punctuation

Abbreviation sign: U+1183A DOGRA ABBREVIATION SIGN is used for indicating truncation of text. In the excerpt below, the highlighted text <U+11825, U+1182E, U+1183A> <U+11826, U+1183A> <U+11821, U+1182C, U+1183A> (lī va bhā 23) is an abbreviation for līla vatī bhāga 23.

Danda-s: The daṇḍā and double daṇḍā are commonly used in Dogra documents for marking the ends of sentences and paragraphs. The below excerpt shows usage of the daṇḍā,

and the following shows use of the double daṇḍā.

These daṇḍā-s are not proposed for encoding as separate characters in the ‘Dogra’ block. They resemble U+0964 DEVANAGARI DANDA and U+0965 DEVANAGARI DOUBLE DANDA and may be represented using these characters. The Devanagari daṇḍā-s are to be specified as extensions for Dogra.

Other signs: Other marks of punctuation, such as dashes, are also used.

A hyphen is used in some cases for indicating breaks in a word at end of line (see section 3.16).

3.14 Digits

Digits are commonly used in Dogra documents (see figure 11). The style of digits varies across written and printed sources. Some digits in Old Dogra sources resemble those of Takri (see figure 4), while those in New Dogra sources are similar to Devanagari. The excerpt below shows digits 1–9 and the number 10 as represented in the Līlavatī, written in New Dogra.

The majority of the digits shown here resemble Devanagari forms; the exception is ‘four’. The form of Dogra ‘four’ does not resemble U+116C4 TAKRI DIGIT FOUR, which is nearly identical to the Devanagari digit four. Its origins are not known at this time, but it may be related to U+111D4 SHARADA DIGIT FOUR. The forms of ‘eight’ and ‘nine’ are well-known variant forms of the regular Devanagari digits.

Additional research into Dogra digits is required before they may be considered for encoding. The available sources suggest that there may have been a cultural understanding that Dogra possessed unique digits. For instance, on the bank notes shown in figure 20, the digits written in Devanagari are rather similar, but are nonetheless displayed twice in order to align the display of the currency value with each script used for the inscriptions.

At present, script-specific digits are not proposed for inclusion in the Dogra repertoire. Devanagari digits should be used normatively and should be extended for usage with Dogra. If future research yields distinctive digits for Dogra, there is sufficient space in the block to accommodate their encoding.

3.15 Number forms

Fraction signs and currency marks are commonly used in Dogra sources (see figure 12). These may be represented using the fraction and currency signs already encoded in the ‘Common Indic Number Forms’ block (U+A830). These characters are specified as extensions for Dogra. The ‘Common Indic Number Forms’ are described briefly in The Unicode Standard (chapter 22.3, ‘Numerals’, pp. 770–771) and in more detail in the original encoding proposal (L2/07-354).

3.16 Linebreaking

Linebreaking occurs after an orthographic syllable. Breaks are generally not indicated in informal sources, but are marked in literary materials using hyphens.

3.17 Collation

The primary collating order for Dogra is as follows:

U+11800 < U+11801 < U+11802 < U+11803 < U+11804 < U+11805 < U+11806 < U+11807 < U+11808 < U+11809 <

U+1180A < U+1180B < U+1180C < U+1180D < U+1180E < U+1180F < U+11810 < U+11811 <

U+11812 < U+11813 < U+11814 < U+11815 < U+11816 < U+11817 < U+11818 < U+11819 <

U+1181A < U+1181B < U+1181C < U+1181D < U+1181E < U+1181F < U+11820 < U+11821 <

U+11822 < U+11823 < U+11824 < U+11825 < U+11826 < U+11827 < U+11828 < U+11829 < U+1182A <

U+1182B < U+1182C < U+1182D < U+1182E <

U+1182F < U+11830 < U+11831 < U+11832 <

U+11833 < U+11834 < U+11835

The following characters have secondary weights: U+11836, U+11837, U+11838, U+11839.
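A two-level sort key along these lines can be sketched as follows. It is a simplification and not the Unicode Collation Algorithm: primary weights follow the order listed above, which coincides with code point order from U+11800 to U+11835 in this proposal's chart; the relative order of the four secondary-weight signs is not stated in the text and is assumed here to be code point order; characters outside the block are ignored. The name `sort_key` is hypothetical.

```python
# Primary weights: position in the order given above (1-based).
PRIMARY = {cp: rank for rank, cp in enumerate(range(0x11800, 0x11836), start=1)}

# Secondary weights: 1 is the common weight carried by every primary
# character; the four signs get higher values (their order is assumed).
COMMON = 1
SECONDARY = {cp: weight for weight, cp in
             enumerate((0x11836, 0x11837, 0x11838, 0x11839), start=2)}


def sort_key(text):
    """Return a (primary, secondary) key for a string of Dogra characters."""
    code_points = [ord(ch) for ch in text]
    level1 = tuple(PRIMARY[cp] for cp in code_points if cp in PRIMARY)
    level2 = tuple(SECONDARY.get(cp, COMMON) for cp in code_points
                   if cp in PRIMARY or cp in SECONDARY)
    return (level1, level2)


ka_ssa = chr(0x1180A) + chr(0x11828)                  # KA, SSA
ksa = chr(0x1180A) + chr(0x11838) + chr(0x11828)      # KA, VIRAMA, SSA
kha = chr(0x1180B)                                    # KHA
ordered = sorted([kha, ksa, ka_ssa], key=sort_key)
print([[f"U+{ord(ch):04X}" for ch in word] for word in ordered])
```

Because the virama carries only a secondary weight, the two KA + SSA spellings compare equal at the primary level and are separated only at the secondary level.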

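4 Character data

4.1 Character properties

The properties for Dogra in the Unicode Character Database format are:

11800;DOGRA LETTER A;Lo;0;L;;;;;N;;;;;
11801;DOGRA LETTER AA;Lo;0;L;;;;;N;;;;;
11802;DOGRA LETTER I;Lo;0;L;;;;;N;;;;;
11803;DOGRA LETTER II;Lo;0;L;;;;;N;;;;;
11804;DOGRA LETTER U;Lo;0;L;;;;;N;;;;;
11805;DOGRA LETTER UU;Lo;0;L;;;;;N;;;;;
11806;DOGRA LETTER E;Lo;0;L;;;;;N;;;;;
11807;DOGRA LETTER AI;Lo;0;L;;;;;N;;;;;
11808;DOGRA LETTER O;Lo;0;L;;;;;N;;;;;
11809;DOGRA LETTER AU;Lo;0;L;;;;;N;;;;;
1180A;DOGRA LETTER KA;Lo;0;L;;;;;N;;;;;
1180B;DOGRA LETTER KHA;Lo;0;L;;;;;N;;;;;
1180C;DOGRA LETTER GA;Lo;0;L;;;;;N;;;;;
1180D;DOGRA LETTER GHA;Lo;0;L;;;;;N;;;;;
1180E;DOGRA LETTER NGA;Lo;0;L;;;;;N;;;;;
1180F;DOGRA LETTER CA;Lo;0;L;;;;;N;;;;;
11810;DOGRA LETTER CHA;Lo;0;L;;;;;N;;;;;
11811;DOGRA LETTER JA;Lo;0;L;;;;;N;;;;;
11812;DOGRA LETTER JHA;Lo;0;L;;;;;N;;;;;
11813;DOGRA LETTER NYA;Lo;0;L;;;;;N;;;;;
11814;DOGRA LETTER TTA;Lo;0;L;;;;;N;;;;;
11815;DOGRA LETTER TTHA;Lo;0;L;;;;;N;;;;;
11816;DOGRA LETTER DDA;Lo;0;L;;;;;N;;;;;
11817;DOGRA LETTER DDHA;Lo;0;L;;;;;N;;;;;
11818;DOGRA LETTER NNA;Lo;0;L;;;;;N;;;;;
11819;DOGRA LETTER TA;Lo;0;L;;;;;N;;;;;
1181A;DOGRA LETTER THA;Lo;0;L;;;;;N;;;;;
1181B;DOGRA LETTER DA;Lo;0;L;;;;;N;;;;;
1181C;DOGRA LETTER DHA;Lo;0;L;;;;;N;;;;;
1181D;DOGRA LETTER NA;Lo;0;L;;;;;N;;;;;
1181E;DOGRA LETTER PA;Lo;0;L;;;;;N;;;;;
1181F;DOGRA LETTER PHA;Lo;0;L;;;;;N;;;;;
11820;DOGRA LETTER BA;Lo;0;L;;;;;N;;;;;
11821;DOGRA LETTER BHA;Lo;0;L;;;;;N;;;;;
11822;DOGRA LETTER MA;Lo;0;L;;;;;N;;;;;
11823;DOGRA LETTER YA;Lo;0;L;;;;;N;;;;;
11824;DOGRA LETTER RA;Lo;0;L;;;;;N;;;;;
11825;DOGRA LETTER LA;Lo;0;L;;;;;N;;;;;
11826;DOGRA LETTER VA;Lo;0;L;;;;;N;;;;;
11827;DOGRA LETTER SHA;Lo;0;L;;;;;N;;;;;
11828;DOGRA LETTER SSA;Lo;0;L;;;;;N;;;;;
11829;DOGRA LETTER SA;Lo;0;L;;;;;N;;;;;
1182A;DOGRA LETTER HA;Lo;0;L;;;;;N;;;;;
1182B;DOGRA LETTER RRA;Lo;0;L;;;;;N;;;;;
1182C;DOGRA VOWEL SIGN AA;Mc;0;L;;;;;N;;;;;
1182D;DOGRA VOWEL SIGN I;Mc;0;L;;;;;N;;;;;
1182E;DOGRA VOWEL SIGN II;Mc;0;L;;;;;N;;;;;
1182F;DOGRA VOWEL SIGN U;Mn;0;NSM;;;;;N;;;;;
11830;DOGRA VOWEL SIGN UU;Mn;0;NSM;;;;;N;;;;;
11831;DOGRA VOWEL SIGN VOCALIC R;Mn;0;NSM;;;;;N;;;;;
11832;DOGRA VOWEL SIGN E;Mn;0;NSM;;;;;N;;;;;
11833;DOGRA VOWEL SIGN AI;Mn;0;NSM;;;;;N;;;;;
11834;DOGRA VOWEL SIGN O;Mn;0;NSM;;;;;N;;;;;
11835;DOGRA VOWEL SIGN AU;Mn;0;NSM;;;;;N;;;;;
11836;DOGRA SIGN ANUSVARA;Mn;0;NSM;;;;;N;;;;;
11837;DOGRA SIGN VISARGA;Mc;0;L;;;;;N;;;;;
11838;DOGRA SIGN VIRAMA;Mn;9;NSM;;;;;N;;;;;
11839;DOGRA SIGN NUKTA;Mn;7;NSM;;;;;N;;;;;
1183A;DOGRA ABBREVIATION SIGN;Po;0;L;;;;;N;;;;;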
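Records of this kind can be read with a small parser. The sketch below assumes the standard semicolon-separated UnicodeData.txt layout of 15 fields and keeps only the fields that carry values for Dogra; the `Record` type and `parse_record` function are hypothetical names introduced for illustration.

```python
from typing import NamedTuple


class Record(NamedTuple):
    code_point: int
    name: str
    general_category: str
    combining_class: int
    bidi_class: str
    mirrored: str


def parse_record(line):
    """Parse one semicolon-separated record in UnicodeData.txt layout."""
    fields = line.strip().split(";")
    if len(fields) != 15:
        raise ValueError(f"expected 15 fields, found {len(fields)}")
    return Record(
        code_point=int(fields[0], 16),
        name=fields[1],
        general_category=fields[2],
        combining_class=int(fields[3]),
        bidi_class=fields[4],
        mirrored=fields[9],
    )


virama = parse_record("11838;DOGRA SIGN VIRAMA;Mn;9;NSM;;;;;N;;;;;")
print(f"U+{virama.code_point:04X}", virama.name, virama.combining_class)
```

The non-zero combining classes (9 for the virama, 7 for the nukta) are the values that matter for canonical ordering of combining marks.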

4.2 Linebreaking

Linebreaking properties, given in the data format of LineBreak.txt:

11800..1182B;AL # Lo  [44] DOGRA LETTER A..DOGRA LETTER RRA
1182C..1182E;CM # Mc   [3] DOGRA VOWEL SIGN AA..DOGRA VOWEL SIGN II
1182F..11835;CM # Mn   [7] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN AU
11836;CM       # Mn       DOGRA SIGN ANUSVARA
11837;CM       # Mc       DOGRA SIGN VISARGA
11838..11839;CM # Mn   [2] DOGRA SIGN VIRAMA..DOGRA SIGN NUKTA
1183A;AL       # Po       DOGRA ABBREVIATION SIGN

4.3 Syllabic categories

Syllabic categories, given in the format of IndicSyllabicCategory.txt:

# Indic_Syllabic_Category=Bindu
11836        ; Bindu             # Mn       DOGRA SIGN ANUSVARA

# Indic_Syllabic_Category=Visarga
11837        ; Visarga           # Mc       DOGRA SIGN VISARGA

# Indic_Syllabic_Category=Nukta
11839        ; Nukta             # Mn       DOGRA SIGN NUKTA

# Indic_Syllabic_Category=Virama
11838        ; Virama            # Mn       DOGRA SIGN VIRAMA

# Indic_Syllabic_Category=Vowel_Independent
11800..11809 ; Vowel_Independent # Lo  [10] DOGRA LETTER A..DOGRA LETTER AU

# Indic_Syllabic_Category=Vowel_Dependent
1182C..1182E ; Vowel_Dependent   # Mc   [3] DOGRA VOWEL SIGN AA..DOGRA VOWEL SIGN II
1182F..11835 ; Vowel_Dependent   # Mn   [7] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN AU

# Indic_Syllabic_Category=Consonant
1180A..1182B ; Consonant         # Lo  [34] DOGRA LETTER KA..DOGRA LETTER RRA
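For illustration, the category assignments can be held in a small range table and queried by code point. This is only a sketch: the ranges are taken from the proposal's repertoire (independent vowels are assumed to span U+11800 to U+11809, matching the ten vowel letters of the names list), and `syllabic_category` is a hypothetical helper rather than an existing API.

```python
# (first, last, category) triples based on the repertoire of this proposal.
RANGES = (
    (0x11800, 0x11809, "Vowel_Independent"),
    (0x1180A, 0x1182B, "Consonant"),
    (0x1182C, 0x11835, "Vowel_Dependent"),
    (0x11836, 0x11836, "Bindu"),
    (0x11837, 0x11837, "Visarga"),
    (0x11838, 0x11838, "Virama"),
    (0x11839, 0x11839, "Nukta"),
)


def syllabic_category(code_point):
    """Return the category name for a code point, or None if unlisted."""
    for first, last, category in RANGES:
        if first <= code_point <= last:
            return category
    return None


for cp in (0x1180A, 0x11838, 0x11828):  # KA, VIRAMA, SSA
    print(f"U+{cp:04X}", syllabic_category(cp))
```

The abbreviation sign U+1183A is not listed in the data above, so the lookup returns None for it.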

4.4 Positional categories

Positional data for characters, in the format of IndicPositionalCategory.txt:

# Indic_Positional_Category=Right
1182C        ; Right  # Mc       DOGRA VOWEL SIGN AA
1182E        ; Right  # Mc       DOGRA VOWEL SIGN II
11837        ; Right  # Mc       DOGRA SIGN VISARGA

# Indic_Positional_Category=Left
1182D        ; Left   # Mc       DOGRA VOWEL SIGN I

# Indic_Positional_Category=Top
11836        ; Top    # Mn       DOGRA SIGN ANUSVARA
11832..11835 ; Top    # Mn   [4] DOGRA VOWEL SIGN E..DOGRA VOWEL SIGN AU

# Indic_Positional_Category=Bottom
1182F..11831 ; Bottom # Mn   [3] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN VOCALIC R
11838        ; Bottom # Mn       DOGRA SIGN VIRAMA
11839        ; Bottom # Mn       DOGRA SIGN NUKTA

4.5 Script extensions

The following characters should be extended for usage with the Dogra script in ScriptExtensions.txt:

0964 # Po DEVANAGARI DANDA
0965 # Po DEVANAGARI DOUBLE DANDA
0966..096F # Nd [10] DEVANAGARI DIGIT ZERO..DEVANAGARI DIGIT NINE
A830..A835 # No [6] NORTH INDIC FRACTION ONE QUARTER..NORTH INDIC FRACTION THREE SIXTEENTHS
A836..A837 # So [2] NORTH INDIC QUARTER MARK..NORTH INDIC PLACEHOLDER MARK
A838 # Sc NORTH INDIC RUPEE MARK
A839 # So NORTH INDIC QUANTITY MARK


4.6 'Confusable' characters

Some Dogra characters that resemble characters encoded in other script blocks are:

DOGRA LETTER U             U+11187 SHARADA LETTER U
DOGRA LETTER UU            U+11188 SHARADA LETTER UU
DOGRA LETTER O             U+11684 TAKRI LETTER U
DOGRA LETTER KA            U+0A15 GURMUKHI LETTER KA
DOGRA LETTER CA            U+11196 SHARADA LETTER CA
DOGRA LETTER TTHA          U+1119C SHARADA LETTER TTHA
DOGRA LETTER TA            U+11699 TAKRI LETTER TA
DOGRA LETTER DHA           U+11680 TAKRI LETTER A
DOGRA LETTER PHA           U+1169F TAKRI LETTER PHA
DOGRA LETTER BHA           U+116A1 TAKRI LETTER BHA
DOGRA LETTER YA            U+116A3 TAKRI LETTER YA
DOGRA LETTER SHA           U+0936 DEVANAGARI LETTER SHA
DOGRA LETTER SSA           U+111B0 SHARADA LETTER SSA
DOGRA VOWEL SIGN U         U+0941 DEVANAGARI VOWEL SIGN U
DOGRA VOWEL SIGN UU        U+0942 DEVANAGARI VOWEL SIGN UU
DOGRA VOWEL SIGN E         U+116B2 TAKRI VOWEL SIGN E
DOGRA VOWEL SIGN AI        U+116B3 TAKRI VOWEL SIGN AI
DOGRA VOWEL SIGN O         U+116B4 TAKRI VOWEL SIGN O
DOGRA VOWEL SIGN AU        U+116B5 TAKRI VOWEL SIGN AU
DOGRA SIGN ANUSVARA        U+0902 DEVANAGARI SIGN ANUSVARA
DOGRA SIGN VIRAMA          U+094D DEVANAGARI SIGN VIRAMA
DOGRA SIGN NUKTA           U+093C DEVANAGARI SIGN NUKTA
DOGRA ABBREVIATION SIGN    U+0970 DEVANAGARI ABBREVIATION SIGN

5 References

Bhāskarācārya. 1872. Līlavatī. Translation into Dogri. Jammu: Vidya Vilas Press.

Grierson, George A. 1916. The Linguistic Survey of India. Vol. IX. Indo-Aryan Family, Central Group. Part I, Specimens of Western Hindī and Pañjābī. Calcutta: Office of the Superintendent of Government Printing, India.

Leitner, Gottlieb William. 1882. "A Collection of Specimens of Commercial and Other Alphabets and Handwritings as also of Multiplication Tables Current in Various Parts of the Panjab, Sind and the North West Provinces". Published as Appendix V of History of Indigenous Education in the Punjab. Lahore: Anjuman-i-Punjab Press.

Pandey, Anshuman. 2007. "Proposal to Encode North Indic Number Forms in ISO/IEC 10646" (L2/07-354). http://www.unicode.org/L2/L2007/07354-north-indic.pdf

Pandey, Anshuman. 2009. "Proposal to Encode the Takri Script in ISO/IEC 10646" (L2/09-424). http://www.unicode.org/L2/L2009/09424-takri.pdf

Pandey, Anshuman. 2015. "Preliminary Proposal to Encode the Dogra Script in Unicode" (L2/15-213). http://www.unicode.org/L2/L2015/15213-dogra.pdf

Staal, Fritz. 1984. Stamps of Jammu & Kashmir. New York: The Collectors Club.

von der Lin, Carol A. "Collection Kashmir". "Philatelic Glossary: Dōgrī-Takrī Scripts". http://www.kashmirstamps.com/DogriGloss.html


6 Acknowledgments

I would like to thank Christopher Shackle (School of Oriental and African Studies, London) for reviewing this proposal.

This project was made possible in part through a Google Research Award, granted to Deborah Anderson for the Script Encoding Initiative, and a grant from the United States National Endowment for the Humanities (PR-50205-15), which funds the Universal Scripts Project (part of the Script Encoding Initiative at the University of California, Berkeley). Any views, findings, conclusions, or recommendations expressed in this publication do not necessarily reflect those of Google or the National Endowment for the Humanities.


[Code chart: Dogra, U+11800 to U+1184F. Columns 1180 to 1184, rows 0 to F; characters are assigned at U+11800 to U+1183A.]

Dogra: character names list (U+11800 to U+1183A)

Punctuation
1183A DOGRA ABBREVIATION SIGN

Independent vowels
11800 DOGRA LETTER A
11801 DOGRA LETTER AA
11802 DOGRA LETTER I
11803 DOGRA LETTER II
11804 DOGRA LETTER U
11805 DOGRA LETTER UU
11806 DOGRA LETTER E
11807 DOGRA LETTER AI
11808 DOGRA LETTER O
11809 DOGRA LETTER AU

Consonants
1180A DOGRA LETTER KA
1180B DOGRA LETTER KHA
1180C DOGRA LETTER GA
1180D DOGRA LETTER GHA
1180E DOGRA LETTER NGA
1180F DOGRA LETTER CA
11810 DOGRA LETTER CHA
11811 DOGRA LETTER JA
11812 DOGRA LETTER JHA
11813 DOGRA LETTER NYA
11814 DOGRA LETTER TTA
11815 DOGRA LETTER TTHA
11816 DOGRA LETTER DDA
11817 DOGRA LETTER DDHA
11818 DOGRA LETTER NNA
11819 DOGRA LETTER TA
1181A DOGRA LETTER THA
1181B DOGRA LETTER DA
1181C DOGRA LETTER DHA
1181D DOGRA LETTER NA
1181E DOGRA LETTER PA
1181F DOGRA LETTER PHA
11820 DOGRA LETTER BA
11821 DOGRA LETTER BHA
11822 DOGRA LETTER MA
11823 DOGRA LETTER YA
11824 DOGRA LETTER RA
11825 DOGRA LETTER LA
11826 DOGRA LETTER VA
11827 DOGRA LETTER SHA
11828 DOGRA LETTER SSA
11829 DOGRA LETTER SA
1182A DOGRA LETTER HA
1182B DOGRA LETTER RRA

Dependent vowel signs
1182C DOGRA VOWEL SIGN AA
1182D DOGRA VOWEL SIGN I
1182E DOGRA VOWEL SIGN II
1182F DOGRA VOWEL SIGN U
11830 DOGRA VOWEL SIGN UU
11831 DOGRA VOWEL SIGN VOCALIC R
11832 DOGRA VOWEL SIGN E
11833 DOGRA VOWEL SIGN AI
11834 DOGRA VOWEL SIGN O
11835 DOGRA VOWEL SIGN AU

Various signs
11836 DOGRA SIGN ANUSVARA
11837 DOGRA SIGN VISARGA
11838 DOGRA SIGN VIRAMA
11839 DOGRA SIGN NUKTA

Figure 1: Comparison of Old Dogra with Takri and other scripts (from Leitner 1882: Set 2).

Figure 2: Comparison of Old Dogra with Takri and other scripts (from Leitner 1882: Set 2).

Figure 3: Specimen of the Dogri language written in Old Dogra (from Grierson 1916: 760).

Figure 4: Another specimen of the Dogri language written in Old Dogra (from Grierson 1916: 772).

Figure 5: Chart of the Old Dogra script (from Grierson 1916: 641).

Figure 6: Comparison of Gurmukhi, Kangra, and Old Dogra scripts (from Grierson 1916: 642).

Figure 7: Comparison of New Dogra ('Dogri') and Devanagari vowel and consonant letters (from Staal 1984: 33).

Figure 8: Comparison of New Dogra ('Dogri') and Devanagari vowel signs (from Staal 1984: 34).

Figure 9: The first page of a Dogri translation of the Līlavatī by Bhāskarācārya, a Sanskrit treatise on mathematics, printed in the New Dogra script in 1872 (p. 1). Figure 10 contains the second page of this book.

Figure 10: Another page from the Līlavatī in New Dogra (p. 2).

Figure 11: Page from the Līlavatī showing the use of printed New Dogra digits (p. 3).

Figure 12: Usage of fractions for denoting currency in New Dogra (from Staal 1984: 75). The text appears to be typeset using metal fonts. The fraction signs and currency marks may be represented using characters from the 'Common Indic Number Forms' block (U+A830) in Unicode.

Figure 13: Two specimens of a trilingual stamp for expedited mail from Jammu State in Old Dogra, Persian, and Devanagari scripts. The Dogra text reads <KA, KA, LA> <JA, RA, VOWEL SIGN U, RA, VOWEL SIGN II> kakal jarurī. The Persian reads خط ضروری ḵẖat zarūrī; the Devanagari reads आवश्यक पत्र āvaśyaka patra.

Figure 14: Four bilingual regular postage stamps from Jammu State with text in the Old Dogra and Persian scripts. The set comprises half 'anna' stamps and a one 'anna' stamp. In all stamps the Dogra text at the top reads <JA, MA, U> jamu 'Jammu' and <KA, CHA, MA, VOWEL SIGN II, RA> kachamīr (= kaśmīr) 'Kashmir'. In the half 'anna' stamps the first line of the center is Persian ۱۹۲۳ نم انه 1923 [samvat] nim ānah 'half ānnā'; the second line of the center is Dogra <A, DA, A, NA> ada ana 'half ānnā'. In the one 'anna' stamp the first line of the center text is Persian یك انه yak ānah 'one ānnā'; the second line of the center is Dogra <E, KA, A, NA> ekana 'one ānnā'. In all four stamps the third line of the center is the date 1923 [samvat] written in Devanagari-like digits १९२३. The Persian text at the bottom of the stamp reads قلمرو سرکار جمون و کشمیر qalamrau sarkār jammūn o kaśmīr 'dominions of the ruler of Jammu and Kashmir'.


Detail of top left (Devanagari)

Detail of top right (New Dogra)

Detail of center (New Dogra, Devanagari, and Urdu)

Figure 15: The mast of a two rupee non-judicial stamp paper from Kashmir State.

Figure 16: The mast of a five rupee non-judicial stamp paper from Kashmir State containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.

Figure 17: A one anna stamp from Jammu and Kashmir State with text in New Dogra, Devanagari, and Arabic (Urdu).

Figure 18: Telegraph stamps from Jammu and Kashmir State with text in Dogra, Devanagari, Urdu, and English. The Dogra denominations show usage of Common Indic fractions and the rupee mark.

postakarda kalamrau jammu kasmir va tibbataha
"Postcard of the realm of Jammu, Kashmir, and Tibet"

isa tarapha siva pate ke kucha na likho
"Do not write anything on this side except for the address"

jamu; sinagar
"Jammu"; "Srinagar"

bhaisakh (4); 4 vaisakh
"Vaisakh 4"; "4 Vaisakh"

Figure 19: A state postcard from Jammu and Kashmir with postmark of 19 April 1891. The inscription at top is in New Dogra and the content is in Persian. The postmarks are in English and Dogra.

One rupee banknote

Twenty rupee banknote

Figure 20: Bank notes from Jammu and Kashmir State containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.

Vowel letters

Letter   Chamba (Takri block)   Dogra (proposed)
A        U+11680                U+11800
AA       U+11681                U+11801
I        U+11682                U+11802
II       U+11683                none
U        U+11684                U+11804
UU       U+11685                none
E        U+11686                U+11806
AI       U+11687                U+11807
O        U+11688                U+11808
AU       U+11689                none

Vowel signs

Sign     Chamba (Takri block)   Dogra (proposed)
A        no dependent form      no dependent form
AA       U+116AD                U+1182C
I        U+116AE                U+1182D
II       U+116AF                U+1182E
U        U+116B0                U+1182F
UU       U+116B1                U+11830
E        U+116B2                U+11832
AI       U+116B3                U+11833
O        U+116B4                U+11834
AU       U+116B5                U+11835

Table 2: Comparison of Chamba and Dogra vowel letters and signs.

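Letter   Chamba (Takri block)   Dogra (proposed)
KA       U+1168A                U+1180A
KHA      U+1168B                U+1180B
GA       U+1168C                U+1180C
GHA      U+1168D                U+1180D
NGA      U+1168E                U+1180E
CA       U+1168F                U+1180F
CHA      U+11690                U+11810
JA       U+11691                U+11811
JHA      U+11692                U+11812
NYA      U+11693                U+11813
TTA      U+11694                U+11814
TTHA     U+11695                U+11815
DDA      U+11696                U+11816
DDHA     U+11697                U+11817
NNA      U+11698                U+11818
TA       U+11699                U+11819
THA      U+1169A                U+1181A
DA       U+1169B                U+1181B
DHA      U+1169C                U+1181C
NA       U+1169D                U+1181D
PA       U+1169E                U+1181E
PHA      U+1169F                U+1181F
BA       U+116A0                U+11820
BHA      U+116A1                U+11821
MA       U+116A2                U+11822
YA       U+116A3                U+11823
RA       U+116A4                U+11824
LA       U+116A5                U+11825
VA       U+116A6                U+11826
SHA      U+116A7                U+11827
SSA      none                   U+11828
SA       U+116A8                U+11829
HA       U+116A9                U+1182A
RRA      U+116AA                U+1182B

Table 3: Comparison of Chamba and Dogra consonant letters.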

ISO/IEC JTC 1/SC 2/WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646 [1]

Please fill all the sections A, B and C below. Please read Principles and Procedures Document (P & P) from http://std.dkuug.dk/JTC1/SC2/WG2/docs/principles.html for guidelines and details before filling this form. Please ensure you are using the latest Form from http://std.dkuug.dk/JTC1/SC2/WG2/docs/summaryform.html. See also http://std.dkuug.dk/JTC1/SC2/WG2/docs/roadmaps.html for latest Roadmaps.

A. Administrative

1. Title: Proposal to encode the Dogra script in Unicode
2. Requester's name: Anshuman Pandey, Script Encoding Initiative
3. Requester type (Member body/Liaison/Individual contribution): Liaison contribution
4. Submission date: 4 November 2015
5. Requester's reference (if applicable):
6. Choose one of the following:
   This is a complete proposal: Yes
   (or) More information will be provided later:

B. Technical – General

1. Choose one of the following:
   a. This proposal is for a new script (set of characters): Yes
      Proposed name of script: Dogra
   b. The proposal is for addition of character(s) to an existing block:
      Name of the existing block:
2. Number of characters in proposal: 59
3. Proposed category (select one from below; see section 2.2 of P&P document):
   A-Contemporary; B.1-Specialized (small collection); B.2-Specialized (large collection); C-Major extinct; D-Attested extinct; E-Minor extinct: X; F-Archaic Hieroglyphic or Ideographic; G-Obscure or questionable usage symbols
4. Is a repertoire including character names provided? Yes
   a. If YES, are the names in accordance with the "character naming guidelines" in Annex L of P&P document?
   b. Are the character shapes attached in a legible form suitable for review?
5. Fonts related:
   a. Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the standard? Anshuman Pandey
   b. Identify the party granting a license for use of the font by the editors (include address, e-mail, ftp-site, etc.): Anshuman Pandey
6. References:
   a. Are references (to other character sets, dictionaries, descriptive texts etc.) provided? Yes
   b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? Yes
7. Special encoding issues: Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration etc. (if yes please enclose information)? Yes
8. Additional Information: Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org for such information on other scripts. Also see Unicode Character Database (http://www.unicode.org/reports/tr44/) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard.

[1] Form number: N4502-F (Original 1994-10-14; Revised 1995-01, 1995-04, 1996-04, 1996-08, 1999-03, 2001-05, 2001-09, 2003-11, 2005-01, 2005-09, 2005-10, 2007-03, 2008-05, 2009-11, 2011-03, 2012-01)

C. Technical – Justification

1. Has this proposal for addition of character(s) been submitted before? No
   If YES, explain:
2. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? Yes
   If YES, with whom? Christopher Shackle (SOAS, London)
   If YES, available relevant documents:
3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? No
   Reference: Historical script, currently not used
4. The context of use for the proposed characters (type of use; common or rare): Common
   Reference:
5. Are the proposed characters in current use by the user community? No
   If YES, where? Reference:
6. After giving due considerations to the principles in the P&P document, must the proposed characters be entirely in the BMP? N/A
   If YES, is a rationale provided? If YES, reference:
7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? Yes
8. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? No
   If YES, is a rationale for its inclusion provided? If YES, reference:
9. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? No
   If YES, is a rationale for its inclusion provided? If YES, reference:
10. Can any of the proposed character(s) be considered to be similar (in appearance or function) to, or could be confused with, an existing character? No
    If YES, is a rationale for its inclusion provided? If YES, reference:
11. Does the proposal include use of combining characters and/or use of composite sequences? Yes
    If YES, is a rationale for such use provided? Yes
    If YES, reference: See text of proposal
    Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided? If YES, reference:
12. Does the proposal contain characters with any special properties such as control function or similar semantics? Yes
    If YES, describe in detail (include attachment if necessary): Virama; see text of proposal
13. Does the proposal contain any Ideographic compatibility characters? No
    If YES, are the equivalent corresponding unified ideographic characters identified? If YES, reference:

letters", where 'Dvigarta' is presented as the Sanskritic root of the name 'Dogra'. Another indigenous name for the script is name dogra akkhar, or "New Dogra letters".

3.4 Representative glyphs

The representative glyphs for Dogra characters are based primarily upon forms used in New Dogra. Characters from Old Dogra have been used when necessary, sourced primarily from figures 1 and 2. The font has been designed by the proposal author.

3.5 Vowel letters

There are 10 vowel letters: DOGRA LETTER A (U+11800), AA (U+11801), I (U+11802), II (U+11803), U (U+11804), UU (U+11805), E (U+11806), AI (U+11807), O (U+11808), and AU (U+11809).

In Old Dogra the values of three vowel letters are variable: the forms now encoded as DOGRA LETTER II and DOGRA LETTER I, and a third Old Dogra letter. As shown in the accompanying sources, the form of II was used for both i and ī; the form of I for u and ū; and the third letter for u, ū, o, and au. In New Dogra the values of these three letters became fixed, and new letters were introduced in order to provide distinctive representations for all basic vowels. It is apparent that the attempt to stabilize the vowel repertoire resulted in overlap and reassignment of several letters. The Old Dogra form of II (i, ī) was retained solely for ī, and the form of I (u, ū) was reassigned for i. A modified form of the third letter (u, o) came to represent u, while the form of DOGRA LETTER UU was introduced for ū, which is certainly an adaptation of the letter ū from Devanagari or Sharada. Distinct letters for DOGRA LETTER O (o) and DOGRA LETTER AU (au) were provided by placing the corresponding dependent vowel signs upon the modified u.

Value   Old                      New
i       form of II (U+11803)     I (U+11802)
ī       none                     II (U+11803)
u       form of I (U+11802)      U (U+11804)
ū       none                     UU (U+11805)
o       third Old Dogra letter   O (U+11808)
au      third Old Dogra letter   AU (U+11809)

The proposed vowel letters are based upon the repertoire of New Dogra. This approach completely accommodates vowels in the old and new forms of the script. Although the names of New Dogra vowel letters do not strictly correlate with values of Old Dogra vowels, the proposed letters can be used for representing the old script. DOGRA LETTER I (U+11802) can be used for u; DOGRA LETTER II (U+11803) can be used for i; and DOGRA LETTER U (U+11804) can be used for o and au; the height and curvature of the left stroke may be considered a stylistic attribute.

3.6 Vowel signs

There are 10 dependent vowel signs: DOGRA VOWEL SIGN AA (U+1182C), I (U+1182D), II (U+1182E), U (U+1182F), UU (U+11830), VOCALIC R (U+11831), E (U+11832), AI (U+11833), O (U+11834), and AU (U+11835).

DOGRA VOWEL SIGN VOCALIC R (U+11831) appears to have been introduced as part of New Dogra. It is used for representing words of Sanskrit origin that contain the vocalic sound r. Its shape is derived from either U+11190 or U+0943. There is no corresponding independent vowel letter.

There is an alphabetic aspect to the representation of consonant-vowel syllables in written Dogra. The dependent vowel may be represented using the independent vowel letter as well as the vowel sign, e.g. <KA, LETTER E> (U+1180A U+11806) may be used in place of <KA, VOWEL SIGN E> (U+1180A U+11832) for the syllable ke.

There are differences in the shape of certain vowel signs between the old and new script. The typical Takri forms of the vowel signs -u and -ū, which also resemble corresponding forms in Gurmukhi, were replaced with forms resembling Devanagari signs: DOGRA VOWEL SIGN U (U+1182F) and DOGRA VOWEL SIGN UU (U+11830), respectively. The typical Takri style of the vowel sign o was applied for writing DOGRA VOWEL SIGN AU (U+11835) au, while a simplified form of o, DOGRA VOWEL SIGN O (U+11834), was used for o. The duplication of o for writing au was eliminated. A comparison of vowel signs in the old and new scripts, as they align with the proposed characters, is as follows:

[Table: Old Dogra and New Dogra forms of the vowel signs I (U+1182D), II (U+1182E), U (U+1182F), UU (U+11830), O (U+11834), and AU (U+11835)]

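3.7 Consonants

There are 34 consonant letters: DOGRA LETTER KA (U+1180A), KHA (U+1180B), GA (U+1180C), GHA (U+1180D), NGA (U+1180E), CA (U+1180F), CHA (U+11810), JA (U+11811), JHA (U+11812), NYA (U+11813), TTA (U+11814), TTHA (U+11815), DDA (U+11816), DDHA (U+11817), NNA (U+11818), TA (U+11819), THA (U+1181A), DA (U+1181B), DHA (U+1181C), NA (U+1181D), PA (U+1181E), PHA (U+1181F), BA (U+11820), BHA (U+11821), MA (U+11822), YA (U+11823), RA (U+11824), LA (U+11825), VA (U+11826), SHA (U+11827), SSA (U+11828), SA (U+11829), HA (U+1182A), and RRA (U+1182B).

DOGRA LETTER SSA (U+11828) was introduced into New Dogra for representing the retroflex sibilant ṣa, which occurs in words of Sanskrit origin. It resembles U+111B0 SHARADA LETTER SSA.

There are glyphic variants of several consonant letters:

[Table: regular and variant forms of GA (U+1180C), GHA (U+1180D), CHA (U+11810), JHA (U+11812), THA (U+1181A), DHA (U+1181C), and RA (U+11824)]

There is evidence of usage of different forms of the same letter in the available sources. For instance, both the regular form of DOGRA LETTER CHA (U+11810) and a variant form are used for the same letter.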

3.8 Anusvara

DOGRA SIGN ANUSVARA (U+11836) is used for marking nasalization.

3.9 Visarga

DOGRA SIGN VISARGA (U+11837) is used for indicating post-vocalic aspiration in words of Sanskrit origin. It occurs in the word <NA, MA, VISARGA> namaḥ "homage to".

3.10 Nukta

DOGRA SIGN NUKTA (U+11839) is used for representing sounds that are not native to Dogri and related languages.

3.11 Virama

DOGRA SIGN VIRAMA (U+11838) has two functions, similar to the corresponding character of Devanagari. It is used as a halanta for marking the absence of the inherent vowel of a consonant letter. It is also a control character that is used for producing conjuncts.

3.12 Consonant conjuncts

Consonant clusters are rendered in three ways: 1) with visible halanta beneath the full form of each bare consonant; 2) with a half-form of the initial consonant followed by the full form of the following letter; and 3) as ligatures. There are no formal conventions for the written expression of conjuncts. DOGRA SIGN VIRAMA (U+11838) is used for the encoded representation of conjuncts.

3.12.1 Types of conjuncts

Some representations of Dogra conjuncts, as found in the available sources, are described below.

• Visible halanta: In both Old and New Dogra the most common method of representing consonant clusters is by placing a halanta beneath each bare consonant.

• Half-forms: Another method is to use half-forms of letters, if the graphical structure provides such an opportunity.

One excerpt shows the conjunct śra in the word śrī (a Sanskrit honorific). The half-form is produced by dropping the right vertical bar of DOGRA LETTER SHA (U+11827). The conjunct śra is used consistently in New Dogra for representing the cluster śra; the form <SHA, VIRAMA, RA> with visible halanta beneath a full form of SHA does not occur in the available sources.

• Ligatures: There are two types of ligature. In the first, the shapes of individual letters are visible. In one excerpt the boxed text contains the conjunct sṭa from the word sṭāṃp, a transcription of the English "stamp".

The second type is an atomic (akhaṇḍa) ligature, whose form may be considered a distinctive letter. Such a ligature is the atomic ligature kṣa, which occurs in the word akṣaroṃ "in letters" (oblique form of akṣara "letter"). The conjunct kṣa is used consistently in New Dogra for representing the cluster kṣa. Its shape may be influenced by Devanagari kṣa.

• Forms of ra: Special conjunct forms of DOGRA LETTER RA (U+11824) do not generally occur, e.g. repha or vattu. Cluster-initial ra is represented as a full form with halanta, as in rṇa in the word saṃpūrṇa, and non-initial ra is treated similarly, as in pra in the word prabhu.

However, there is one instance in the available sources in which cluster-initial ra is represented as a repha. The occurrence is curious because the word caturbhuja "square", in which repha occurs, appears three times in proximity but is spelled differently in all three instances.

In the boxed text at the top of the excerpt, the ra in caturbhuja is written with halanta: <CA, TA, VOWEL SIGN U, RA, VIRAMA, BHA, VOWEL SIGN U, JA> caturbhuja. In the text at bottom left, ra occurs with its inherent vowel intact, without halanta: <CA, TA, VOWEL SIGN U, RA, BHA, VOWEL SIGN U, JA> caturabhuja. At bottom right, ra is represented as repha: caturbhuja.

The usage of repha in Dogra is to be considered anomalous. There is no present need to support this form of ra in encoded representation.

3.12.2 Encoded representation of conjuncts

The Devanagari model for the representation of conjuncts is appropriate for Dogra. In encoded text, conjuncts are produced by placing the control character DOGRA SIGN VIRAMA (U+11838) after each non-final consonant in a cluster, i.e. <C, VIRAMA, C> for a cluster of two consonants, with further <C, VIRAMA> pairs prefixed for longer clusters.

Representations of conjuncts rendered as atomic ligatures are:

kṣa <KA U+1180A, VIRAMA U+11838, SSA U+11828>

sṭa <SA U+11829, VIRAMA U+11838, TTA U+11814>

Examples of conjuncts rendered using half-forms are:

śra <SHA U+11827, VIRAMA U+11838, RA U+11824>

Conjuncts displayed using visible halanta are:

pra <PA U+1181E, VIRAMA U+11838, RA U+11824>

rṇa <RA U+11824, VIRAMA U+11838, NNA U+11818>


3.12.3 Modifying conjunct formation

As is the case in Devanagari and other Indic scripts, a ligature may be broken using the generic control character U+200C ZERO WIDTH NON-JOINER (ZWNJ):

kṣa as a ligature: <KA U+1180A, VIRAMA U+11838, SSA U+11828>

kṣa with visible halanta: <KA U+1180A, VIRAMA U+11838, ZWNJ U+200C, SSA U+11828>

The rendering of a conjunct as a ligature or with visible halanta is a matter of font design.
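The encoding model for conjuncts can be sketched in a few lines of code. The example below is only an illustration: the function name `encode_cluster` is hypothetical, and the code points are those proposed in this document, which may differ from the assignments in a published version of the standard. It places the virama after each non-final consonant and, optionally, a ZWNJ after each virama to request the form with visible halanta.

```python
VIRAMA = 0x11838  # DOGRA SIGN VIRAMA, as proposed in this document
ZWNJ = 0x200C     # ZERO WIDTH NON-JOINER

KA, SSA, SHA, RA = 0x1180A, 0x11828, 0x11827, 0x11824


def encode_cluster(consonants, break_ligature=False):
    """Join consonant code points with VIRAMA, optionally followed by ZWNJ."""
    out = []
    last_index = len(consonants) - 1
    for index, consonant in enumerate(consonants):
        out.append(consonant)
        if index < last_index:
            out.append(VIRAMA)
            if break_ligature:
                out.append(ZWNJ)
    return "".join(chr(cp) for cp in out)


ksa_ligature = encode_cluster([KA, SSA])
ksa_visible_halanta = encode_cluster([KA, SSA], break_ligature=True)
sra = encode_cluster([SHA, RA])
```

Whether `ksa_ligature` is actually displayed as an atomic ligature depends on the font; the encoded text only expresses the request.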

3.13 Punctuation

Abbreviation sign: DOGRA ABBREVIATION SIGN (U+1183A) is used for indicating truncation of text. In one excerpt the highlighted text <LA, VOWEL SIGN II, ABBREVIATION SIGN> <VA, ABBREVIATION SIGN> <BHA, VOWEL SIGN AA, ABBREVIATION SIGN> (lī va bhā 23) is an abbreviation for līla vatī bhāga 23.

Daṇḍā-s: The daṇḍā and double daṇḍā are commonly used in Dogra documents for marking the ends of sentences and paragraphs. Both marks are attested in the excerpts from the available sources.

These daṇḍā-s are not proposed for encoding as separate characters in the 'Dogra' block. They resemble U+0964 DEVANAGARI DANDA and U+0965 DEVANAGARI DOUBLE DANDA and may be represented using these characters. The Devanagari daṇḍā-s are to be specified as extensions for Dogra.

Other signs: Other marks of punctuation, such as dashes, are also used.


A hyphen is used in some cases for indicating breaks in a word at the end of a line (see section 3.16).

3.14 Digits

Digits are commonly used in Dogra documents (see figure 11). The style of digits varies across written and printed sources. Some digits in Old Dogra sources resemble those of Takri (see figure 4), while those in New Dogra sources are similar to Devanagari. The digits 1–9 and the number 10 are attested in the Līlavatī, written in New Dogra.

The majority of these digits resemble Devanagari forms; the exception is 'four'. The form of Dogra 'four' does not resemble U+116C4 TAKRI DIGIT FOUR, which is nearly identical to the Devanagari digit four. Information about its origins is not known at this time, but it may be related to U+111D4 SHARADA DIGIT FOUR. The forms of 'eight' and 'nine' are well-known variant forms of the regular Devanagari digits.

Additional research into Dogra digits is required before they may be considered for encoding. The available sources suggest that there may have been a cultural understanding that Dogra possessed unique digits. For instance, on the bank notes shown in figure 20, the digits written in Devanagari are rather similar, but are nonetheless displayed twice in order to align the display of the currency value with each script used for the inscriptions.

At present, script-specific digits are not proposed for inclusion in the Dogra repertoire. Devanagari digits should be used normatively and should be extended for usage with Dogra. If future research yields distinctive digits for Dogra, there is sufficient space in the block to accommodate their encoding.
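Since Devanagari digits are to be used for Dogra text, a number can be rendered by mapping each decimal digit onto U+0966..U+096F. The sketch below is a minimal illustration; the function name `to_devanagari_digits` is hypothetical, and the example value is the samvat date 1923 read from the stamps in figure 14.

```python
DEVANAGARI_DIGIT_ZERO = 0x0966  # U+0966 DEVANAGARI DIGIT ZERO


def to_devanagari_digits(number: int) -> str:
    """Render a non-negative integer using Devanagari digits."""
    if number < 0:
        raise ValueError("only non-negative integers are supported")
    return "".join(chr(DEVANAGARI_DIGIT_ZERO + int(d)) for d in str(number))


stamp_date = to_devanagari_digits(1923)
```

A font for Dogra could supply the script-specific shape of 'four' for these code points when they occur in Dogra text, but that is a design choice outside the scope of this sketch.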

3.15 Number forms

Fraction signs and currency marks are commonly used in Dogra sources (see figure 12). These may be represented using the fraction and currency signs already encoded in the 'Common Indic Number Forms' block (U+A830). These characters are specified as extensions for Dogra. The 'Common Indic Number Forms' are described briefly in The Unicode Standard (chapter 22.3, 'Numerals', pp. 770–771) and in more detail in the original encoding proposal (L2/07-354).

3.16 Linebreaking

Linebreaking occurs after an orthographic syllable. Breaks are generally not indicated in informal sources, but are marked in literary materials using hyphens.


3.17 Collation

The primary collating order for Dogra is as follows (characters are identified by name, with the prefix DOGRA LETTER omitted and DOGRA VOWEL SIGN shortened to "sign"):

A < AA < I < II < U < UU < E < AI < O < AU <

KA < KHA < GA < GHA < NGA < CA < CHA < JA <

JHA < NYA < TTA < TTHA < DDA < DDHA < NNA < TA <

THA < DA < DHA < NA < PA < PHA < BA < BHA <

MA < YA < RA < LA < VA < SHA < SSA < SA < HA <

RRA < sign AA < sign I < sign II <

sign U < sign UU < sign VOCALIC R < sign E <

sign AI < sign O < sign AU

The following characters have secondary weights: DOGRA SIGN ANUSVARA (U+11836), DOGRA SIGN VISARGA (U+11837), DOGRA SIGN VIRAMA (U+11838), DOGRA SIGN NUKTA (U+11839).
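The collating order above coincides with the order of the proposed code points U+11800 through U+11835, so a primary-level comparison can be sketched with a simple rank table. This is a deliberately simplified model rather than an implementation of the Unicode Collation Algorithm: the names `primary_key` and `RANK` are hypothetical, the four signs with secondary weights are skipped at the primary level, and the code points are those proposed in this document.

```python
# Primary order: DOGRA LETTER A .. DOGRA VOWEL SIGN AU, in code point order.
PRIMARY = list(range(0x11800, 0x11836))
RANK = {cp: index for index, cp in enumerate(PRIMARY)}


def primary_key(text):
    """Return the list of primary ranks, skipping characters without one."""
    return [RANK[ord(ch)] for ch in text if ord(ch) in RANK]


A = chr(0x11800)
KA = chr(0x1180A)
SIGN_AA = chr(0x1182C)
ANUSVARA = chr(0x11836)

words = [KA + SIGN_AA, KA + ANUSVARA, A, KA]
ordered = sorted(words, key=primary_key)
```

Because `sorted` is stable, strings that differ only by a secondary-weight sign (here KA and KA with anusvara) keep their input order; a fuller implementation would break such ties with a second comparison level.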

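4 Character data

4.1 Character properties

The properties for Dogra in the Unicode Character Database format are:

11800;DOGRA LETTER A;Lo;0;L;;;;;N;;;;;
11801;DOGRA LETTER AA;Lo;0;L;;;;;N;;;;;
11802;DOGRA LETTER I;Lo;0;L;;;;;N;;;;;
11803;DOGRA LETTER II;Lo;0;L;;;;;N;;;;;
11804;DOGRA LETTER U;Lo;0;L;;;;;N;;;;;
11805;DOGRA LETTER UU;Lo;0;L;;;;;N;;;;;
11806;DOGRA LETTER E;Lo;0;L;;;;;N;;;;;
11807;DOGRA LETTER AI;Lo;0;L;;;;;N;;;;;
11808;DOGRA LETTER O;Lo;0;L;;;;;N;;;;;
11809;DOGRA LETTER AU;Lo;0;L;;;;;N;;;;;
1180A;DOGRA LETTER KA;Lo;0;L;;;;;N;;;;;
1180B;DOGRA LETTER KHA;Lo;0;L;;;;;N;;;;;
1180C;DOGRA LETTER GA;Lo;0;L;;;;;N;;;;;
1180D;DOGRA LETTER GHA;Lo;0;L;;;;;N;;;;;
1180E;DOGRA LETTER NGA;Lo;0;L;;;;;N;;;;;
1180F;DOGRA LETTER CA;Lo;0;L;;;;;N;;;;;
11810;DOGRA LETTER CHA;Lo;0;L;;;;;N;;;;;
11811;DOGRA LETTER JA;Lo;0;L;;;;;N;;;;;
11812;DOGRA LETTER JHA;Lo;0;L;;;;;N;;;;;
11813;DOGRA LETTER NYA;Lo;0;L;;;;;N;;;;;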

11

Proposal to encode the Dogra script in Unicode Anshuman Pandey

11814DOGRA LETTER TTALo0LN11815DOGRA LETTER TTHALo0LN11816DOGRA LETTER DDALo0LN11817DOGRA LETTER DDHALo0LN11818DOGRA LETTER NNALo0LN11819DOGRA LETTER TALo0LN1181ADOGRA LETTER THALo0LN1181BDOGRA LETTER DALo0LN1181CDOGRA LETTER DHALo0LN1181DDOGRA LETTER NALo0LN1181EDOGRA LETTER PALo0LN1181FDOGRA LETTER PHALo0LN11820DOGRA LETTER BALo0LN11821DOGRA LETTER BHALo0LN11822DOGRA LETTER MALo0LN11823DOGRA LETTER YALo0LN11824DOGRA LETTER RALo0LN11825DOGRA LETTER LALo0LN11826DOGRA LETTER VALo0LN11827DOGRA LETTER SHALo0LN11828DOGRA LETTER SSALo0LN11829DOGRA LETTER SALo0LN1182ADOGRA LETTER HALo0LN1182BDOGRA LETTER RRALo0LN1182CDOGRA VOWEL SIGN AAMc0LN1182DDOGRA VOWEL SIGN IMc0LN1182EDOGRA VOWEL SIGN IIMc0LN1182FDOGRA VOWEL SIGN UMn0NSMN11830DOGRA VOWEL SIGN UUMn0NSMN11831DOGRA VOWEL SIGN VOCALIC RMn0NSMN11832DOGRA VOWEL SIGN EMn0NSMN11833DOGRA VOWEL SIGN AIMn0NSMN11834DOGRA VOWEL SIGN OMn0NSMN11835DOGRA VOWEL SIGN AUMn0NSMN11836DOGRA SIGN ANUSVARAMn0NSMN11837DOGRA SIGN VISARGAMc0LN11838DOGRA SIGN VIRAMAMn9NSMN11839DOGRA SIGN NUKTAMn7NSMN1183ADOGRA ABBREVIATION SIGNPo0LN

4.2 Linebreaking

Linebreaking properties, given in the data format of LineBreak.txt:

11800..1182B;AL # Lo [44] DOGRA LETTER A..DOGRA LETTER RRA
1182C..1182E;CM # Mc  [3] DOGRA VOWEL SIGN AA..DOGRA VOWEL SIGN II
1182F..11835;CM # Mn  [7] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN AU
11836;CM        # Mn      DOGRA SIGN ANUSVARA
11837;CM        # Mc      DOGRA SIGN VISARGA
11838..11839;CM # Mn  [2] DOGRA SIGN VIRAMA..DOGRA SIGN NUKTA
1183A;AL        # Po      DOGRA ABBREVIATION SIGN
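A minimal Python sketch of how range-based data of this kind could be loaded and queried follows. It assumes the `first..last;VALUE # comment` layout used by LineBreak.txt; the function names `parse_ranges` and `line_break_class`, and the `XX` default for unlisted code points, are choices made for this example.

```python
from typing import Iterable, List, Tuple

Range = Tuple[int, int, str]


def parse_ranges(lines: Iterable[str]) -> List[Range]:
    """Parse 'XXXX..YYYY;VALUE # comment' lines into (first, last, value)."""
    table = []
    for line in lines:
        data = line.split("#", 1)[0].strip()  # drop the trailing comment
        if not data:
            continue
        span, value = (part.strip() for part in data.split(";"))
        first, _, last = span.partition("..")
        table.append((int(first, 16), int(last or first, 16), value))
    return table


def line_break_class(table: List[Range], code_point: int, default: str = "XX") -> str:
    """Return the class of the first range containing code_point."""
    for first, last, value in table:
        if first <= code_point <= last:
            return value
    return default


# Lines copied from the listing in this section.
LINES = [
    "11800..1182B;AL # Lo [44] DOGRA LETTER A..DOGRA LETTER RRA",
    "1182C..1182E;CM # Mc  [3] DOGRA VOWEL SIGN AA..DOGRA VOWEL SIGN II",
    "11836;CM        # Mn      DOGRA SIGN ANUSVARA",
    "1183A;AL        # Po      DOGRA ABBREVIATION SIGN",
]

table = parse_ranges(LINES)
print(line_break_class(table, 0x1180A), line_break_class(table, 0x1182D))
```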

4.3 Syllabic categories

Syllabic categories, given in the format of IndicSyllabicCategory.txt:

# Indic_Syllabic_Category=Bindu
11836        ; Bindu             # Mn      DOGRA SIGN ANUSVARA

# Indic_Syllabic_Category=Visarga
11837        ; Visarga           # Mc      DOGRA SIGN VISARGA

# Indic_Syllabic_Category=Nukta
11839        ; Nukta             # Mn      DOGRA SIGN NUKTA

# Indic_Syllabic_Category=Virama
11838        ; Virama            # Mn      DOGRA SIGN VIRAMA

# Indic_Syllabic_Category=Vowel_Independent
11800..11808 ; Vowel_Independent # Lo  [3] DOGRA LETTER A..DOGRA LETTER AU

# Indic_Syllabic_Category=Vowel_Dependent
1182C..1182E ; Vowel_Dependent   # Mc  [3] DOGRA VOWEL SIGN AA..DOGRA VOWEL SIGN II
1182F..11835 ; Vowel_Dependent   # Mn  [7] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN AU

# Indic_Syllabic_Category=Consonant
1180A..1182B ; Consonant         # Lo [34] DOGRA LETTER KA..DOGRA LETTER RRA

4.4 Positional categories

Positional data for characters, in the format of IndicPositionalCategory.txt:

# Indic_Positional_Category=Right
1182C        ; Right  # Mc      DOGRA VOWEL SIGN AA
1182E        ; Right  # Mc      DOGRA VOWEL SIGN II
11837        ; Right  # Mc      DOGRA SIGN VISARGA

# Indic_Positional_Category=Left
1182D        ; Left   # Mc      DOGRA VOWEL SIGN I

# Indic_Positional_Category=Top
11836        ; Top    # Mn      DOGRA SIGN ANUSVARA
11832..11835 ; Top    # Mn  [4] DOGRA VOWEL SIGN E..DOGRA VOWEL SIGN AU

# Indic_Positional_Category=Bottom
1182F..11831 ; Bottom # Mn  [3] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN VOCALIC R
11838        ; Bottom # Mn      DOGRA SIGN VIRAMA
11839        ; Bottom # Mn      DOGRA SIGN NUKTA

4.5 Script extensions

The following characters should be extended for usage with the Dogra script in ScriptExtensions.txt:

0964       # Po      DEVANAGARI DANDA
0965       # Po      DEVANAGARI DOUBLE DANDA
0966..096F # Nd [10] DEVANAGARI DIGIT ZERO..DEVANAGARI DIGIT NINE
A830..A835 # No  [6] NORTH INDIC FRACTION ONE QUARTER..NORTH INDIC FRACTION THREE SIXTEENTHS
A836..A837 # So  [2] NORTH INDIC QUARTER MARK..NORTH INDIC PLACEHOLDER MARK
A838       # Sc      NORTH INDIC RUPEE MARK
A839       # So      NORTH INDIC QUANTITY MARK
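The practical effect of these extensions can be sketched in Python: a checker for "Dogra text" would accept the characters of the Dogra block together with the shared dandas, digits, and North Indic number forms. The block range 11800..1184F is taken from the code chart header; the names `usable_with_dogra` and `foreign_characters`, and the decision to ignore whitespace, are assumptions of this sketch.

```python
DOGRA_BLOCK = range(0x11800, 0x11850)  # block 11800..1184F

# Characters from other blocks that are extended for use with Dogra.
EXTENDED = (
    range(0x0964, 0x0966),  # DEVANAGARI DANDA, DOUBLE DANDA
    range(0x0966, 0x0970),  # DEVANAGARI DIGIT ZERO..NINE
    range(0xA830, 0xA83A),  # NORTH INDIC fractions and marks
)


def usable_with_dogra(ch: str) -> bool:
    """True if ch is in the Dogra block or in one of the extended ranges."""
    cp = ord(ch)
    return cp in DOGRA_BLOCK or any(cp in r for r in EXTENDED)


def foreign_characters(text: str) -> list:
    """Return the non-whitespace characters that fall outside those ranges."""
    return [ch for ch in text if not ch.isspace() and not usable_with_dogra(ch)]


# KA + VOWEL SIGN AA, then Devanagari digits 1 and 2, then a danda.
sample = "\U0001180A\U0001182C \u0967\u0968\u0964"
print(foreign_characters(sample))
```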


4.6 ‘Confusable’ characters

Some Dogra characters that resemble characters encoded in other script blocks are:

DOGRA LETTER U            U+11187 SHARADA LETTER U
DOGRA LETTER UU           U+11188 SHARADA LETTER UU
DOGRA LETTER O            U+11684 TAKRI LETTER U
DOGRA LETTER KA           U+0A15  GURMUKHI LETTER KA
DOGRA LETTER CA           U+11196 SHARADA LETTER CA
DOGRA LETTER TTHA         U+1119C SHARADA LETTER TTHA
DOGRA LETTER TA           U+11699 TAKRI LETTER TA
DOGRA LETTER DHA          U+11680 TAKRI LETTER A
DOGRA LETTER PHA          U+1169F TAKRI LETTER PHA
DOGRA LETTER BHA          U+116A1 TAKRI LETTER BHA
DOGRA LETTER YA           U+116A3 TAKRI LETTER YA
DOGRA LETTER SHA          U+0936  DEVANAGARI LETTER SHA
DOGRA LETTER SSA          U+111B0 SHARADA LETTER SSA
DOGRA VOWEL SIGN U        U+0941  DEVANAGARI VOWEL SIGN U
DOGRA VOWEL SIGN UU       U+0942  DEVANAGARI VOWEL SIGN UU
DOGRA VOWEL SIGN E        U+116B2 TAKRI VOWEL SIGN E
DOGRA VOWEL SIGN AI       U+116B3 TAKRI VOWEL SIGN AI
DOGRA VOWEL SIGN O        U+116B4 TAKRI VOWEL SIGN O
DOGRA VOWEL SIGN AU       U+116B5 TAKRI VOWEL SIGN AU
DOGRA SIGN ANUSVARA       U+0902  DEVANAGARI SIGN ANUSVARA
DOGRA SIGN VIRAMA         U+094D  DEVANAGARI SIGN VIRAMA
DOGRA SIGN NUKTA          U+093C  DEVANAGARI SIGN NUKTA
DOGRA ABBREVIATION SIGN   U+0970  DEVANAGARI ABBREVIATION SIGN
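One way such a list might be put to use is to flag look-alike characters from another script that appear inside otherwise Dogra text. The Python sketch below uses only the Devanagari and Gurmukhi pairs from the list above; the `CONFUSABLES` table layout and the `suspicious_positions` function are hypothetical, and real confusable detection (for example, the Unicode security mechanisms) is considerably more involved.

```python
# Dogra code point -> look-alike code point, copied from the list above
# (Devanagari and Gurmukhi pairs only, to keep the example short).
CONFUSABLES = {
    0x1180A: 0x0A15,  # DOGRA LETTER KA ~ GURMUKHI LETTER KA
    0x11827: 0x0936,  # DOGRA LETTER SHA ~ DEVANAGARI LETTER SHA
    0x1182F: 0x0941,  # DOGRA VOWEL SIGN U ~ DEVANAGARI VOWEL SIGN U
    0x11830: 0x0942,  # DOGRA VOWEL SIGN UU ~ DEVANAGARI VOWEL SIGN UU
    0x11836: 0x0902,  # DOGRA SIGN ANUSVARA ~ DEVANAGARI SIGN ANUSVARA
    0x11838: 0x094D,  # DOGRA SIGN VIRAMA ~ DEVANAGARI SIGN VIRAMA
    0x11839: 0x093C,  # DOGRA SIGN NUKTA ~ DEVANAGARI SIGN NUKTA
    0x1183A: 0x0970,  # DOGRA ABBREVIATION SIGN ~ DEVANAGARI ABBREVIATION SIGN
}
LOOKALIKE_TO_DOGRA = {other: dogra for dogra, other in CONFUSABLES.items()}


def suspicious_positions(text: str) -> list:
    """Indices of look-alike characters inside text that contains Dogra."""
    has_dogra = any(0x11800 <= ord(ch) <= 0x1184F for ch in text)
    if not has_dogra:
        return []
    return [i for i, ch in enumerate(text) if ord(ch) in LOOKALIKE_TO_DOGRA]


# Dogra KA + Devanagari virama (wrong script) + Dogra RA.
mixed = chr(0x1180A) + chr(0x094D) + chr(0x11824)
print(suspicious_positions(mixed))
```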

5 References

Bhāskarācārya. 1872. Līlavatī. Translation into Dogri. Jammu: Vidya Vilas Press.

Grierson, George A. 1916. The Linguistic Survey of India. Vol. IX. Indo-Aryan Family. Central Group. Part I. Specimens of Western Hindī and Pañjābī. Calcutta: Office of the Superintendent of Government Printing, India.

Leitner, Gottlieb William. 1882. “A Collection of Specimens of Commercial and Other Alphabets and Handwritings as also of Multiplication Tables Current in Various Parts of the Panjab, Sind and the North West Provinces”. Published as Appendix V of History of Indigenous Education in the Punjab. Lahore: Anjuman-i-Punjab Press.

Pandey, Anshuman. 2007. “Proposal to Encode North Indic Number Forms in ISO/IEC 10646” (L2/07-354). http://www.unicode.org/L2/L2007/07354-north-indic.pdf

Pandey, Anshuman. 2009. “Proposal to Encode the Takri Script in ISO/IEC 10646” (L2/09-424). http://www.unicode.org/L2/L2009/09424-takri.pdf

Pandey, Anshuman. 2015. “Preliminary Proposal to Encode the Dogra Script in Unicode” (L2/15-213). http://www.unicode.org/L2/L2015/15213-dogra.pdf

Staal, Fritz. 1984. Stamps of Jammu & Kashmir. New York: The Collectors Club.

von der Lin, Carol A. “Collection Kashmir”. “Philatelic Glossary: Dōgrī-Takrī Scripts”. http://www.kashmirstamps.com/DogriGloss.html


6 Acknowledgments

I would like to thank Christopher Shackle (School of Oriental and African Studies, London) for reviewing this proposal.

This project was made possible in part through a Google Research Award, granted to Deborah Anderson for the Script Encoding Initiative, and a grant from the United States National Endowment for the Humanities (PR-50205-15), which funds the Universal Scripts Project (part of the Script Encoding Initiative at the University of California, Berkeley). Any views, findings, conclusions or recommendations expressed in this publication do not necessarily reflect those of Google or the National Endowment for the Humanities.

Code chart: Dogra, range 11800–1184F (columns 1180 to 1184). Printed using UniBook™ (http://www.unicode.org/unibook/), 04-Nov-2015. [Glyph grid not reproduced.]

Dogra names list: 11800–1183A

Independent vowels
11800  DOGRA LETTER A
11801  DOGRA LETTER AA
11802  DOGRA LETTER I
11803  DOGRA LETTER II
11804  DOGRA LETTER U
11805  DOGRA LETTER UU
11806  DOGRA LETTER E
11807  DOGRA LETTER AI
11808  DOGRA LETTER O
11809  DOGRA LETTER AU

Consonants
1180A  DOGRA LETTER KA
1180B  DOGRA LETTER KHA
1180C  DOGRA LETTER GA
1180D  DOGRA LETTER GHA
1180E  DOGRA LETTER NGA
1180F  DOGRA LETTER CA
11810  DOGRA LETTER CHA
11811  DOGRA LETTER JA
11812  DOGRA LETTER JHA
11813  DOGRA LETTER NYA
11814  DOGRA LETTER TTA
11815  DOGRA LETTER TTHA
11816  DOGRA LETTER DDA
11817  DOGRA LETTER DDHA
11818  DOGRA LETTER NNA
11819  DOGRA LETTER TA
1181A  DOGRA LETTER THA
1181B  DOGRA LETTER DA
1181C  DOGRA LETTER DHA
1181D  DOGRA LETTER NA
1181E  DOGRA LETTER PA
1181F  DOGRA LETTER PHA
11820  DOGRA LETTER BA
11821  DOGRA LETTER BHA
11822  DOGRA LETTER MA
11823  DOGRA LETTER YA
11824  DOGRA LETTER RA
11825  DOGRA LETTER LA
11826  DOGRA LETTER VA
11827  DOGRA LETTER SHA
11828  DOGRA LETTER SSA
11829  DOGRA LETTER SA
1182A  DOGRA LETTER HA
1182B  DOGRA LETTER RRA

Dependent vowel signs
1182C  DOGRA VOWEL SIGN AA
1182D  DOGRA VOWEL SIGN I
1182E  DOGRA VOWEL SIGN II
1182F  DOGRA VOWEL SIGN U
11830  DOGRA VOWEL SIGN UU
11831  DOGRA VOWEL SIGN VOCALIC R
11832  DOGRA VOWEL SIGN E
11833  DOGRA VOWEL SIGN AI
11834  DOGRA VOWEL SIGN O
11835  DOGRA VOWEL SIGN AU

Various signs
11836  DOGRA SIGN ANUSVARA
11837  DOGRA SIGN VISARGA
11838  DOGRA SIGN VIRAMA
11839  DOGRA SIGN NUKTA

Punctuation
1183A  DOGRA ABBREVIATION SIGN

Figure 1: Comparison of Old Dogra with Takri and other scripts (from Leitner 1882: Set 2).

Figure 2: Comparison of Old Dogra with Takri and other scripts (from Leitner 1882: Set 2).

Figure 3: Specimen of the Dogri language written in Old Dogra (from Grierson 1916: 760).

Figure 4: Another specimen of the Dogri language written in Old Dogra (from Grierson 1916: 772).

Figure 5: Chart of the Old Dogra script (from Grierson 1916: 641).

Figure 6: Comparison of Gurmukhi, Kangra, and Old Dogra scripts (from Grierson 1916: 642).

Figure 7: Comparison of New Dogra (‘Dogri’) and Devanagari vowel and consonant letters (from Staal 1984: 33).

Figure 8: Comparison of New Dogra (‘Dogri’) and Devanagari vowel signs (from Staal 1984: 34).

Figure 9: The first page of a Dogri translation of the Līlavatī by Bhāskarācārya, a Sanskrit treatise on mathematics, printed in the New Dogra script in 1872 (p. 1). Figure 10 contains the second page of this book.

Figure 10: Another page from the Līlavatī in New Dogra (p. 2).

Figure 11: Page from the Līlavatī showing the use of printed New Dogra digits (p. 3).

Figure 12: Usage of fractions for denoting currency in New Dogra (from Staal 1984: 75). The text appears to be typeset using metal fonts. The fraction signs and currency marks may be represented using characters from the ‘Common Indic Number Forms’ block (U+A830) in Unicode.

Figure 13: Two specimens of a trilingual stamp for expedited mail from Jammu State in Old Dogra, Persian, and Devanagari scripts. The Dogra text reads <U+1180A U+1180A U+11825> <U+11811 U+11824 U+1182F U+11824 U+1182E> kakal jarurī. The Persian reads خط ضروری ḵẖat zarūrī; the Devanagari reads आवश्यक पत्र āvaśyaka patra.

Figure 14: Four bilingual regular postage stamps from Jammu State with text in the Old Dogra and Persian scripts. The set includes half ‘anna’ stamps and a one ‘anna’ stamp. In all stamps, the Dogra text at the top reads <U+11811 U+11822 U+11804> jamu ‘Jammu’ and <U+1180A U+11810 U+11822 U+1182E U+11824> kachamīr (= kaśmīr) ‘Kashmir’. In the half ‘anna’ stamps, the first line of the center is Persian ۱۹۲۳ نم انه 1923 [samvat] nim ānah ‘half ānnā’; the second line of the center is Dogra <U+11800 U+1181B U+11800 U+1181D> ada ana ‘half ānnā’. In the one ‘anna’ stamp, the first line of the center text is Persian یك انه yak ānah ‘one ānnā’; the second line of the center is Dogra <U+11806 U+1180A U+11800 U+1181D> ekana ‘one ānnā’. In all four stamps the third line of the center is the date 1923 [samvat], written in Devanagari-like digits: १९२३. The Persian text at the bottom of the stamp reads قلمرو سرکار جمون و کشمیر qalamrau sarkār jammūn o kaśmīr ‘dominions of the ruler of Jammu and Kashmir’.

Detail of top left (Devanagari)

Detail of top right (New Dogra)

Detail of center (New Dogra, Devanagari, and Urdu)

Figure 15: The mast of a two rupee non-judicial stamp paper from Kashmir State.

Figure 16: The mast of a five rupee non-judicial stamp paper from Kashmir State, containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.

Figure 17: A one anna stamp from Jammu and Kashmir State, with text in New Dogra, Devanagari, and Arabic (Urdu).

Figure 18: Telegraph stamps from Jammu and Kashmir State, with text in Dogra, Devanagari, Urdu, and English. The Dogra denominations show usage of Common Indic fractions and the rupee mark.

postakarda kalamrau jammu kasmir va tibbataha
“Postcard of the realm of Jammu, Kashmir, and Tibet”

isa tarapha siva pate ke kucha na likho
“Do not write anything on this side except for the address”

jamu, sinagar
“Jammu”, “Srinagar”

bhaisakh (4 ), 4 vaisakh
“Vaisakh 4”, “4 Vaisakh”

Figure 19: A state postcard from Jammu and Kashmir with post mark of 19 April 1891. The inscription at top is in New Dogra and the content is in Persian. The postmarks are in English and Dogra.

One rupee banknote

Twenty rupee banknote

Figure 20: Bank notes from Jammu and Kashmir State containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.

Vowel letters

Chamba     Dogra      Letter
U+11680    U+11800    A
U+11681    U+11801    AA
U+11682    U+11802    I
U+11683    none       II
U+11684    U+11804    U
U+11685    none       UU
U+11686    U+11806    E
U+11687    U+11807    AI
U+11688    U+11808    O
U+11689    none       AU

Vowel signs

Chamba     Dogra      Sign
(no dependent form)   A
U+116AD    U+1182C    AA
U+116AE    U+1182D    I
U+116AF    U+1182E    II
U+116B0    U+1182F    U
U+116B1    U+11830    UU
U+116B2    U+11832    E
U+116B3    U+11833    AI
U+116B4    U+11834    O
U+116B5    U+11835    AU

Table 2: Comparison of Chamba and Dogra vowel letters and signs. Glyphs are identified by code point; the Chamba column refers to the ‘Takri’ block.

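Chamba     Dogra      Letter
U+1168A    U+1180A    KA
U+1168B    U+1180B    KHA
U+1168C    U+1180C    GA
U+1168D    U+1180D    GHA
U+1168E    U+1180E    NGA
U+1168F    U+1180F    CA
U+11690    U+11810    CHA
U+11691    U+11811    JA
U+11692    U+11812    JHA
U+11693    U+11813    NYA
U+11694    U+11814    TTA
U+11695    U+11815    TTHA
U+11696    U+11816    DDA
U+11697    U+11817    DDHA
U+11698    U+11818    NNA
U+11699    U+11819    TA
U+1169A    U+1181A    THA
U+1169B    U+1181B    DA
U+1169C    U+1181C    DHA
U+1169D    U+1181D    NA
U+1169E    U+1181E    PA
U+1169F    U+1181F    PHA
U+116A0    U+11820    BA
U+116A1    U+11821    BHA
U+116A2    U+11822    MA
U+116A3    U+11823    YA
U+116A4    U+11824    RA
U+116A5    U+11825    LA
U+116A6    U+11826    VA
U+116A7    U+11827    SHA
none       U+11828    SSA
U+116A8    U+11829    SA
U+116A9    U+1182A    HA
U+116AA    U+1182B    RRA

Table 3: Comparison of Chamba and Dogra consonant letters. Glyphs are identified by code point; the Chamba column refers to the ‘Takri’ block.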

ISO/IEC JTC 1/SC 2/WG 2
PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646

Please fill all the sections A, B and C below. Please read Principles and Procedures Document (P & P) from http://std.dkuug.dk/JTC1/SC2/WG2/docs/principles.html for guidelines and details before filling this form. Please ensure you are using the latest Form from http://std.dkuug.dk/JTC1/SC2/WG2/docs/summaryform.html. See also http://std.dkuug.dk/JTC1/SC2/WG2/docs/roadmaps.html for latest Roadmaps.

Form number: N4502-F (Original 1994-10-14; Revised 1995-01, 1995-04, 1996-04, 1996-08, 1999-03, 2001-05, 2001-09, 2003-11, 2005-01, 2005-09, 2005-10, 2007-03, 2008-05, 2009-11, 2011-03, 2012-01)

A. Administrative

1. Title: Proposal to encode the Dogra script in Unicode
2. Requester's name: Anshuman Pandey, Script Encoding Initiative
3. Requester type (Member body/Liaison/Individual contribution): Liaison contribution
4. Submission date: 4 November 2015
5. Requester's reference (if applicable):
6. Choose one of the following:
   This is a complete proposal: Yes
   (or) More information will be provided later:

B. Technical – General

1. Choose one of the following:
   a. This proposal is for a new script (set of characters): Yes
      Proposed name of script: Dogra
   b. The proposal is for addition of character(s) to an existing block:
      Name of the existing block:
2. Number of characters in proposal: 59
3. Proposed category (select one from below; see section 2.2 of P&P document):
   A-Contemporary
   B.1-Specialized (small collection)
   B.2-Specialized (large collection)
   C-Major extinct
   D-Attested extinct
   E-Minor extinct: X
   F-Archaic Hieroglyphic or Ideographic
   G-Obscure or questionable usage symbols
4. Is a repertoire including character names provided? Yes
   a. If YES, are the names in accordance with the “character naming guidelines” in Annex L of P&P document?
   b. Are the character shapes attached in a legible form suitable for review?
5. Fonts related:
   a. Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the standard? Anshuman Pandey
   b. Identify the party granting a license for use of the font by the editors (include address, e-mail, ftp-site, etc.): Anshuman Pandey
6. References:
   a. Are references (to other character sets, dictionaries, descriptive texts, etc.) provided? Yes
   b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? Yes
7. Special encoding issues: Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration, etc. (if yes, please enclose information)? Yes
8. Additional Information: Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths, etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org for such information on other scripts. Also see Unicode Character Database (http://www.unicode.org/reports/tr44/) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard.

C. Technical - Justification

1. Has this proposal for addition of character(s) been submitted before? No
   If YES, explain:
2. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? Yes
   If YES, with whom? Christopher Shackle (SOAS, London)
   If YES, available relevant documents:
3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? No
   Reference: Historical script; currently not used.
4. The context of use for the proposed characters (type of use; common or rare): Common
   Reference:
5. Are the proposed characters in current use by the user community? No
   If YES, where? Reference:
6. After giving due considerations to the principles in the P&P document must the proposed characters be entirely in the BMP? N/A
   If YES, is a rationale provided? If YES, reference:
7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? Yes
8. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? No
   If YES, is a rationale for its inclusion provided? If YES, reference:
9. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? No
   If YES, is a rationale for its inclusion provided? If YES, reference:
10. Can any of the proposed character(s) be considered to be similar (in appearance or function) to, or could be confused with, an existing character? No
    If YES, is a rationale for its inclusion provided? If YES, reference:
11. Does the proposal include use of combining characters and/or use of composite sequences? Yes
    If YES, is a rationale for such use provided? Yes
    If YES, reference: See text of proposal
    Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided?
    If YES, reference:
12. Does the proposal contain characters with any special properties such as control function or similar semantics? Yes
    If YES, describe in detail (include attachment if necessary): Virama; see text of proposal
13. Does the proposal contain any Ideographic compatibility characters? No
    If YES, are the equivalent corresponding unified ideographic characters identified? If YES, reference:

not strictly correlate with values of Old Dogra vowels, the proposed letters can be used for representing the old script. The U+11802 DOGRA LETTER I can be used for u, U+11803 DOGRA LETTER II can be used for i, and U+11804 DOGRA LETTER U can be used for o and au; the height and curvature of the left stroke may be considered a stylistic attribute.

3.6 Vowel Signs

There are 10 dependent vowel signs, U+1182C through U+11835, in order: AA, I, II, U, UU, VOCALIC R, E, AI, O, AU.

The U+11831 DOGRA VOWEL SIGN VOCALIC R appears to have been introduced as part of New Dogra. It is used for representing words of Sanskrit origin that contain the vocalic sound ṛ. Its shape is derived from either U+11190 or U+0943. There is no corresponding independent vowel letter.

There is an alphabetic aspect to the representation of consonant-vowel syllables in written Dogra. The dependent vowel may be represented using the independent vowel letter as well as the vowel sign, e.g. <U+1180A U+11806> may be used in place of <U+1180A U+11832> for the syllable ke.

There are differences in the shape of certain vowel signs between the old and new script. The typical Takri forms of the vowel signs -u and -ū, which also resemble corresponding forms in Gurmukhi, were replaced with forms resembling Devanagari signs: U+1182F and U+11830, respectively. The typical Takri style of the vowel sign o was applied for writing U+11835 au, while a simplified form of o, U+11834, was used for o. The duplication of o for writing au was eliminated. A comparison of vowel signs in the old and new scripts, as they align with the proposed characters, is as follows:

[Table: Old and New forms of the vowel signs U+1182D, U+1182E, U+1182F, U+11830, U+11834, and U+11835. Glyph images not reproduced.]

3.7 Consonants

There are 34 consonant letters, U+1180A through U+1182B, in order: KA, KHA, GA, GHA, NGA, CA, CHA, JA, JHA, NYA, TTA, TTHA, DDA, DDHA, NNA, TA, THA, DA, DHA, NA, PA, PHA, BA, BHA, MA, YA, RA, LA, VA, SHA, SSA, SA, HA, RRA.

The U+11828 DOGRA LETTER SSA was introduced into New Dogra for representing the retroflex sibilant ṣa, which occurs in words of Sanskrit origin. It resembles U+111B0 SHARADA LETTER SSA.

There are glyphic variants of several consonant letters: U+1180C GA, U+1180D GHA, U+11810 CHA, U+11812 JHA, U+1181A THA, U+1181C DHA, and U+11824 RA. [Table of regular and variant glyphs not reproduced.]

There is evidence of usage of different forms of the same letter in the available sources. For instance, both the regular form of U+11810 and its variant are used for the same letter.

3.8 Anusvara

The U+11836 DOGRA SIGN ANUSVARA is used for marking nasalization.

3.9 Visarga

The U+11837 DOGRA SIGN VISARGA is used for indicating post-vocalic aspiration in words of Sanskrit origin. It occurs in the word <U+1181D U+11822 U+11837> namaḥ “homage to”.

3.10 Nukta

The U+11839 DOGRA SIGN NUKTA is used for representing sounds that are not native to Dogri and related languages.

3.11 Virama

The U+11838 DOGRA SIGN VIRAMA has two functions, similar to the corresponding character of Devanagari. It is used as a halanta for marking the absence of the inherent vowel of a consonant letter. It is also a control character that is used for producing conjuncts.

3.12 Consonant Conjuncts

Consonant clusters are rendered in three ways: 1) with visible halanta beneath the full form of each bare consonant; 2) with a half-form of the initial consonant followed by the full form of the following letter; and 3) as ligatures. There are no formal conventions for the written expression of conjuncts. The U+11838 DOGRA SIGN VIRAMA is used for the encoded representation of conjuncts.

3.12.1 Types of conjuncts

Some representations of Dogra conjuncts, as found in the available sources, are illustrated below:

• Visible halanta: In both Old and New Dogra the most common method of representing consonant clusters is by placing a halanta beneath each bare consonant.

• Half-forms: Another method is to use half-forms of letters, if the graphical structure provides such an opportunity.

The source specimen shows the conjunct śra in the word śrī (a Sanskrit honorific). The half-form is produced by dropping the right vertical bar of U+11827 DOGRA LETTER SHA. The conjunct śra is used consistently in New Dogra for representing the cluster śra; the form <U+11827 U+11838 U+11824>, with visible halanta beneath a full form of U+11827, does not occur in the available sources.

• Ligatures: There are two types of ligature. In the first, the shapes of individual letters are visible. In the source specimen, the boxed text contains the conjunct sṭa from the word sṭāṃp, a transcription of the English “stamp”.

The second type is an atomic (akhaṇḍa) ligature, whose form may be considered a distinctive letter. Such a ligature is the conjunct kṣa, which occurs in the word akṣaroṃ “in letters” (oblique form of akṣara “letter”). The conjunct kṣa is used consistently in New Dogra for representing the cluster kṣa. Its shape may be influenced by Devanagari kṣa.

• Forms of ra: Special conjunct forms of U+11824 DOGRA LETTER RA, e.g. repha or vattu, do not generally occur. Cluster-initial ra is represented as a full form with halanta, as in rṇa in the word saṃpūrṇa, and non-initial ra is treated similarly, as in pra in the word prabhu.

However, there is one instance in the available sources in which cluster-initial ra is represented as a repha. The occurrence is curious because the word caturbhuja “square”, in which repha occurs, appears three times in proximity but is spelled differently in all three instances.

In the boxed text at the top, the ra in caturbhuja is written with halanta: <U+1180F U+11819 U+1182F U+11824 U+11838 U+11821 U+1182F U+11811> caturbhuja. In the text at bottom left, ra occurs with its inherent vowel intact, without halanta: <U+1180F U+11819 U+1182F U+11824 U+11821 U+1182F U+11811> caturabhuja. At bottom right, ra is represented as repha: caturbhuja.

The usage of repha in Dogra is to be considered anomalous. There is no present need to support this form of ra in encoded representation.

3.12.2 Encoded representation of conjuncts

The Devanagari model for the representation of conjuncts is appropriate for Dogra. In encoded text, conjuncts are produced by placing the control character U+11838 DOGRA SIGN VIRAMA after each non-final consonant in a cluster, i.e. <(C U+11838) C>.

Representations of conjuncts rendered as atomic ligatures are:

kṣa  <U+1180A U+11838 U+11828>

sṭa  <U+11829 U+11838 U+11814>

Examples of conjuncts rendered using half-forms are:

śra  <U+11827 U+11838 U+11824>

Conjuncts displayed using visible halanta are:

pra  <U+1181E U+11838 U+11824>

rṇa  <U+11824 U+11838 U+11818>
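The encoding model for conjuncts can be sketched in a few lines of Python. The code point values are those given in this proposal; the helper names `encode_conjunct` and `as_code_points` are hypothetical and exist only for illustration.

```python
# Code points as listed in the proposal's names list.
KA, TTA, NNA, PA = 0x1180A, 0x11814, 0x11818, 0x1181E
RA, SHA, SSA, SA = 0x11824, 0x11827, 0x11828, 0x11829
VIRAMA = 0x11838  # DOGRA SIGN VIRAMA


def encode_conjunct(*consonants: int) -> str:
    """Join consonants with VIRAMA, i.e. <C VIRAMA C ...>."""
    if len(consonants) < 2:
        raise ValueError("a conjunct needs at least two consonants")
    return chr(VIRAMA).join(chr(c) for c in consonants)


def as_code_points(text: str) -> str:
    """Render a string as space-separated U+XXXX values."""
    return " ".join(f"U+{ord(ch):04X}" for ch in text)


for label, parts in [("ksa", (KA, SSA)), ("sta", (SA, TTA)),
                     ("sra", (SHA, RA)), ("pra", (PA, RA)), ("rna", (RA, NNA))]:
    print(label, as_code_points(encode_conjunct(*parts)))
```

Whether a given sequence is displayed as a ligature, with a half-form, or with visible halanta is left to the font; the encoded sequence is the same in all three cases.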

3.12.3 Modifying conjunct formation

As is the case in Devanagari and other Indic scripts, a ligature may be broken using the generic control character U+200C ZERO WIDTH NON-JOINER (ZWNJ):

kṣa (ligature)          <U+1180A U+11838 U+11828>

kṣa (visible halanta)   <U+1180A U+11838 U+200C U+11828>

The rendering of a conjunct as a ligature or with visible halanta is a matter of font design.
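A small Python sketch of this mechanism follows: it inserts U+200C after every virama that is followed directly by a consonant, so that a renderer would show visible halanta instead of a ligature. The function name `break_ligatures` and the decision to leave word-final viramas untouched are assumptions of this example.

```python
import re

VIRAMA = "\U00011838"  # DOGRA SIGN VIRAMA
ZWNJ = "\u200C"        # ZERO WIDTH NON-JOINER

# A virama directly followed by a Dogra consonant (U+1180A..U+1182B).
_JOINING_VIRAMA = re.compile(f"{VIRAMA}(?=[\U0001180A-\U0001182B])")


def break_ligatures(text: str) -> str:
    """Insert ZWNJ after each virama that would otherwise form a conjunct."""
    return _JOINING_VIRAMA.sub(VIRAMA + ZWNJ, text)


ksa = "\U0001180A\U00011838\U00011828"  # KA, VIRAMA, SSA
broken = break_ligatures(ksa)
print([f"U+{ord(ch):04X}" for ch in broken])
```

Because the lookahead requires a consonant immediately after the virama, applying the function a second time leaves the text unchanged.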

3.13 Punctuation

Abbreviation sign: The U+1183A DOGRA ABBREVIATION SIGN is used for indicating truncation of text. In the source excerpt, the highlighted text <U+11825 U+1182E U+1183A> <U+11826 U+1183A> <U+11821 U+1182C U+1183A> (lī va bhā 23) is an abbreviation for līla vatī bhāga 23.

Danda-s: The daṇḍā and double daṇḍā are commonly used in Dogra documents for marking the ends of sentences and paragraphs. The source excerpts show usage of both the daṇḍā and the double daṇḍā.

These daṇḍā-s are not proposed for encoding as separate characters in the ‘Dogra’ block. They resemble U+0964 DEVANAGARI DANDA and U+0965 DEVANAGARI DOUBLE DANDA and may be represented using these characters. The Devanagari daṇḍā-s are to be specified as extensions for Dogra.

Other signs: Other marks of punctuation, such as dashes, are also used. A hyphen is used in some cases for indicating breaks in a word at end of line (see section 3.16).

3.14 Digits

Digits are commonly used in Dogra documents (see figure 11). The style of digits varies across written and printed sources. Some digits in Old Dogra sources resemble those of Takri (see figure 4), while those in New Dogra sources are similar to Devanagari. The source excerpt shows digits 1–9 and the number 10 as represented in the Līlavatī, written in New Dogra.

The majority of the digits shown there resemble Devanagari forms; the exception is ‘four’. The form of Dogra ‘four’ does not resemble U+116C4 TAKRI DIGIT FOUR, which is nearly identical to the Devanagari digit four. Information about its origins is not known at this time, but it may be related to U+111D4 SHARADA DIGIT FOUR. The forms of ‘eight’ and ‘nine’ are well-known variant forms of the regular Devanagari digits.

Additional research into Dogra digits is required before they may be considered for encoding. The available sources suggest that there may have been a cultural understanding that Dogra possessed unique digits. For instance, on the bank notes shown in figure 20, the digits written in Dogra and Devanagari are rather similar, but are nonetheless displayed twice in order to align the display of the currency value with each script used for the inscriptions.

At present, script-specific digits are not proposed for inclusion in the Dogra repertoire. Devanagari digits should be used normatively and should be extended for usage with Dogra. If future research yields distinctive digits for Dogra, there is sufficient space in the block to accommodate their encoding.

3.15 Number forms

Fraction signs and currency marks are commonly used in Dogra sources (see figure 12). These may be represented using the fraction and currency signs already encoded in the ‘Common Indic Number Forms’ block (U+A830). These characters are specified as extensions for Dogra. The ‘Common Indic Number Forms’ are described briefly in The Unicode Standard (chapter 22.3, ‘Numerals’, pp. 770–771) and in more detail in the original encoding proposal (L2/07-354).

3.16 Linebreaking

Linebreaking occurs after an orthographic syllable. Breaks are generally not indicated in informal sources, but are marked in literary materials using hyphens.
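To make the notion of an orthographic syllable concrete, the Python sketch below splits encoded Dogra text into candidate syllables with a regular expression; line breaks would then be permitted between the resulting pieces. The pattern is a simplified assumption built from the syllabic categories proposed for the script (consonant, nukta, virama, vowel sign, anusvara, visarga), not a rule stated in the proposal, and the name `syllables` is hypothetical.

```python
import re

CONSONANT = "[\U0001180A-\U0001182B]\U00011839?"  # letter, optional NUKTA
INDEPENDENT_VOWEL = "[\U00011800-\U00011809]"
VOWEL_SIGN = "[\U0001182C-\U00011835]"
FINAL = "[\U00011836\U00011837]"                  # ANUSVARA, VISARGA
VIRAMA = "\U00011838"
ZWNJ = "\u200C"

# Consonant cluster (C VIRAMA C ...), optional trailing halanta or vowel
# sign, or an independent vowel; either may carry anusvara or visarga.
ORTHOGRAPHIC_SYLLABLE = re.compile(
    f"(?:{CONSONANT}(?:{VIRAMA}{ZWNJ}?{CONSONANT})*{VIRAMA}?{VOWEL_SIGN}?"
    f"|{INDEPENDENT_VOWEL}){FINAL}*"
)


def syllables(text: str) -> list:
    """Return the orthographic syllables found in text, in order."""
    return ORTHOGRAPHIC_SYLLABLE.findall(text)


# caturbhuja: CA, TA + U, RA + VIRAMA + BHA + U, JA
caturbhuja = "".join(chr(cp) for cp in (
    0x1180F, 0x11819, 0x1182F, 0x11824, 0x11838, 0x11821, 0x1182F, 0x11811))
print([len(s) for s in syllables(caturbhuja)])
```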


3.17 Collation

The primary collating order for Dogra is as follows:

U+11800 < U+11801 < U+11802 < U+11803 < U+11804 < U+11805 < U+11806 < U+11807 < U+11808 < U+11809 <

U+1180A < U+1180B < U+1180C < U+1180D < U+1180E < U+1180F < U+11810 < U+11811 <

U+11812 < U+11813 < U+11814 < U+11815 < U+11816 < U+11817 < U+11818 < U+11819 <

U+1181A < U+1181B < U+1181C < U+1181D < U+1181E < U+1181F < U+11820 < U+11821 <

U+11822 < U+11823 < U+11824 < U+11825 < U+11826 < U+11827 < U+11828 < U+11829 < U+1182A <

U+1182B < U+1182C < U+1182D < U+1182E <

U+1182F < U+11830 < U+11831 < U+11832 <

U+11833 < U+11834 < U+11835

The following characters have secondary weights: U+11836, U+11837, U+11838, U+11839.
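The two-level ordering can be approximated with a Python sort key. This is a rough sketch and not an implementation of the Unicode Collation Algorithm: the relative order assigned to the four secondary-weight signs, and the decision to ignore all other characters, are assumptions of the example, as is the function name `sort_key`.

```python
# Primary weights follow the order listed above, which coincides with
# code point order for U+11800..U+11835.
PRIMARY = {chr(cp): i for i, cp in enumerate(range(0x11800, 0x11836))}

# Signs with secondary weights (U+11836..U+11839); their relative order
# here is an assumption.
SECONDARY = {chr(cp): i + 1 for i, cp in enumerate(range(0x11836, 0x1183A))}


def sort_key(word: str):
    """Compare primary weights first; secondary weights break ties."""
    primary = tuple(PRIMARY[ch] for ch in word if ch in PRIMARY)
    secondary = tuple(
        SECONDARY.get(ch, 0)
        for ch in word
        if ch in PRIMARY or ch in SECONDARY
    )
    return primary, secondary


ka = "\U0001180A"                     # KA
ka_anusvara = "\U0001180A\U00011836"  # KA + ANUSVARA
ka_aa = "\U0001180A\U0001182C"        # KA + VOWEL SIGN AA

ordered = sorted([ka_aa, ka_anusvara, ka], key=sort_key)
print([[f"U+{ord(ch):04X}" for ch in word] for word in ordered])
```

With this key, a word that differs only by an anusvara sorts immediately after its plain counterpart and before any word with an additional primary-weight character.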

4 Character data

41 Character properties

The properties for Dogra in the Unicode Character Database format are

11800DOGRA LETTER ALo0LN11801DOGRA LETTER AALo0LN11802DOGRA LETTER ILo0LN11804DOGRA LETTER ULo0LN11806DOGRA LETTER ELo0LN11807DOGRA LETTER AILo0LN11808DOGRA LETTER OLo0LN1180ADOGRA LETTER KALo0LN1180BDOGRA LETTER KHALo0LN1180CDOGRA LETTER GALo0LN1180DDOGRA LETTER GHALo0LN1180EDOGRA LETTER NGALo0LN1180FDOGRA LETTER CALo0LN11810DOGRA LETTER CHALo0LN11811DOGRA LETTER JALo0LN11812DOGRA LETTER JHALo0LN11813DOGRA LETTER NYALo0LN

11

Proposal to encode the Dogra script in Unicode Anshuman Pandey

11814DOGRA LETTER TTALo0LN11815DOGRA LETTER TTHALo0LN11816DOGRA LETTER DDALo0LN11817DOGRA LETTER DDHALo0LN11818DOGRA LETTER NNALo0LN11819DOGRA LETTER TALo0LN1181ADOGRA LETTER THALo0LN1181BDOGRA LETTER DALo0LN1181CDOGRA LETTER DHALo0LN1181DDOGRA LETTER NALo0LN1181EDOGRA LETTER PALo0LN1181FDOGRA LETTER PHALo0LN11820DOGRA LETTER BALo0LN11821DOGRA LETTER BHALo0LN11822DOGRA LETTER MALo0LN11823DOGRA LETTER YALo0LN11824DOGRA LETTER RALo0LN11825DOGRA LETTER LALo0LN11826DOGRA LETTER VALo0LN11827DOGRA LETTER SHALo0LN11828DOGRA LETTER SSALo0LN11829DOGRA LETTER SALo0LN1182ADOGRA LETTER HALo0LN1182BDOGRA LETTER RRALo0LN1182CDOGRA VOWEL SIGN AAMc0LN1182DDOGRA VOWEL SIGN IMc0LN1182EDOGRA VOWEL SIGN IIMc0LN1182FDOGRA VOWEL SIGN UMn0NSMN11830DOGRA VOWEL SIGN UUMn0NSMN11831DOGRA VOWEL SIGN VOCALIC RMn0NSMN11832DOGRA VOWEL SIGN EMn0NSMN11833DOGRA VOWEL SIGN AIMn0NSMN11834DOGRA VOWEL SIGN OMn0NSMN11835DOGRA VOWEL SIGN AUMn0NSMN11836DOGRA SIGN ANUSVARAMn0NSMN11837DOGRA SIGN VISARGAMc0LN11838DOGRA SIGN VIRAMAMn9NSMN11839DOGRA SIGN NUKTAMn7NSMN1183ADOGRA ABBREVIATION SIGNPo0LN

4.2 Linebreaking

Linebreaking properties, given in the data format of LineBreak.txt:

11800..1182B;AL # Lo [44] DOGRA LETTER A..DOGRA LETTER RRA
1182C..1182E;CM # Mc [3] DOGRA VOWEL SIGN AA..DOGRA VOWEL SIGN II
1182F..11835;CM # Mn [7] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN AU
11836;CM # Mn DOGRA SIGN ANUSVARA
11837;CM # Mc DOGRA SIGN VISARGA
11838..11839;CM # Mn [2] DOGRA SIGN VIRAMA..DOGRA SIGN NUKTA
1183A;AL # Po DOGRA ABBREVIATION SIGN
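A range listing of this kind is typically expanded into a per-code-point lookup. The sketch below assumes the usual LineBreak.txt conventions (`..` for ranges, `;` before the property value, `#` for comments); the names `PROPOSED_LINE_BREAK` and `load_property_ranges` are hypothetical, and the data is transcribed from the entries above.

```python
import re

# Proposed entries, written with the ".." and ";" separators of LineBreak.txt.
PROPOSED_LINE_BREAK = """\
11800..1182B;AL
1182C..1182E;CM
1182F..11835;CM
11836;CM
11837;CM
11838..11839;CM
1183A;AL
"""

ENTRY = re.compile(r"^([0-9A-F]{4,6})(?:\.\.([0-9A-F]{4,6}))?\s*;\s*(\w+)")


def load_property_ranges(text):
    """Expand 'first..last;value' entries into a {code point: value} table."""
    table = {}
    for line in text.splitlines():
        match = ENTRY.match(line.strip())
        if not match:
            continue  # skip blank and comment-only lines
        first = int(match.group(1), 16)
        last = int(match.group(2) or match.group(1), 16)
        for code_point in range(first, last + 1):
            table[code_point] = match.group(3)
    return table


LINE_BREAK = load_property_ranges(PROPOSED_LINE_BREAK)
```

The expanded table covers all 59 code points from 11800 through 1183A, which matches the number of characters stated in the proposal summary form.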

4.3 Syllabic categories

Syllabic categories, given in the format of IndicSyllabicCategory.txt:

# Indic_Syllabic_Category=Bindu
11836 ; Bindu # Mn DOGRA SIGN ANUSVARA

# Indic_Syllabic_Category=Visarga
11837 ; Visarga # Mc DOGRA SIGN VISARGA

# Indic_Syllabic_Category=Nukta
11839 ; Nukta # Mn DOGRA SIGN NUKTA

# Indic_Syllabic_Category=Virama
11838 ; Virama # Mn DOGRA SIGN VIRAMA

# Indic_Syllabic_Category=Vowel_Independent
11800..11808 ; Vowel_Independent # Lo [3] DOGRA LETTER A..DOGRA LETTER AU

# Indic_Syllabic_Category=Vowel_Dependent
1182C..1182E ; Vowel_Dependent # Mc [3] DOGRA VOWEL SIGN AA..DOGRA VOWEL SIGN II
1182F..11835 ; Vowel_Dependent # Mn [7] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN AU

# Indic_Syllabic_Category=Consonant
1180A..1182B ; Consonant # Lo [34] DOGRA LETTER KA..DOGRA LETTER RRA

4.4 Positional categories

Positional data for characters, in the format of IndicPositionalCategory.txt:

# Indic_Positional_Category=Right
1182C ; Right # Mc DOGRA VOWEL SIGN AA
1182E ; Right # Mc DOGRA VOWEL SIGN II
11837 ; Right # Mc DOGRA SIGN VISARGA

# Indic_Positional_Category=Left
1182D ; Left # Mc DOGRA VOWEL SIGN I

# Indic_Positional_Category=Top
11836 ; Top # Mn DOGRA SIGN ANUSVARA
11832..11835 ; Top # Mn [4] DOGRA VOWEL SIGN E..DOGRA VOWEL SIGN AU

# Indic_Positional_Category=Bottom
1182F..11831 ; Bottom # Mn [3] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN VOCALIC R
11838 ; Bottom # Mn DOGRA SIGN VIRAMA
11839 ; Bottom # Mn DOGRA SIGN NUKTA
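To show what the positional data implies for display, here is a minimal sketch that moves a Left-positioned sign in front of its base. It assumes a cluster is given as a list of code points in logical order (base first); the names `INDIC_POSITION` and `visual_order` are hypothetical, and real reordering is carried out by the shaping engine and font rather than by application code.

```python
# Positional categories transcribed from the proposed data above.
INDIC_POSITION = {
    0x1182C: "Right", 0x1182E: "Right", 0x11837: "Right",
    0x1182D: "Left",
    0x11836: "Top",
    0x11838: "Bottom", 0x11839: "Bottom",
}
INDIC_POSITION.update({cp: "Top" for cp in range(0x11832, 0x11836)})
INDIC_POSITION.update({cp: "Bottom" for cp in range(0x1182F, 0x11832)})


def visual_order(cluster):
    """Place Left-positioned signs before the base; keep the rest in order."""
    left = [cp for cp in cluster if INDIC_POSITION.get(cp) == "Left"]
    rest = [cp for cp in cluster if INDIC_POSITION.get(cp) != "Left"]
    return left + rest


ki = visual_order([0x1180A, 0x1182D])  # KA + VOWEL SIGN I
kaa = visual_order([0x1180A, 0x1182C])  # KA + VOWEL SIGN AA
```

Only DOGRA VOWEL SIGN I is affected; every other sign stays after the base in this simplified model.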

4.5 Script extensions

The following characters should be extended for usage with the Dogra script in ScriptExtensions.txt:

0964 # Po DEVANAGARI DANDA
0965 # Po DEVANAGARI DOUBLE DANDA
0966..096F # Nd [10] DEVANAGARI DIGIT ZERO..DEVANAGARI DIGIT NINE
A830..A835 # No [6] NORTH INDIC FRACTION ONE QUARTER..NORTH INDIC FRACTION THREE SIXTEENTHS
A836..A837 # So [2] NORTH INDIC QUARTER MARK..NORTH INDIC PLACEHOLDER MARK
A838 # Sc NORTH INDIC RUPEE MARK
A839 # So NORTH INDIC QUANTITY MARK

4.6 ‘Confusable’ characters

Some Dogra characters that resemble characters encoded in other script blocks are:

DOGRA LETTER U : U+11187 SHARADA LETTER U
DOGRA LETTER UU : U+11188 SHARADA LETTER UU
DOGRA LETTER O : U+11684 TAKRI LETTER U
DOGRA LETTER KA : U+0A15 GURMUKHI LETTER KA
DOGRA LETTER CA : U+11196 SHARADA LETTER CA
DOGRA LETTER TTHA : U+1119C SHARADA LETTER TTHA
DOGRA LETTER TA : U+11699 TAKRI LETTER TA
DOGRA LETTER DHA : U+11680 TAKRI LETTER A
DOGRA LETTER PHA : U+1169F TAKRI LETTER PHA
DOGRA LETTER BHA : U+116A1 TAKRI LETTER BHA
DOGRA LETTER YA : U+116A3 TAKRI LETTER YA
DOGRA LETTER SHA : U+0936 DEVANAGARI LETTER SHA
DOGRA LETTER SSA : U+111B0 SHARADA LETTER SSA
DOGRA VOWEL SIGN U : U+0941 DEVANAGARI VOWEL SIGN U
DOGRA VOWEL SIGN UU : U+0942 DEVANAGARI VOWEL SIGN UU
DOGRA VOWEL SIGN E : U+116B2 TAKRI VOWEL SIGN E
DOGRA VOWEL SIGN AI : U+116B3 TAKRI VOWEL SIGN AI
DOGRA VOWEL SIGN O : U+116B4 TAKRI VOWEL SIGN O
DOGRA VOWEL SIGN AU : U+116B5 TAKRI VOWEL SIGN AU
DOGRA SIGN ANUSVARA : U+0902 DEVANAGARI SIGN ANUSVARA
DOGRA SIGN VIRAMA : U+094D DEVANAGARI SIGN VIRAMA
DOGRA SIGN NUKTA : U+093C DEVANAGARI SIGN NUKTA
DOGRA ABBREVIATION SIGN : U+0970 DEVANAGARI ABBREVIATION SIGN
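A list of this kind could feed a simple look-alike check. The sketch below maps a handful of the proposed Dogra code points to the characters they are said to resemble; the Dogra code points are taken from the proposed names list, the names `LOOKALIKE` and `lookalike_skeleton` are hypothetical, and the approach is only loosely modeled on confusable-skeleton comparison rather than on any published data file.

```python
# A few pairs from the list above: proposed Dogra code point -> look-alike.
LOOKALIKE = {
    0x11804: 0x11187,  # DOGRA LETTER U           ~ SHARADA LETTER U
    0x1180A: 0x0A15,   # DOGRA LETTER KA          ~ GURMUKHI LETTER KA
    0x11836: 0x0902,   # DOGRA SIGN ANUSVARA      ~ DEVANAGARI SIGN ANUSVARA
    0x11838: 0x094D,   # DOGRA SIGN VIRAMA        ~ DEVANAGARI SIGN VIRAMA
    0x11839: 0x093C,   # DOGRA SIGN NUKTA         ~ DEVANAGARI SIGN NUKTA
    0x1183A: 0x0970,   # DOGRA ABBREVIATION SIGN  ~ DEVANAGARI ABBREVIATION SIGN
}


def lookalike_skeleton(text):
    """Replace each listed Dogra character with its look-alike counterpart."""
    return "".join(chr(LOOKALIKE.get(ord(ch), ord(ch))) for ch in text)


dogra_k = chr(0x1180A) + chr(0x11838)  # KA + VIRAMA in the proposed encoding
skeleton = lookalike_skeleton(dogra_k)
```

Two strings whose skeletons match would be flagged for review; characters without an entry pass through unchanged.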

5 References

Bhāskarācārya. 1872. Līlavatī. Translation into Dogri. Jammu: Vidya Vilas Press.

Grierson, George A. 1916. The Linguistic Survey of India. Vol. IX. Indo-Aryan Family. Central Group. Part I. Specimens of Western Hindī and Pañjābī. Calcutta: Office of the Superintendent of Government Printing, India.

Leitner, Gottlieb William. 1882. “A Collection of Specimens of Commercial and Other Alphabets and Handwritings as also of Multiplication Tables Current in Various Parts of the Panjab, Sind and the North West Provinces”. Published as Appendix V of History of Indigenous Education in the Punjab. Lahore: Anjuman-i-Punjab Press.

Pandey, Anshuman. 2007. “Proposal to Encode North Indic Number Forms in ISO/IEC 10646” (L2/07-354). http://www.unicode.org/L2/L2007/07354-north-indic.pdf

Pandey, Anshuman. 2009. “Proposal to Encode the Takri Script in ISO/IEC 10646” (L2/09-424). http://www.unicode.org/L2/L2009/09424-takri.pdf

Pandey, Anshuman. 2015. “Preliminary Proposal to Encode the Dogra Script in Unicode” (L2/15-213). http://www.unicode.org/L2/L2015/15213-dogra.pdf

Staal, Fritz. 1984. Stamps of Jammu & Kashmir. New York: The Collectors Club.

von der Lin, Carol A. “Collection Kashmir”. “Philatelic Glossary: Dōgrī-Takrī Scripts”. http://www.kashmirstamps.com/DogriGloss.html

6 Acknowledgments

I would like to thank Christopher Shackle (School of Oriental and African Studies, London) for reviewing this proposal.

This project was made possible in part through a Google Research Award granted to Deborah Anderson for the Script Encoding Initiative and a grant from the United States National Endowment for the Humanities (PR-50205-15), which funds the Universal Scripts Project (part of the Script Encoding Initiative at the University of California, Berkeley). Any views, findings, conclusions or recommendations expressed in this publication do not necessarily reflect those of Google or the National Endowment for the Humanities.

Code chart: Dogra, range 11800..1184F, columns 1180 to 1184. Printed using UniBook™ (http://www.unicode.org/unibook/), 04-Nov-2015.

Names list: Dogra, 11800..1183A

Punctuation
1183A DOGRA ABBREVIATION SIGN

Independent vowels
11800 DOGRA LETTER A
11801 DOGRA LETTER AA
11802 DOGRA LETTER I
11803 DOGRA LETTER II
11804 DOGRA LETTER U
11805 DOGRA LETTER UU
11806 DOGRA LETTER E
11807 DOGRA LETTER AI
11808 DOGRA LETTER O
11809 DOGRA LETTER AU

Consonants
1180A DOGRA LETTER KA
1180B DOGRA LETTER KHA
1180C DOGRA LETTER GA
1180D DOGRA LETTER GHA
1180E DOGRA LETTER NGA
1180F DOGRA LETTER CA
11810 DOGRA LETTER CHA
11811 DOGRA LETTER JA
11812 DOGRA LETTER JHA
11813 DOGRA LETTER NYA
11814 DOGRA LETTER TTA
11815 DOGRA LETTER TTHA
11816 DOGRA LETTER DDA
11817 DOGRA LETTER DDHA
11818 DOGRA LETTER NNA
11819 DOGRA LETTER TA
1181A DOGRA LETTER THA
1181B DOGRA LETTER DA
1181C DOGRA LETTER DHA
1181D DOGRA LETTER NA
1181E DOGRA LETTER PA
1181F DOGRA LETTER PHA
11820 DOGRA LETTER BA
11821 DOGRA LETTER BHA
11822 DOGRA LETTER MA
11823 DOGRA LETTER YA
11824 DOGRA LETTER RA
11825 DOGRA LETTER LA
11826 DOGRA LETTER VA
11827 DOGRA LETTER SHA
11828 DOGRA LETTER SSA
11829 DOGRA LETTER SA
1182A DOGRA LETTER HA
1182B DOGRA LETTER RRA

Dependent vowel signs
1182C DOGRA VOWEL SIGN AA
1182D DOGRA VOWEL SIGN I
1182E DOGRA VOWEL SIGN II
1182F DOGRA VOWEL SIGN U
11830 DOGRA VOWEL SIGN UU
11831 DOGRA VOWEL SIGN VOCALIC R
11832 DOGRA VOWEL SIGN E
11833 DOGRA VOWEL SIGN AI
11834 DOGRA VOWEL SIGN O
11835 DOGRA VOWEL SIGN AU

Various signs
11836 DOGRA SIGN ANUSVARA
11837 DOGRA SIGN VISARGA
11838 DOGRA SIGN VIRAMA
11839 DOGRA SIGN NUKTA

Figure 1: Comparison of Old Dogra with Takri and other scripts (from Leitner 1882: Set 2).

Figure 2: Comparison of Old Dogra with Takri and other scripts (from Leitner 1882: Set 2).

Figure 3: Specimen of the Dogri language written in Old Dogra (from Grierson 1916: 760).

Figure 4: Another specimen of the Dogri language written in Old Dogra (from Grierson 1916: 772).

Figure 5: Chart of the Old Dogra script (from Grierson 1916: 641).

Figure 6: Comparison of Gurmukhi, Kangra, and Old Dogra scripts (from Grierson 1916: 642).

Figure 7: Comparison of New Dogra (‘Dogri’) and Devanagari vowel and consonant letters (from Staal 1984: 33).

Figure 8: Comparison of New Dogra (‘Dogri’) and Devanagari vowel signs (from Staal 1984: 34).

Figure 9: The first page of a Dogri translation of the Līlavatī by Bhāskarācārya, a Sanskrit treatise on mathematics, printed in the New Dogra script in 1872 (p. 1). Figure 10 contains the second page of this book.

Figure 10: Another page from the Līlavatī in New Dogra (p. 2).

Figure 11: Page from the Līlavatī showing the use of printed New Dogra digits (p. 3).

Figure 12: Usage of fractions for denoting currency in New Dogra (from Staal 1984: 75). The text appears to be typeset using metal fonts. The fraction signs and currency marks may be represented using characters from the ‘Common Indic Number Forms’ block (U+A830) in Unicode.

Figure 13: Two specimens of a trilingual stamp for expedited mail from Jammu State in Old Dogra, Persian, and Devanagari scripts. The Dogra text reads <U+1180A U+1180A U+11825> <U+11811 U+11824 U+1182F U+11824 U+1182E> kakal jarurī. The Persian reads خط ضروری ḵẖat zarūrī; the Devanagari reads आवश्यक पत्र āvaśyaka patra.

Figure 14: Four bilingual regular postage stamps from Jammu State with text in the Old Dogra and Persian scripts. Stamps and are half ‘anna’ stamps and is a one ‘anna’ stamp. In all stamps the Dogra text at the top reads <U+11811 U+11822 U+11804> jamu ‘Jammu’ and <U+1180A U+11810 U+11822 U+1182E U+11824> kachamīr (= kaśmīr) ‘Kashmir’. In stamps and the first line of the center is Persian ۱۹۲۳ نم انه 1923 [samvat] nim ānah ‘half ānnā’; the second line of the center is Dogra <U+11800 U+1181B U+11800 U+1181D> ada ana ‘half ānnā’. In the first line of the center text is Persian یك انه yak ānah ‘one ānnā’; the second line of the center is Dogra <U+11806 U+1180A U+11800 U+1181D> ekana ‘one ānnā’. In all four stamps the third line of the center is the date 1923 [samvat] written in Devanagari-like digits १९२३. The Persian text at the bottom of the stamp reads قلمرو سرکار جمون و کشمیر qalamrau sarkār jammūn o kaśmīr ‘dominions of the ruler of Jammu and Kashmir’.

Detail of top left (Devanagari)

Detail of top right (New Dogra)

Detail of center (New Dogra, Devanagari, and Urdu)

Figure 15: The mast of a two rupee non-judicial stamp paper from Kashmir State.

Figure 16: The mast of a five rupee non-judicial stamp paper from Kashmir State containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.

Figure 17: A one anna stamp from Jammu and Kashmir State with text in New Dogra, Devanagari, and Arabic (Urdu).

Figure 18: Telegraph stamps from Jammu and Kashmir State with text in Dogra, Devanagari, Urdu, and English. The Dogra denominations show usage of Common Indic fractions and the rupee mark.

postakarda kalamrau jammu kasmir va tibbataha: “Postcard of the realm of Jammu, Kashmir, and Tibet”

isa tarapha siva pate ke kucha na likho: “Do not write anything on this side except for the address”

jamu, sinagar: “Jammu”, “Srinagar”

bhaisakh (4), 4 vaisakh: “Vaisakh 4”, “4 Vaisakh”

Figure 19: A state postcard from Jammu and Kashmir with postmark of 19 April 1891. The inscription at top is in New Dogra and the content is in Persian. The postmarks are in English and Dogra.

One rupee banknote

Twenty rupee banknote

Figure 20: Bank notes from Jammu and Kashmir State containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.

Vowel letters

Chamba     Dogra
U+11680    U+11800
U+11681    U+11801
U+11682    U+11802
U+11683    (none)
U+11684    U+11804
U+11685    (none)
U+11686    U+11806
U+11687    U+11807
U+11688    U+11808
U+11689    (none)

Vowel signs

Chamba     Dogra
no dependent form
U+116AD    U+1182C
U+116AE    U+1182D
U+116AF    U+1182E
U+116B0    U+1182F
U+116B1    U+11830
U+116B2    U+11832
U+116B3    U+11833
U+116B4    U+11834
U+116B5    U+11835

Table 2: Comparison of Chamba and Dogra vowel letters and signs.

Chamba     Dogra
U+1168A    U+1180A
U+1168B    U+1180B
U+1168C    U+1180C
U+1168D    U+1180D
U+1168E    U+1180E
U+1168F    U+1180F
U+11690    U+11810
U+11691    U+11811
U+11692    U+11812
U+11693    U+11813
U+11694    U+11814
U+11695    U+11815
U+11696    U+11816
U+11697    U+11817
U+11698    U+11818
U+11699    U+11819
U+1169A    U+1181A
U+1169B    U+1181B
U+1169C    U+1181C
U+1169D    U+1181D
U+1169E    U+1181E
U+1169F    U+1181F
U+116A0    U+11820
U+116A1    U+11821
U+116A2    U+11822
U+116A3    U+11823
U+116A4    U+11824
U+116A5    U+11825
U+116A6    U+11826
U+116A7    U+11827
(none)     U+11828
U+116A8    U+11829
U+116A9    U+1182A
U+116AA    U+1182B

Table 3: Comparison of Chamba and Dogra consonant letters.

ISO/IEC JTC 1/SC 2/WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646

Please fill all the sections A, B and C below. Please read Principles and Procedures Document (P & P) from http://std.dkuug.dk/JTC1/SC2/WG2/docs/principles.html for guidelines and details before filling this form. Please ensure you are using the latest Form from http://std.dkuug.dk/JTC1/SC2/WG2/docs/summaryform.html. See also http://std.dkuug.dk/JTC1/SC2/WG2/docs/roadmaps.html for latest Roadmaps.

Form number: N4502-F (Original 1994-10-14; Revised 1995-01, 1995-04, 1996-04, 1996-08, 1999-03, 2001-05, 2001-09, 2003-11, 2005-01, 2005-09, 2005-10, 2007-03, 2008-05, 2009-11, 2011-03, 2012-01)

A. Administrative

1. Title: Proposal to encode the Dogra script in Unicode
2. Requester's name: Anshuman Pandey, Script Encoding Initiative
3. Requester type (Member body/Liaison/Individual contribution): Liaison contribution
4. Submission date: 4 November 2015
5. Requester's reference (if applicable):
6. Choose one of the following:
   This is a complete proposal: Yes
   (or) More information will be provided later:

B. Technical – General

1. Choose one of the following:
   a. This proposal is for a new script (set of characters): Yes
      Proposed name of script: Dogra
   b. The proposal is for addition of character(s) to an existing block:
      Name of the existing block:
2. Number of characters in proposal: 59
3. Proposed category (select one from below; see section 2.2 of P&P document):
   A-Contemporary
   B.1-Specialized (small collection)
   B.2-Specialized (large collection)
   C-Major extinct
   D-Attested extinct
   E-Minor extinct: X
   F-Archaic Hieroglyphic or Ideographic
   G-Obscure or questionable usage symbols
4. Is a repertoire including character names provided? Yes
   a. If YES, are the names in accordance with the “character naming guidelines” in Annex L of P&P document?
   b. Are the character shapes attached in a legible form suitable for review?
5. Fonts related:
   a. Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the standard? Anshuman Pandey
   b. Identify the party granting a license for use of the font by the editors (include address, e-mail, ftp-site, etc.): Anshuman Pandey
6. References:
   a. Are references (to other character sets, dictionaries, descriptive texts etc.) provided? Yes
   b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? Yes
7. Special encoding issues: Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration etc. (if yes please enclose information)? Yes
8. Additional Information: Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org for such information on other scripts. Also see Unicode Character Database (http://www.unicode.org/reports/tr44/) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard.

C. Technical - Justification

1. Has this proposal for addition of character(s) been submitted before? No
   If YES explain:
2. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? Yes
   If YES, with whom? Christopher Shackle (SOAS, London)
   If YES, available relevant documents:
3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? No
   Reference: Historical script; currently not used
4. The context of use for the proposed characters (type of use; common or rare): Common
   Reference:
5. Are the proposed characters in current use by the user community? No
   If YES, where? Reference:
6. After giving due considerations to the principles in the P&P document must the proposed characters be entirely in the BMP? N/A
   If YES, is a rationale provided? If YES, reference:
7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? Yes
8. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? No
   If YES, is a rationale for its inclusion provided? If YES, reference:
9. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? No
   If YES, is a rationale for its inclusion provided? If YES, reference:
10. Can any of the proposed character(s) be considered to be similar (in appearance or function) to, or could be confused with, an existing character? No
    If YES, is a rationale for its inclusion provided? If YES, reference:
11. Does the proposal include use of combining characters and/or use of composite sequences? Yes
    If YES, is a rationale for such use provided? Yes
    If YES, reference: See text of proposal
    Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided? If YES, reference:
12. Does the proposal contain characters with any special properties such as control function or similar semantics? Yes
    If YES, describe in detail (include attachment if necessary): Virama; see text of proposal
13. Does the proposal contain any Ideographic compatibility characters? No
    If YES, are the equivalent corresponding unified ideographic characters identified? If YES, reference:

317 Collation

The primary collating order for Dogra is as follows

71680 lt 71681 lt 71682 lt 71683 lt 71684 lt 71685 lt 71686 lt 71687 lt 71688 lt 71689 lt

71690 lt 71691 lt 71692 lt 71693 lt 71694 lt 71695 lt 71696 lt 71697 lt

71698 lt 71699 lt 71700 lt 71701 lt 71702 lt 71703 lt 71704 lt 71705 lt

71706 lt 71707 lt 71708 lt 71709 lt 71710 lt 71711 lt 71712 lt 71713 lt

71714 lt 71715 lt 71716 lt 71717 lt 71718 lt 71719 lt 71720 lt 71721 lt 71722 lt

71723 lt 71724 lt 71725 lt 71726 lt

71727 lt 71728 lt 71729 lt 71730 lt

71731 lt 71732 lt 71733

The following characters have secondary weights 71734 71735 71736 71737

4 Character data

41 Character properties

The properties for Dogra in the Unicode Character Database format are

11800DOGRA LETTER ALo0LN11801DOGRA LETTER AALo0LN11802DOGRA LETTER ILo0LN11804DOGRA LETTER ULo0LN11806DOGRA LETTER ELo0LN11807DOGRA LETTER AILo0LN11808DOGRA LETTER OLo0LN1180ADOGRA LETTER KALo0LN1180BDOGRA LETTER KHALo0LN1180CDOGRA LETTER GALo0LN1180DDOGRA LETTER GHALo0LN1180EDOGRA LETTER NGALo0LN1180FDOGRA LETTER CALo0LN11810DOGRA LETTER CHALo0LN11811DOGRA LETTER JALo0LN11812DOGRA LETTER JHALo0LN11813DOGRA LETTER NYALo0LN

11

Proposal to encode the Dogra script in Unicode Anshuman Pandey

11814DOGRA LETTER TTALo0LN11815DOGRA LETTER TTHALo0LN11816DOGRA LETTER DDALo0LN11817DOGRA LETTER DDHALo0LN11818DOGRA LETTER NNALo0LN11819DOGRA LETTER TALo0LN1181ADOGRA LETTER THALo0LN1181BDOGRA LETTER DALo0LN1181CDOGRA LETTER DHALo0LN1181DDOGRA LETTER NALo0LN1181EDOGRA LETTER PALo0LN1181FDOGRA LETTER PHALo0LN11820DOGRA LETTER BALo0LN11821DOGRA LETTER BHALo0LN11822DOGRA LETTER MALo0LN11823DOGRA LETTER YALo0LN11824DOGRA LETTER RALo0LN11825DOGRA LETTER LALo0LN11826DOGRA LETTER VALo0LN11827DOGRA LETTER SHALo0LN11828DOGRA LETTER SSALo0LN11829DOGRA LETTER SALo0LN1182ADOGRA LETTER HALo0LN1182BDOGRA LETTER RRALo0LN1182CDOGRA VOWEL SIGN AAMc0LN1182DDOGRA VOWEL SIGN IMc0LN1182EDOGRA VOWEL SIGN IIMc0LN1182FDOGRA VOWEL SIGN UMn0NSMN11830DOGRA VOWEL SIGN UUMn0NSMN11831DOGRA VOWEL SIGN VOCALIC RMn0NSMN11832DOGRA VOWEL SIGN EMn0NSMN11833DOGRA VOWEL SIGN AIMn0NSMN11834DOGRA VOWEL SIGN OMn0NSMN11835DOGRA VOWEL SIGN AUMn0NSMN11836DOGRA SIGN ANUSVARAMn0NSMN11837DOGRA SIGN VISARGAMc0LN11838DOGRA SIGN VIRAMAMn9NSMN11839DOGRA SIGN NUKTAMn7NSMN1183ADOGRA ABBREVIATION SIGNPo0LN

42 Linebreaking

Linebreaking properties given in the data format of LineBreaktxt

118001182B AL Lo [44] DOGRA LETTER A DOGRA LETTER RRA1182C1182E CM Mc [3] DOGRA VOWEL SIGN AA DOGRA VOWEL SIGN II1182F11835 CM Mn [7] DOGRA VOWEL SIGN U DOGRA VOWEL SIGN AU11836 CM Mn DOGRA SIGN ANUSVARA11837 CM Mc DOGRA SIGN VISARGA1183811839 CM Mn [2] DOGRA SIGN VIRAMA DOGRA SIGN NUKTA1183A AL Po DOGRA ABBREVIATION SIGN

4.3 Syllabic categories

Syllabic categories, given in the format of IndicSyllabicCategory.txt:

# Indic_Syllabic_Category=Bindu
11836 ; Bindu # Mn DOGRA SIGN ANUSVARA

# Indic_Syllabic_Category=Visarga
11837 ; Visarga # Mc DOGRA SIGN VISARGA

# Indic_Syllabic_Category=Nukta
11839 ; Nukta # Mn DOGRA SIGN NUKTA

# Indic_Syllabic_Category=Virama
11838 ; Virama # Mn DOGRA SIGN VIRAMA

# Indic_Syllabic_Category=Vowel_Independent
11800..11809 ; Vowel_Independent # Lo [10] DOGRA LETTER A..DOGRA LETTER AU

# Indic_Syllabic_Category=Vowel_Dependent
1182C..1182E ; Vowel_Dependent # Mc [3] DOGRA VOWEL SIGN AA..DOGRA VOWEL SIGN II
1182F..11835 ; Vowel_Dependent # Mn [7] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN AU

# Indic_Syllabic_Category=Consonant
1180A..1182B ; Consonant # Lo [34] DOGRA LETTER KA..DOGRA LETTER RRA
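The syllabic categories can be turned into a simple lookup. The following Python sketch is illustrative only: the function name `syllabic_category` is hypothetical, the code points are those proposed in this document (to be checked against the published standard), and the independent vowels are assumed to occupy 11800..11809 as in the names list.

```python
# Ranges taken from the syllabic category data in this proposal (assumed values).
RANGES = [
    (0x11800, 0x11809, "Vowel_Independent"),  # assumption: A..AU as in the names list
    (0x1180A, 0x1182B, "Consonant"),
    (0x1182C, 0x11835, "Vowel_Dependent"),
    (0x11836, 0x11836, "Bindu"),
    (0x11837, 0x11837, "Visarga"),
    (0x11838, 0x11838, "Virama"),
    (0x11839, 0x11839, "Nukta"),
]


def syllabic_category(ch):
    """Return the proposed Indic_Syllabic_Category of one character, or "Other"."""
    cp = ord(ch)
    for first, last, category in RANGES:
        if first <= cp <= last:
            return category
    return "Other"


def categories(text):
    """Return the category of every character in a string, in order."""
    return [syllabic_category(ch) for ch in text]
```

For the sequence <1180A, 11838, 11828>, `categories` returns Consonant, Virama, Consonant, which is the shape a rendering engine would look for when forming a conjunct.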

4.4 Positional categories

Positional data for characters, in the format of IndicPositionalCategory.txt:

# Indic_Positional_Category=Right
1182C ; Right # Mc DOGRA VOWEL SIGN AA
1182E ; Right # Mc DOGRA VOWEL SIGN II
11837 ; Right # Mc DOGRA SIGN VISARGA

# Indic_Positional_Category=Left
1182D ; Left # Mc DOGRA VOWEL SIGN I

# Indic_Positional_Category=Top
11836 ; Top # Mn DOGRA SIGN ANUSVARA
11832..11835 ; Top # Mn [4] DOGRA VOWEL SIGN E..DOGRA VOWEL SIGN AU

# Indic_Positional_Category=Bottom
1182F..11831 ; Bottom # Mn [3] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN VOCALIC R
11838 ; Bottom # Mn DOGRA SIGN VIRAMA
11839 ; Bottom # Mn DOGRA SIGN NUKTA

4.5 Script extensions

The following characters should be extended for usage with the Dogra script in ScriptExtensions.txt:

0964 # Po DEVANAGARI DANDA
0965 # Po DEVANAGARI DOUBLE DANDA
0966..096F # Nd [10] DEVANAGARI DIGIT ZERO..DEVANAGARI DIGIT NINE
A830..A835 # No [6] NORTH INDIC FRACTION ONE QUARTER..NORTH INDIC FRACTION THREE SIXTEENTHS
A836..A837 # So [2] NORTH INDIC QUARTER MARK..NORTH INDIC PLACEHOLDER MARK
A838 # Sc NORTH INDIC RUPEE MARK
A839 # So NORTH INDIC QUANTITY MARK

4.6 ‘Confusable’ characters

Some Dogra characters that resemble characters encoded in other script blocks are:

DOGRA LETTER U             11187 SHARADA LETTER U
DOGRA LETTER UU            11188 SHARADA LETTER UU
DOGRA LETTER O             11684 TAKRI LETTER U
DOGRA LETTER KA            0A15 GURMUKHI LETTER KA
DOGRA LETTER CA            11196 SHARADA LETTER CA
DOGRA LETTER TTHA          1119C SHARADA LETTER TTHA
DOGRA LETTER TA            11699 TAKRI LETTER TA
DOGRA LETTER DHA           11680 TAKRI LETTER A
DOGRA LETTER PHA           1169F TAKRI LETTER PHA
DOGRA LETTER BHA           116A1 TAKRI LETTER BHA
DOGRA LETTER YA            116A3 TAKRI LETTER YA
DOGRA LETTER SHA           0936 DEVANAGARI LETTER SHA
DOGRA LETTER SSA           111B0 SHARADA LETTER SSA
DOGRA VOWEL SIGN U         0941 DEVANAGARI VOWEL SIGN U
DOGRA VOWEL SIGN UU        0942 DEVANAGARI VOWEL SIGN UU
DOGRA VOWEL SIGN E         116B2 TAKRI VOWEL SIGN E
DOGRA VOWEL SIGN AI        116B3 TAKRI VOWEL SIGN AI
DOGRA VOWEL SIGN O         116B4 TAKRI VOWEL SIGN O
DOGRA VOWEL SIGN AU        116B5 TAKRI VOWEL SIGN AU
DOGRA SIGN ANUSVARA        0902 DEVANAGARI SIGN ANUSVARA
DOGRA SIGN VIRAMA          094D DEVANAGARI SIGN VIRAMA
DOGRA SIGN NUKTA           093C DEVANAGARI SIGN NUKTA
DOGRA ABBREVIATION SIGN    0970 DEVANAGARI ABBREVIATION SIGN
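A list of this kind can be used to flag look-alike characters in text. The Python sketch below is a minimal illustration, not a security mechanism: the table holds only a few of the pairs listed in this section, the Dogra code points are those proposed in this document, and the name `lookalikes` is hypothetical.

```python
# A subset of the look-alike pairs from this section: Dogra code point -> similar character.
# Dogra values are the ones proposed in this document (assumed, not verified elsewhere).
CONFUSABLE = {
    0x11804: 0x11187,  # DOGRA LETTER U ~ SHARADA LETTER U
    0x1180A: 0x0A15,   # DOGRA LETTER KA ~ GURMUKHI LETTER KA
    0x11827: 0x0936,   # DOGRA LETTER SHA ~ DEVANAGARI LETTER SHA
    0x11836: 0x0902,   # DOGRA SIGN ANUSVARA ~ DEVANAGARI SIGN ANUSVARA
    0x11838: 0x094D,   # DOGRA SIGN VIRAMA ~ DEVANAGARI SIGN VIRAMA
    0x11839: 0x093C,   # DOGRA SIGN NUKTA ~ DEVANAGARI SIGN NUKTA
    0x1183A: 0x0970,   # DOGRA ABBREVIATION SIGN ~ DEVANAGARI ABBREVIATION SIGN
}


def lookalikes(text):
    """Return (index, dogra_code_point, similar_code_point) for each listed character."""
    return [
        (i, ord(ch), CONFUSABLE[ord(ch)])
        for i, ch in enumerate(text)
        if ord(ch) in CONFUSABLE
    ]
```

Characters that have no entry in the table are simply skipped, so the result says nothing about them either way.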

5 References

Bhāskarācārya. 1872. Līlavatī. Translation into Dogri. Jammu: Vidya Vilas Press.

Grierson, George A. 1916. The Linguistic Survey of India. Vol. IX. Indo-Aryan Family. Central Group. Part I. Specimens of Western Hindī and Pañjābī. Calcutta: Office of the Superintendent of Government Printing, India.

Leitner, Gottlieb William. 1882. “A Collection of Specimens of Commercial and Other Alphabets and Handwritings as also of Multiplication Tables Current in Various Parts of the Panjab, Sind and the North West Provinces”. Published as Appendix V of History of Indigenous Education in the Punjab. Lahore: Anjuman-i-Punjab Press.

Pandey, Anshuman. 2007. “Proposal to Encode North Indic Number Forms in ISO/IEC 10646” (L2/07-354). http://www.unicode.org/L2/L2007/07354-north-indic.pdf

Pandey, Anshuman. 2009. “Proposal to Encode the Takri Script in ISO/IEC 10646” (L2/09-424). http://www.unicode.org/L2/L2009/09424-takri.pdf

Pandey, Anshuman. 2015. “Preliminary Proposal to Encode the Dogra Script in Unicode” (L2/15-213). http://www.unicode.org/L2/L2015/15213-dogra.pdf

Staal, Fritz. 1984. Stamps of Jammu & Kashmir. New York: The Collectors Club.

von der Lin, Carol A. “Collection Kashmir”. “Philatelic Glossary: Dōgrī-Takrī Scripts”. http://www.kashmirstamps.com/DogriGloss.html

6 Acknowledgments

I would like to thank Christopher Shackle (School of Oriental and African Studies, London) for reviewing this proposal.

This project was made possible in part through a Google Research Award granted to Deborah Anderson for the Script Encoding Initiative and a grant from the United States National Endowment for the Humanities (PR-50205-15), which funds the Universal Scripts Project (part of the Script Encoding Initiative at the University of California, Berkeley). Any views, findings, conclusions or recommendations expressed in this publication do not necessarily reflect those of Google or the National Endowment for the Humanities.

Code chart: Dogra, range 11800–1184F (proposed characters at 11800–1183A; columns 1180–1184, rows 0–F).

Names list: Dogra, 11800–1183A

Punctuation
1183A DOGRA ABBREVIATION SIGN

Independent vowels
11800 DOGRA LETTER A
11801 DOGRA LETTER AA
11802 DOGRA LETTER I
11803 DOGRA LETTER II
11804 DOGRA LETTER U
11805 DOGRA LETTER UU
11806 DOGRA LETTER E
11807 DOGRA LETTER AI
11808 DOGRA LETTER O
11809 DOGRA LETTER AU

Consonants
1180A DOGRA LETTER KA
1180B DOGRA LETTER KHA
1180C DOGRA LETTER GA
1180D DOGRA LETTER GHA
1180E DOGRA LETTER NGA
1180F DOGRA LETTER CA
11810 DOGRA LETTER CHA
11811 DOGRA LETTER JA
11812 DOGRA LETTER JHA
11813 DOGRA LETTER NYA
11814 DOGRA LETTER TTA
11815 DOGRA LETTER TTHA
11816 DOGRA LETTER DDA
11817 DOGRA LETTER DDHA
11818 DOGRA LETTER NNA
11819 DOGRA LETTER TA
1181A DOGRA LETTER THA
1181B DOGRA LETTER DA
1181C DOGRA LETTER DHA
1181D DOGRA LETTER NA
1181E DOGRA LETTER PA
1181F DOGRA LETTER PHA
11820 DOGRA LETTER BA
11821 DOGRA LETTER BHA
11822 DOGRA LETTER MA
11823 DOGRA LETTER YA
11824 DOGRA LETTER RA
11825 DOGRA LETTER LA
11826 DOGRA LETTER VA
11827 DOGRA LETTER SHA
11828 DOGRA LETTER SSA
11829 DOGRA LETTER SA
1182A DOGRA LETTER HA
1182B DOGRA LETTER RRA

Dependent vowel signs
1182C DOGRA VOWEL SIGN AA
1182D DOGRA VOWEL SIGN I
1182E DOGRA VOWEL SIGN II
1182F DOGRA VOWEL SIGN U
11830 DOGRA VOWEL SIGN UU
11831 DOGRA VOWEL SIGN VOCALIC R
11832 DOGRA VOWEL SIGN E
11833 DOGRA VOWEL SIGN AI
11834 DOGRA VOWEL SIGN O
11835 DOGRA VOWEL SIGN AU

Various signs
11836 DOGRA SIGN ANUSVARA
11837 DOGRA SIGN VISARGA
11838 DOGRA SIGN VIRAMA
11839 DOGRA SIGN NUKTA

Figure 1: Comparison of Old Dogra with Takri and other scripts (from Leitner 1882: Set 2).

Figure 2: Comparison of Old Dogra with Takri and other scripts (from Leitner 1882: Set 2).

Figure 3: Specimen of the Dogri language written in Old Dogra (from Grierson 1916: 760).

Figure 4: Another specimen of the Dogri language written in Old Dogra (from Grierson 1916: 772).

Figure 5: Chart of the Old Dogra script (from Grierson 1916: 641).

Figure 6: Comparison of Gurmukhi, Kangra, and Old Dogra scripts (from Grierson 1916: 642).

Figure 7: Comparison of New Dogra (‘Dogri’) and Devanagari vowel and consonant letters (from Staal 1984: 33).

Figure 8: Comparison of New Dogra (‘Dogri’) and Devanagari vowel signs (from Staal 1984: 34).

Figure 9: The first page of a Dogri translation of the Līlavatī by Bhāskarācārya, a Sanskrit treatise on mathematics, printed in the New Dogra script in 1872 (p. 1). Figure 10 contains the second page of this book.

Figure 10: Another page from the Līlavatī in New Dogra (p. 2).

Figure 11: Page from the Līlavatī showing the use of printed New Dogra digits (p. 3).

Figure 12: Usage of fractions for denoting currency in New Dogra (from Staal 1984: 75). The text appears to be typeset using metal fonts. The fraction signs and currency marks may be represented using characters from the ‘Common Indic Number Forms’ block (U+A830) in Unicode.

Figure 13: Two specimens of a trilingual stamp for expedited mail from Jammu State in Old Dogra, Persian, and Devanagari scripts. The Dogra text reads <U+1180A U+1180A U+11825> <U+11811 U+11824 U+1182F U+11824 U+1182E> kakal jarurī. The Persian reads خط ضروری ḵẖat zarūrī; the Devanagari reads आवश्यक पत्र āvaśyaka patra.

Figure 14: Four bilingual regular postage stamps from Jammu State with text in the Old Dogra and Persian scripts. The set includes half ‘anna’ stamps and a one ‘anna’ stamp. In all stamps the Dogra text at the top reads <U+11811 U+11822 U+11804> jamu ‘Jammu’ and <U+1180A U+11810 U+11822 U+1182E U+11824> kachamīr (= kaśmīr) ‘Kashmir’. In the half ānnā stamps the first line of the center is Persian ۱۹۲۳ نم انه 1923 [samvat] nim ānah ‘half ānnā’; the second line of the center is Dogra <U+11800 U+1181B U+11800 U+1181D> ada ana ‘half ānnā’. In the one ānnā stamp the first line of the center text is Persian یك انه yak ānah ‘one ānnā’; the second line of the center is Dogra <U+11806 U+1180A U+11800 U+1181D> ekana ‘one ānnā’. In all four stamps the third line of the center is the date 1923 [samvat], written in Devanagari-like digits १९२३. The Persian text at the bottom of the stamp reads قلمرو سرکار جمون و کشمیر qalamrau sarkār jammūn o kaśmīr ‘dominions of the ruler of Jammu and Kashmir’.

Figure 15: The mast of a two rupee non-judicial stamp paper from Kashmir State. Panels: detail of top left (Devanagari); detail of top right (New Dogra); detail of center (New Dogra, Devanagari, and Urdu).

Figure 16: The mast of a five rupee non-judicial stamp paper from Kashmir State containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.

Figure 17: A one anna stamp from Jammu and Kashmir State with text in New Dogra, Devanagari, and Arabic (Urdu).

Figure 18: Telegraph stamps from Jammu and Kashmir State with text in Dogra, Devanagari, Urdu, and English. The Dogra denominations show usage of Common Indic fractions and the rupee mark.

Figure 19: A state postcard from Jammu and Kashmir with post mark of 19 April 1891. The inscription at top is in New Dogra and the content is in Persian. The postmarks are in English and Dogra. Transliterations of the Dogra text:

postakarda kalamrau jammu kasmir va tibbataha: “Postcard of the realm of Jammu, Kashmir and Tibet”

isa tarapha siva pate ke kucha na likho: “Do not write anything on this side except for the address”

jamu, sinagar: “Jammu”, “Srinagar”

bhaisakh (4), 4 vaisakh: “Vaisakh 4”, “4 Vaisakh”

Figure 20: Bank notes from Jammu and Kashmir State containing text in New Dogra, Devanagari, and Arabic (Persian) scripts. Panels: one rupee banknote; twenty rupee banknote.

Table 2: Comparison of Chamba and Dogra vowel letters and signs (glyphs identified by code point; Chamba forms are those of the ‘Takri’ block).

Vowel letters
Chamba     Dogra
U+11680    U+11800
U+11681    U+11801
U+11682    U+11802
U+11683    (none)
U+11684    U+11804
U+11685    (none)
U+11686    U+11806
U+11687    U+11807
U+11688    U+11808
U+11689    (none)

Vowel signs (the vowel a has no dependent form)
Chamba     Dogra
U+116AD    U+1182C
U+116AE    U+1182D
U+116AF    U+1182E
U+116B0    U+1182F
U+116B1    U+11830
U+116B2    U+11832
U+116B3    U+11833
U+116B4    U+11834
U+116B5    U+11835

Table 3: Comparison of Chamba and Dogra consonant letters (glyphs identified by code point; Chamba forms are those of the ‘Takri’ block).

Chamba     Dogra
U+1168A    U+1180A
U+1168B    U+1180B
U+1168C    U+1180C
U+1168D    U+1180D
U+1168E    U+1180E
U+1168F    U+1180F
U+11690    U+11810
U+11691    U+11811
U+11692    U+11812
U+11693    U+11813
U+11694    U+11814
U+11695    U+11815
U+11696    U+11816
U+11697    U+11817
U+11698    U+11818
U+11699    U+11819
U+1169A    U+1181A
U+1169B    U+1181B
U+1169C    U+1181C
U+1169D    U+1181D
U+1169E    U+1181E
U+1169F    U+1181F
U+116A0    U+11820
U+116A1    U+11821
U+116A2    U+11822
U+116A3    U+11823
U+116A4    U+11824
U+116A5    U+11825
U+116A6    U+11826
U+116A7    U+11827
(none)     U+11828
U+116A8    U+11829
U+116A9    U+1182A
U+116AA    U+1182B

ISO/IEC JTC 1/SC 2/WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646

Please fill all the sections A, B and C below. Please read Principles and Procedures Document (P & P) from http://std.dkuug.dk/JTC1/SC2/WG2/docs/principles.html for guidelines and details before filling this form. Please ensure you are using the latest Form from http://std.dkuug.dk/JTC1/SC2/WG2/docs/summaryform.html. See also http://std.dkuug.dk/JTC1/SC2/WG2/docs/roadmaps.html for latest Roadmaps.

A. Administrative

1. Title: Proposal to encode the Dogra script in Unicode
2. Requester's name: Anshuman Pandey, Script Encoding Initiative
3. Requester type (Member body/Liaison/Individual contribution): Liaison contribution
4. Submission date: 4 November 2015
5. Requester's reference (if applicable):
6. Choose one of the following: This is a complete proposal: Yes; (or) More information will be provided later:

B. Technical – General

1. Choose one of the following:
a. This proposal is for a new script (set of characters): Yes. Proposed name of script: Dogra
b. The proposal is for addition of character(s) to an existing block: Name of the existing block:
2. Number of characters in proposal: 59
3. Proposed category (select one from below, see section 2.2 of P&P document): A-Contemporary; B.1-Specialized (small collection); B.2-Specialized (large collection); C-Major extinct; D-Attested extinct; E-Minor extinct (X); F-Archaic Hieroglyphic or Ideographic; G-Obscure or questionable usage symbols
4. Is a repertoire including character names provided? Yes
a. If YES, are the names in accordance with the “character naming guidelines” in Annex L of P&P document?
b. Are the character shapes attached in a legible form suitable for review?
5. Fonts related:
a. Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the standard? Anshuman Pandey
b. Identify the party granting a license for use of the font by the editors (include address, e-mail, ftp-site, etc.): Anshuman Pandey
6. References:
a. Are references (to other character sets, dictionaries, descriptive texts etc.) provided? Yes
b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? Yes
7. Special encoding issues: Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration etc. (if yes please enclose information)? Yes
8. Additional Information: Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org for such information on other scripts. Also see Unicode Character Database (http://www.unicode.org/reports/tr44/) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard.

Form number: N4502-F (Original 1994-10-14; Revised 1995-01, 1995-04, 1996-04, 1996-08, 1999-03, 2001-05, 2001-09, 2003-11, 2005-01, 2005-09, 2005-10, 2007-03, 2008-05, 2009-11, 2011-03, 2012-01)

C. Technical – Justification

1. Has this proposal for addition of character(s) been submitted before? No. If YES, explain:
2. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? Yes. If YES, with whom? Christopher Shackle (SOAS, London). If YES, available relevant documents:
3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? No. Reference: Historical script, currently not used.
4. The context of use for the proposed characters (type of use; common or rare): Common. Reference:
5. Are the proposed characters in current use by the user community? No. If YES, where? Reference:
6. After giving due considerations to the principles in the P&P document, must the proposed characters be entirely in the BMP? N/A. If YES, is a rationale provided? If YES, reference:
7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? Yes
8. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? No. If YES, is a rationale for its inclusion provided? If YES, reference:
9. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? No. If YES, is a rationale for its inclusion provided? If YES, reference:
10. Can any of the proposed character(s) be considered to be similar (in appearance or function) to, or could be confused with, an existing character? No. If YES, is a rationale for its inclusion provided? If YES, reference:
11. Does the proposal include use of combining characters and/or use of composite sequences? Yes. If YES, is a rationale for such use provided? Yes. If YES, reference: See text of proposal. Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided? If YES, reference:
12. Does the proposal contain characters with any special properties such as control function or similar semantics? Yes. If YES, describe in detail (include attachment if necessary): Virama; see text of proposal.
13. Does the proposal contain any Ideographic compatibility characters? No. If YES, are the equivalent corresponding unified ideographic characters identified? If YES, reference:

71708

71716

There is evidence of usage of different forms of the same letter in the available sources For instance both71696 and are both used for

3.8 Anusvara

The DOGRA SIGN ANUSVARA (U+11836) is used for marking nasalization.

3.9 Visarga

The DOGRA SIGN VISARGA (U+11837) is used for indicating post-vocalic aspiration in words of Sanskrit origin. It is shown below in the word <U+1181D U+11822 U+11837> namaḥ “homage to”.

3.10 Nukta

The DOGRA SIGN NUKTA (U+11839) is used for representing sounds that are not native to Dogri and related languages.

3.11 Virama

The DOGRA SIGN VIRAMA (U+11838) has two functions, similar to the corresponding character of Devanagari. It is used as a halanta for marking the absence of the inherent vowel of a consonant letter. It is also a control character that is used for producing conjuncts.

3.12 Consonant Conjuncts

Consonant clusters are rendered in three ways: 1) with visible halanta beneath the full form of each bare consonant; 2) with a half-form of the initial consonant followed by the full form of the following letter; and 3) as ligatures. There are no formal conventions for the written expression of conjuncts. The DOGRA SIGN VIRAMA (U+11838) is used for the encoded representation of conjuncts.

3.12.1 Types of conjuncts

Some representations of Dogra conjuncts, as found in the available sources, are illustrated below.

• Visible halanta: In both Old and New Dogra, the most common method of representing consonant clusters is by placing a halanta beneath each bare consonant.

• Half-forms: Another method is to use half-forms of letters, if the graphical structure provides such an opportunity.

The above shows the conjunct śra in the word śrī (a Sanskrit honorific). The half form is produced by dropping the right vertical bar of U+11827 DOGRA LETTER SHA. The conjunct śra is used consistently in New Dogra for representing the cluster śra; the form with visible halanta beneath a full form of DOGRA LETTER SHA does not occur in the available sources.

• Ligatures: There are two types of ligature. In the first, the shapes of individual letters are visible. In the below, the boxed text contains the conjunct sṭa from the word sṭāṃp, a transcription of the English “stamp”.

The second type is an atomic (akhaṇḍa) ligature, whose form may be considered a distinctive letter. Such a ligature occurs in the excerpt below.

The boxed text above contains the atomic ligature kṣa, which occurs in the word akṣaroṃ “in letters” (oblique form of akṣara “letter”). The conjunct kṣa is used consistently in New Dogra for representing the cluster kṣa. Its shape may be influenced by Devanagari kṣa.

• Forms of RA: Special conjunct forms of U+11824 DOGRA LETTER RA, e.g. repha or vattu, do not generally occur. Cluster-initial RA is represented as a full form with halanta, as in rṇa in the word saṃpūrṇa below,

and non-initial RA is treated similarly, as in pra in the word prabhu below.

However, there is one instance in the available sources in which cluster-initial RA is represented as a repha. The occurrence is curious because the word caturbhuja “square”, in which repha occurs, appears three times in proximity, but is spelled differently in all three instances.

In the boxed text at the top, the RA in caturbhuja is written with halanta: <U+1180F U+11819 U+1182F U+11824 U+11838 U+11821 U+1182F U+11811> caturbhuja. In the text at bottom left, RA occurs with its inherent vowel intact, without halanta: <U+1180F U+11819 U+1182F U+11824 U+11821 U+1182F U+11811> caturabhuja. At bottom right, RA is represented as repha in caturbhuja.

The usage of repha in Dogra is to be considered anomalous. There is no present need to support this form of RA in encoded representation.

3.12.2 Encoded representation of conjuncts

The Devanagari model for the representation of conjuncts is appropriate for Dogra. In encoded text, conjuncts are produced by placing the control character U+11838 DOGRA SIGN VIRAMA after each non-final consonant in a cluster, i.e. <(C, VIRAMA), C>, with the bracketed pair repeated for longer clusters.

Representations of conjuncts rendered as ligatures are:

kṣa <U+1180A KA, U+11838 VIRAMA, U+11828 SSA>
sṭa <U+11829 SA, U+11838 VIRAMA, U+11814 TTA>

Examples of conjuncts rendered using half-forms are:

śra <U+11827 SHA, U+11838 VIRAMA, U+11824 RA>

Conjuncts displayed using visible halanta are:

pra <U+1181E PA, U+11838 VIRAMA, U+11824 RA>
rṇa <U+11824 RA, U+11838 VIRAMA, U+11818 NNA>
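The encoding rule for conjuncts can be illustrated with a short Python sketch. It is only an illustration under stated assumptions: the code points are those proposed in this document (they should be checked against the published standard), and the names `LETTERS`, `encode_conjunct` and `as_hex` are hypothetical helpers.

```python
# Code points as proposed in this document (assumed values).
VIRAMA = 0x11838
LETTERS = {
    "ka": 0x1180A, "tta": 0x11814, "nna": 0x11818, "pa": 0x1181E,
    "ra": 0x11824, "sha": 0x11827, "ssa": 0x11828, "sa": 0x11829,
}


def encode_conjunct(*consonants):
    """Return the cluster as a string: VIRAMA after every consonant except the last."""
    if not consonants:
        raise ValueError("a cluster needs at least one consonant")
    code_points = []
    for name in consonants[:-1]:
        code_points += [LETTERS[name], VIRAMA]
    code_points.append(LETTERS[consonants[-1]])
    return "".join(chr(cp) for cp in code_points)


def as_hex(text):
    """Show a string as space-separated hexadecimal code points."""
    return " ".join(f"{ord(ch):04X}" for ch in text)


print(as_hex(encode_conjunct("ka", "ssa")))  # kṣa
print(as_hex(encode_conjunct("sha", "ra")))  # śra
```

Whether the resulting sequence is displayed as a ligature, with a half-form, or with visible halanta is decided by the font, not by the encoded text.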

3.12.3 Modifying conjunct formation

As is the case in Devanagari and other Indic scripts, a ligature may be broken using the generic control character U+200C ZERO WIDTH NON-JOINER (ZWNJ):

kṣa (ligature) <U+1180A KA, U+11838 VIRAMA, U+11828 SSA>
kṣa (visible halanta) <U+1180A KA, U+11838 VIRAMA, U+200C ZWNJ, U+11828 SSA>

The rendering of a conjunct as a ligature or with visible halanta is a matter of font design.
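A text-processing tool could request the visible-halanta form by inserting U+200C after the virama. The Python sketch below assumes the code points proposed in this document (virama at 11838, consonants at 1180A..1182B); the function name `request_visible_halanta` is hypothetical.

```python
ZWNJ = "\u200C"             # ZERO WIDTH NON-JOINER
VIRAMA = chr(0x11838)       # proposed DOGRA SIGN VIRAMA (assumed value)
CONSONANTS = range(0x1180A, 0x1182B + 1)  # proposed KA..RRA (assumed range)


def request_visible_halanta(text):
    """Insert ZWNJ after each virama that is directly followed by a consonant."""
    out = []
    for i, ch in enumerate(text):
        out.append(ch)
        following = text[i + 1] if i + 1 < len(text) else ""
        if ch == VIRAMA and following and ord(following) in CONSONANTS:
            out.append(ZWNJ)
    return "".join(out)


kssa = chr(0x1180A) + VIRAMA + chr(0x11828)
print([f"{ord(c):04X}" for c in request_visible_halanta(kssa)])
```

A virama at the end of a word is left unchanged, and running the function twice has no further effect because the virama is then followed by ZWNJ rather than by a consonant.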

3.13 Punctuation

Abbreviation sign: The DOGRA ABBREVIATION SIGN (U+1183A) is used for indicating truncation of text. In the excerpt below, the highlighted text <U+11825 U+1182E U+1183A> <U+11826 U+1183A> <U+11821 U+1182C U+1183A> (lī va bhā 23) is an abbreviation for līla vatī bhāga 23.

Danda-s: The daṇḍā and double daṇḍā are commonly used in Dogra documents for marking the ends of sentences and paragraphs. The below excerpt shows usage of the daṇḍā

and the following shows use of the double daṇḍā.

These daṇḍā-s are not proposed for encoding as separate characters in the ‘Dogra’ block. They resemble U+0964 DEVANAGARI DANDA and U+0965 DEVANAGARI DOUBLE DANDA and may be represented using these characters. The Devanagari daṇḍā-s are to be specified as extensions for Dogra.

Other signs: Other marks of punctuation, such as dashes, are also used.

A hyphen is used in some cases for indicating breaks in a word at the end of a line (see section 3.16).

3.14 Digits

Digits are commonly used in Dogra documents (see figure 11). The style of digits varies across written and printed sources. Some digits in Old Dogra sources resemble those of Takri (see figure 4), while those in New Dogra sources are similar to Devanagari. The excerpt below shows digits 1–9 and the number 10 as represented in the Līlavatī, written in New Dogra.

The majority of the digits shown here resemble Devanagari forms; the exception is ‘four’. The form of Dogra ‘four’ does not resemble U+116C4 TAKRI DIGIT FOUR, which is nearly identical to the Devanagari digit four. Its origins are not known at this time, but it may be related to U+111D4 SHARADA DIGIT FOUR. The forms of ‘eight’ and ‘nine’ are well-known variant forms of the regular Devanagari digits.

Additional research into Dogra digits is required before they may be considered for encoding. The available sources suggest that there may have been a cultural understanding that Dogra possessed unique digits. For instance, on the bank notes shown in figure 20, the digits written in Dogra and Devanagari are rather similar, but are nonetheless displayed twice in order to align the display of the currency value with each script used for the inscriptions.

At present, script-specific digits are not proposed for inclusion in the Dogra repertoire. Devanagari digits should be used normatively and should be extended for usage with Dogra. If future research yields distinctive digits for Dogra, there is sufficient space in the block to accommodate their encoding.

3.15 Number forms

Fraction signs and currency marks are commonly used in Dogra sources (see figure 12). These may be represented using the fraction and currency signs already encoded in the ‘Common Indic Number Forms’ block (U+A830). These characters are specified as extensions for Dogra. The ‘Common Indic Number Forms’ are described briefly in The Unicode Standard (chapter 22.3, ‘Numerals’, pp. 770–771) and in more detail in the original encoding proposal (L2/07-354).

3.16 Linebreaking

Linebreaking occurs after an orthographic syllable. Breaks are generally not indicated in informal sources, but are marked in literary materials using hyphens.

3.17 Collation

The primary collating order for Dogra is as follows:

U+11800 < U+11801 < U+11802 < U+11803 < U+11804 < U+11805 < U+11806 < U+11807 < U+11808 < U+11809 <
U+1180A < U+1180B < U+1180C < U+1180D < U+1180E < U+1180F < U+11810 < U+11811 <
U+11812 < U+11813 < U+11814 < U+11815 < U+11816 < U+11817 < U+11818 < U+11819 <
U+1181A < U+1181B < U+1181C < U+1181D < U+1181E < U+1181F < U+11820 < U+11821 <
U+11822 < U+11823 < U+11824 < U+11825 < U+11826 < U+11827 < U+11828 < U+11829 < U+1182A <
U+1182B < U+1182C < U+1182D < U+1182E <
U+1182F < U+11830 < U+11831 < U+11832 <
U+11833 < U+11834 < U+11835

The following characters have secondary weights: U+11836 DOGRA SIGN ANUSVARA, U+11837 DOGRA SIGN VISARGA, U+11838 DOGRA SIGN VIRAMA, U+11839 DOGRA SIGN NUKTA.
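The two-level ordering can be sketched as a sort key in Python. This is a simplification for illustration, not a conformant collation implementation: the code points are those proposed in this document, the relative order assigned to the four secondary-weight signs is an assumption (the proposal does not rank them), characters outside the block are ignored, and the name `sort_key` is hypothetical.

```python
# Proposed code points (assumed): primaries run A..VOWEL SIGN AU in code point order.
PRIMARY_FIRST, PRIMARY_LAST = 0x11800, 0x11835
# Secondary-weight signs; the ranking 1..4 is an assumption made for this sketch.
SECONDARY = {0x11836: 1, 0x11837: 2, 0x11838: 3, 0x11839: 4}


def sort_key(word):
    """Return (primary weights, secondary weights) for a Dogra string."""
    primary = []
    secondary = []
    for ch in word:
        cp = ord(ch)
        if PRIMARY_FIRST <= cp <= PRIMARY_LAST:
            primary.append(cp - PRIMARY_FIRST + 1)
            secondary.append(0)
        elif cp in SECONDARY:
            secondary.append(SECONDARY[cp])
        # characters outside the proposed block are ignored in this sketch
    return (tuple(primary), tuple(secondary))


KA, KHA, ANUSVARA = chr(0x1180A), chr(0x1180B), chr(0x11836)
words = [KHA, KA + ANUSVARA, KA]
print([[f"{ord(c):04X}" for c in w] for w in sorted(words, key=sort_key)])
```

With this key, a word that differs from another only by a sign such as anusvara sorts directly after it, and before any word whose letters differ.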

4 Character data

4.1 Character properties

The properties for Dogra in the Unicode Character Database format are:

11800;DOGRA LETTER A;Lo;0;L;;;;;N;;;;;
11801;DOGRA LETTER AA;Lo;0;L;;;;;N;;;;;
11802;DOGRA LETTER I;Lo;0;L;;;;;N;;;;;
11804;DOGRA LETTER U;Lo;0;L;;;;;N;;;;;
11806;DOGRA LETTER E;Lo;0;L;;;;;N;;;;;
11807;DOGRA LETTER AI;Lo;0;L;;;;;N;;;;;
11808;DOGRA LETTER O;Lo;0;L;;;;;N;;;;;
1180A;DOGRA LETTER KA;Lo;0;L;;;;;N;;;;;
1180B;DOGRA LETTER KHA;Lo;0;L;;;;;N;;;;;
1180C;DOGRA LETTER GA;Lo;0;L;;;;;N;;;;;
1180D;DOGRA LETTER GHA;Lo;0;L;;;;;N;;;;;
1180E;DOGRA LETTER NGA;Lo;0;L;;;;;N;;;;;
1180F;DOGRA LETTER CA;Lo;0;L;;;;;N;;;;;
11810;DOGRA LETTER CHA;Lo;0;L;;;;;N;;;;;
11811;DOGRA LETTER JA;Lo;0;L;;;;;N;;;;;
11812;DOGRA LETTER JHA;Lo;0;L;;;;;N;;;;;
11813;DOGRA LETTER NYA;Lo;0;L;;;;;N;;;;;
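Records of this kind can be read mechanically. The Python sketch below assumes the semicolon-delimited field layout of UnicodeData.txt (code point, name, general category, canonical combining class, bidirectional class in the first five fields); the sample lines are written out in that layout from the values given in this proposal, and the names `Record` and `parse_record` are hypothetical.

```python
from typing import NamedTuple


class Record(NamedTuple):
    code: int
    name: str
    category: str
    combining_class: int
    bidi_class: str


def parse_record(line):
    """Parse the first five fields of one UnicodeData-style line."""
    fields = line.strip().split(";")
    return Record(int(fields[0], 16), fields[1], fields[2], int(fields[3]), fields[4])


# Sample records with the property values listed in this proposal.
SAMPLE = """\
11800;DOGRA LETTER A;Lo;0;L;;;;;N;;;;;
1182C;DOGRA VOWEL SIGN AA;Mc;0;L;;;;;N;;;;;
11838;DOGRA SIGN VIRAMA;Mn;9;NSM;;;;;N;;;;;
11839;DOGRA SIGN NUKTA;Mn;7;NSM;;;;;N;;;;;
"""

records = [parse_record(line) for line in SAMPLE.splitlines()]
for rec in records:
    print(f"{rec.code:04X} {rec.category} ccc={rec.combining_class} {rec.name}")
```

The non-zero combining classes (9 for the virama, 7 for the nukta) are the values that matter for canonical reordering of marks.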

11

Proposal to encode the Dogra script in Unicode Anshuman Pandey

11814DOGRA LETTER TTALo0LN11815DOGRA LETTER TTHALo0LN11816DOGRA LETTER DDALo0LN11817DOGRA LETTER DDHALo0LN11818DOGRA LETTER NNALo0LN11819DOGRA LETTER TALo0LN1181ADOGRA LETTER THALo0LN1181BDOGRA LETTER DALo0LN1181CDOGRA LETTER DHALo0LN1181DDOGRA LETTER NALo0LN1181EDOGRA LETTER PALo0LN1181FDOGRA LETTER PHALo0LN11820DOGRA LETTER BALo0LN11821DOGRA LETTER BHALo0LN11822DOGRA LETTER MALo0LN11823DOGRA LETTER YALo0LN11824DOGRA LETTER RALo0LN11825DOGRA LETTER LALo0LN11826DOGRA LETTER VALo0LN11827DOGRA LETTER SHALo0LN11828DOGRA LETTER SSALo0LN11829DOGRA LETTER SALo0LN1182ADOGRA LETTER HALo0LN1182BDOGRA LETTER RRALo0LN1182CDOGRA VOWEL SIGN AAMc0LN1182DDOGRA VOWEL SIGN IMc0LN1182EDOGRA VOWEL SIGN IIMc0LN1182FDOGRA VOWEL SIGN UMn0NSMN11830DOGRA VOWEL SIGN UUMn0NSMN11831DOGRA VOWEL SIGN VOCALIC RMn0NSMN11832DOGRA VOWEL SIGN EMn0NSMN11833DOGRA VOWEL SIGN AIMn0NSMN11834DOGRA VOWEL SIGN OMn0NSMN11835DOGRA VOWEL SIGN AUMn0NSMN11836DOGRA SIGN ANUSVARAMn0NSMN11837DOGRA SIGN VISARGAMc0LN11838DOGRA SIGN VIRAMAMn9NSMN11839DOGRA SIGN NUKTAMn7NSMN1183ADOGRA ABBREVIATION SIGNPo0LN

42 Linebreaking

Linebreaking properties given in the data format of LineBreaktxt

118001182B AL Lo [44] DOGRA LETTER A DOGRA LETTER RRA1182C1182E CM Mc [3] DOGRA VOWEL SIGN AA DOGRA VOWEL SIGN II1182F11835 CM Mn [7] DOGRA VOWEL SIGN U DOGRA VOWEL SIGN AU11836 CM Mn DOGRA SIGN ANUSVARA11837 CM Mc DOGRA SIGN VISARGA1183811839 CM Mn [2] DOGRA SIGN VIRAMA DOGRA SIGN NUKTA1183A AL Po DOGRA ABBREVIATION SIGN

43 Syllabic categories

Syllabic categories given in the format of IndicSyllabicCategorytxt

Indic_Syllabic_Category=Bindu11836 Bindu Mn DOGRA SIGN ANUSVARA

12

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Indic_Syllabic_Category=Visarga11837 Visarga Mc DOGRA SIGN VISARGA

Indic_Syllabic_Category=Nukta11839 Nukta Mn DOGRA SIGN NUKTA

Indic_Syllabic_Category=Virama11838 Virama Mn DOGRA SIGN VIRAMA

Indic_Syllabic_Category=Vowel_Independent1180011808 Vowel_Independent Lo [3] DOGRA LETTER ADOGRA LETTER AU

Indic_Syllabic_Category=Vowel_Dependent1182C1182E Vowel_Dependent Mc [3] DOGRA VOWEL SIGN AADOGRA VOWEL SIGN I1182F11835 Vowel_Dependent Mn [7] DOGRA VOWEL SIGN UDOGRA VOWEL SIGN AU

Indic_Syllabic_Category=Consonant1180A1182B Consonant Lo [34] DOGRA LETTER KADOGRA LETTER RRA

44 Positional categories

Positional data for characters in the format of IndicPositionalCategorytxt

Indic_Positional_Category=Right1182C Right Mc DOGRA VOWEL SIGN A1182E Right Mc DOGRA VOWEL SIGN II11837 Right Mc DOGRA SIGN VISARGA

Indic_Positional_Category=Left1182D Left Mc DOGRA VOWEL SIGN I

Indic_Positional_Category=Top11836 Top Mn DOGRA SIGN ANUSVARA1183211835 Top Mn [4] DOGRA VOWEL SIGN EDOGRA VOWEL SIGN AU

Indic_Positional_Category=Bottom1182F11831 Bottom Mn [3] DOGRA VOWEL SIGN UDOGRA VOWEL SIGN VOCALIC R11838 Bottom Mn DOGRA SIGN VIRAMA11839 Bottom Mn DOGRA SIGN NUKTA

45 Script extensions

The following characters should be extended for usage with the Dogra script in ScriptExtensionstxt

0964 Po DEVANAGARI DANDA0965 Po DEVANAGARI DOUBLE DANDA

0966096F Nd [10] DEVANAGARI DIGIT ZERODEVANAGARI DIGIT NINE

A830A835 No [6] NORTH INDIC FRACTION ONE QUARTERNORTH INDIC FRACTION THREE SIXTEENTHS

A836A837 So [2] NORTH INDIC QUARTER MARKNORTH INDIC PLACEHOLDER MARKA838 Sc NORTH INDIC RUPEE MARKA839 So NORTH INDIC QUANTITY MARK

13

Proposal to encode the Dogra script in Unicode Anshuman Pandey

46 lsquoConfusablersquo characters

Some Dogra characters that resemble characters encoded in other script blocks are

DOGRA LETTER U 11187 SHARADA LETTER UDOGRA LETTER UU 11188 SHARADA LETTER UUDOGRA LETTER O 11684 TAKRI LETTER UDOGRA LETTER KA 0A15 GURMUKHI LETTER KADOGRA LETTER CA 11196 SHARADA LETTER CADOGRA LETTER TTHA 1119C SHARADA LETTER TTHADOGRA LETTER TA 11699 TAKRI LETTER TADOGRA LETTER DHA 11680 TAKRI LETTER ADOGRA LETTER PHA 1169F TAKRI LETTER PHADOGRA LETTER BHA 116A1 TAKRI LETTER BHADOGRA LETTER YA 116A3 TAKRI LETTER YADOGRA LETTER SHA 0936 DEVANAGARI LETTER SHADOGRA LETTER SSA 111B0 SHARADA LETTER SSADOGRA VOWEL SIGN U 0941 DEVANAGARI VOWEL SIGN UDOGRA VOWEL SIGN UU 0942 DEVANAGARI VOWEL SIGN UUDOGRA VOWEL SIGN E 116B2 TAKRI VOWEL SIGN EDOGRA VOWEL SIGN AI 116B3 TAKRI VOWEL SIGN AIDOGRA VOWEL SIGN O 116B4 TAKRI VOWEL SIGN ODOGRA VOWEL SIGN AU 116B5 TAKRI VOWEL SIGN AUDOGRA SIGN ANUSVARA 0902 DEVANAGARI SIGN ANUSVARADOGRA SIGN VIRAMA 094D DEVANAGARI SIGN VIRAMADOGRA SIGN NUKTA 093C DEVANAGARI SIGN NUKTADOGRA ABBREVIATION SIGN 0970 DEVANAGARI ABBREVIATION SIGN

5 References

Bhāskarācārya. 1872. Līlavatī. Translation into Dogri. Jammu: Vidya Vilas Press.

Grierson, George A. 1916. The Linguistic Survey of India. Vol. IX. Indo-Aryan Family. Central Group. Part I. Specimens of Western Hindī and Pañjābī. Calcutta: Office of the Superintendent of Government Printing, India.

Leitner, Gottlieb William. 1882. “A Collection of Specimens of Commercial and Other Alphabets and Handwritings as also of Multiplication Tables Current in Various Parts of the Panjab, Sind and the North West Provinces”. Published as Appendix V of History of Indigenous Education in the Punjab. Lahore: Anjuman-i-Punjab Press.

Pandey, Anshuman. 2007. “Proposal to Encode North Indic Number Forms in ISO/IEC 10646” (L2/07-354). http://www.unicode.org/L2/L2007/07354-north-indic.pdf

Pandey, Anshuman. 2009. “Proposal to Encode the Takri Script in ISO/IEC 10646” (L2/09-424). http://www.unicode.org/L2/L2009/09424-takri.pdf

Pandey, Anshuman. 2015. “Preliminary Proposal to Encode the Dogra Script in Unicode” (L2/15-213). http://www.unicode.org/L2/L2015/15213-dogra.pdf

Staal, Fritz. 1984. Stamps of Jammu & Kashmir. New York: The Collectors Club.

von der Lin, Carol A. “Collection Kashmir”. “Philatelic Glossary: Dōgrī-Takrī Scripts”. http://www.kashmirstamps.com/DogriGloss.html

6 Acknowledgments

I would like to thank Christopher Shackle (School of Oriental and African Studies, London) for reviewing this proposal.

This project was made possible in part through a Google Research Award, granted to Deborah Anderson for the Script Encoding Initiative, and a grant from the United States National Endowment for the Humanities (PR-50205-15), which funds the Universal Scripts Project (part of the Script Encoding Initiative at the University of California, Berkeley). Any views, findings, conclusions, or recommendations expressed in this publication do not necessarily reflect those of Google or the National Endowment for the Humanities.

[Code chart: Dogra, range 11800–1184F. Columns 1180–1184, rows 0–F. Code points 11800–1183A are assigned.]

Dogra 11800–1183A

Punctuation
1183A  DOGRA ABBREVIATION SIGN

Independent vowels
11800  DOGRA LETTER A
11801  DOGRA LETTER AA
11802  DOGRA LETTER I
11803  DOGRA LETTER II
11804  DOGRA LETTER U
11805  DOGRA LETTER UU
11806  DOGRA LETTER E
11807  DOGRA LETTER AI
11808  DOGRA LETTER O
11809  DOGRA LETTER AU

Consonants
1180A  DOGRA LETTER KA
1180B  DOGRA LETTER KHA
1180C  DOGRA LETTER GA
1180D  DOGRA LETTER GHA
1180E  DOGRA LETTER NGA
1180F  DOGRA LETTER CA
11810  DOGRA LETTER CHA
11811  DOGRA LETTER JA
11812  DOGRA LETTER JHA
11813  DOGRA LETTER NYA
11814  DOGRA LETTER TTA
11815  DOGRA LETTER TTHA
11816  DOGRA LETTER DDA
11817  DOGRA LETTER DDHA
11818  DOGRA LETTER NNA
11819  DOGRA LETTER TA
1181A  DOGRA LETTER THA
1181B  DOGRA LETTER DA
1181C  DOGRA LETTER DHA
1181D  DOGRA LETTER NA
1181E  DOGRA LETTER PA
1181F  DOGRA LETTER PHA
11820  DOGRA LETTER BA
11821  DOGRA LETTER BHA
11822  DOGRA LETTER MA
11823  DOGRA LETTER YA
11824  DOGRA LETTER RA
11825  DOGRA LETTER LA
11826  DOGRA LETTER VA
11827  DOGRA LETTER SHA
11828  DOGRA LETTER SSA
11829  DOGRA LETTER SA
1182A  DOGRA LETTER HA
1182B  DOGRA LETTER RRA

Dependent vowel signs
1182C  DOGRA VOWEL SIGN AA
1182D  DOGRA VOWEL SIGN I
1182E  DOGRA VOWEL SIGN II
1182F  DOGRA VOWEL SIGN U
11830  DOGRA VOWEL SIGN UU
11831  DOGRA VOWEL SIGN VOCALIC R
11832  DOGRA VOWEL SIGN E
11833  DOGRA VOWEL SIGN AI
11834  DOGRA VOWEL SIGN O
11835  DOGRA VOWEL SIGN AU

Various signs
11836  DOGRA SIGN ANUSVARA
11837  DOGRA SIGN VISARGA
11838  DOGRA SIGN VIRAMA
11839  DOGRA SIGN NUKTA

Figure 1: Comparison of Old Dogra with Takri and other scripts (from Leitner 1882: Set 2).

Figure 2: Comparison of Old Dogra with Takri and other scripts (from Leitner 1882: Set 2).

Figure 3: Specimen of the Dogri language written in Old Dogra (from Grierson 1916: 760).

Figure 4: Another specimen of the Dogri language written in Old Dogra (from Grierson 1916: 772).

Figure 5: Chart of the Old Dogra script (from Grierson 1916: 641).

Figure 6: Comparison of Gurmukhi, Kangra, and Old Dogra scripts (from Grierson 1916: 642).

Figure 7: Comparison of New Dogra (‘Dogri’) and Devanagari vowel and consonant letters (from Staal 1984: 33).

Figure 8: Comparison of New Dogra (‘Dogri’) and Devanagari vowel signs (from Staal 1984: 34).

Figure 9: The first page of a Dogri translation of the Līlavatī by Bhāskarācārya, a Sanskrit treatise on mathematics, printed in the New Dogra script in 1872 (p. 1). Figure 10 contains the second page of this book.

Figure 10: Another page from the Līlavatī in New Dogra (p. 2).

Figure 11: Page from the Līlavatī showing the use of printed New Dogra digits (p. 3).

Figure 12: Usage of fractions for denoting currency in New Dogra (from Staal 1984: 75). The text appears to be typeset using metal fonts. The fraction signs and currency marks may be represented using characters from the ‘Common Indic Number Forms’ block (U+A830) in Unicode.

Figure 13: Two specimens of a trilingual stamp for expedited mail from Jammu State in Old Dogra, Persian, and Devanagari scripts. The Dogra text reads <KA, KA, LA> <JA, RA, VOWEL SIGN U, RA, VOWEL SIGN II> kakal jarurī. The Persian reads خط ضروری ḵẖat zarūrī; the Devanagari reads आवश्यक पत्र āvaśyaka patra.

Figure 14: Four bilingual regular postage stamps from Jammu State with text in the Old Dogra and Persian scripts. The set consists of half ‘anna’ stamps and a one ‘anna’ stamp. In all stamps the Dogra text at the top reads <JA, MA, U> jamu ‘Jammu’ and <KA, CHA, MA, VOWEL SIGN II, RA> kachamīr (= kaśmīr) ‘Kashmir’. In the half ‘anna’ stamps the first line of the center is Persian ۱۹۲۳ نم انه 1923 [samvat] nim ānah ‘half ānnā’; the second line of the center is Dogra <A, DA, A, NA> ada ana ‘half ānnā’. In the one ‘anna’ stamp the first line of the center text is Persian یك انه yak ānah ‘one ānnā’; the second line of the center is Dogra <E, KA, A, NA> ekana ‘one ānnā’. In all four stamps the third line of the center is the date 1923 [samvat], written in Devanagari-like digits १९२३. The Persian text at the bottom of the stamp reads قلمرو سرکار جمون و کشمیر qalamrau sarkār jammūn o kaśmīr ‘dominions of the ruler of Jammu and Kashmir’.

Detail of top left (Devanagari)

Detail of top right (New Dogra)

Detail of center (New Dogra, Devanagari, and Urdu)

Figure 15: The mast of a two rupee non-judicial stamp paper from Kashmir State.

Figure 16: The mast of a five rupee non-judicial stamp paper from Kashmir state containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.

Figure 17: A one anna stamp from Jammu and Kashmir state with text in New Dogra, Devanagari, and Arabic (Urdu).

Figure 18: Telegraph stamps from Jammu and Kashmir state with text in Dogra, Devanagari, Urdu, and English. The Dogra denominations show usage of Common Indic fractions and the rupee mark.

postakarda kalamrau jammu kasmir va tibbataha
“Postcard of the realm of Jammu, Kashmir, and Tibet”

isa tarapha siva pate ke kucha na likho
“Do not write anything on this side except for the address”

jamu, sinagar
“Jammu”, “Srinagar”

bhaisakh (4 ), 4 vaisakh
“Vaisakh 4”, “4 Vaisakh”

Figure 19: A state postcard from Jammu and Kashmir with post mark of 19 April 1891. The inscription at top is in New Dogra and the content is in Persian. The postmarks are in English and Dogra.

One rupee banknote

Twenty rupee banknote

Figure 20: Bank notes from Jammu and Kashmir State containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.

Vowel letter   Chamba    Dogra
A              U+11680   U+11800
AA             U+11681   U+11801
I              U+11682   U+11802
II             U+11683   none
U              U+11684   U+11804
UU             U+11685   none
E              U+11686   U+11806
AI             U+11687   U+11807
O              U+11688   U+11808
AU             U+11689   none

Vowel sign     Chamba    Dogra
A              no dependent form
AA             U+116AD   U+1182C
I              U+116AE   U+1182D
II             U+116AF   U+1182E
U              U+116B0   U+1182F
UU             U+116B1   U+11830
E              U+116B2   U+11832
AI             U+116B3   U+11833
O              U+116B4   U+11834
AU             U+116B5   U+11835

Table 2: Comparison of Chamba and Dogra vowel letters and signs. Glyphs are identified here by the code points of the characters they represent.

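Consonant   Chamba    Dogra
KA          U+1168A   U+1180A
KHA         U+1168B   U+1180B
GA          U+1168C   U+1180C
GHA         U+1168D   U+1180D
NGA         U+1168E   U+1180E
CA          U+1168F   U+1180F
CHA         U+11690   U+11810
JA          U+11691   U+11811
JHA         U+11692   U+11812
NYA         U+11693   U+11813
TTA         U+11694   U+11814
TTHA        U+11695   U+11815
DDA         U+11696   U+11816
DDHA        U+11697   U+11817
NNA         U+11698   U+11818
TA          U+11699   U+11819
THA         U+1169A   U+1181A
DA          U+1169B   U+1181B
DHA         U+1169C   U+1181C
NA          U+1169D   U+1181D
PA          U+1169E   U+1181E
PHA         U+1169F   U+1181F
BA          U+116A0   U+11820
BHA         U+116A1   U+11821
MA          U+116A2   U+11822
YA          U+116A3   U+11823
RA          U+116A4   U+11824
LA          U+116A5   U+11825
VA          U+116A6   U+11826
SHA         U+116A7   U+11827
SSA         none      U+11828
SA          U+116A8   U+11829
HA          U+116A9   U+1182A
RRA         U+116AA   U+1182B

Table 3: Comparison of Chamba and Dogra consonant letters. Glyphs are identified here by the code points of the characters they represent.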

ISO/IEC JTC 1/SC 2/WG 2
PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646

Please fill all the sections A, B and C below. Please read Principles and Procedures Document (P & P) from http://std.dkuug.dk/JTC1/SC2/WG2/docs/principles.html for guidelines and details before filling this form. Please ensure you are using the latest Form from http://std.dkuug.dk/JTC1/SC2/WG2/docs/summaryform.html. See also http://std.dkuug.dk/JTC1/SC2/WG2/docs/roadmaps.html for latest Roadmaps.

Form number: N4502-F (Original 1994-10-14; Revised 1995-01, 1995-04, 1996-04, 1996-08, 1999-03, 2001-05, 2001-09, 2003-11, 2005-01, 2005-09, 2005-10, 2007-03, 2008-05, 2009-11, 2011-03, 2012-01)

A. Administrative

1. Title: Proposal to encode the Dogra script in Unicode
2. Requester's name: Anshuman Pandey, Script Encoding Initiative
3. Requester type (Member body/Liaison/Individual contribution): Liaison contribution
4. Submission date: 4 November 2015
5. Requester's reference (if applicable):
6. Choose one of the following:
   This is a complete proposal: Yes
   (or) More information will be provided later:

B. Technical – General

1. Choose one of the following:
   a. This proposal is for a new script (set of characters): Yes
      Proposed name of script: Dogra
   b. The proposal is for addition of character(s) to an existing block:
      Name of the existing block:
2. Number of characters in proposal: 59
3. Proposed category (select one from below - see section 2.2 of P&P document):
   A-Contemporary
   B.1-Specialized (small collection)
   B.2-Specialized (large collection)
   C-Major extinct
   D-Attested extinct
   E-Minor extinct: X
   F-Archaic Hieroglyphic or Ideographic
   G-Obscure or questionable usage symbols
4. Is a repertoire including character names provided? Yes
   a. If YES, are the names in accordance with the “character naming guidelines” in Annex L of P&P document?
   b. Are the character shapes attached in a legible form suitable for review?
5. Fonts related:
   a. Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the standard? Anshuman Pandey
   b. Identify the party granting a license for use of the font by the editors (include address, e-mail, ftp-site, etc.): Anshuman Pandey
6. References:
   a. Are references (to other character sets, dictionaries, descriptive texts etc.) provided? Yes
   b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? Yes
7. Special encoding issues: Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration etc. (if yes please enclose information)? Yes
8. Additional Information: Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org for such information on other scripts. Also see Unicode Character Database (http://www.unicode.org/reports/tr44/) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard.

C. Technical - Justification

1. Has this proposal for addition of character(s) been submitted before? No
   If YES explain:
2. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? Yes
   If YES, with whom? Christopher Shackle (SOAS, London)
   If YES, available relevant documents:
3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? No
   Reference: Historical script; currently not used.
4. The context of use for the proposed characters (type of use; common or rare): Common
   Reference:
5. Are the proposed characters in current use by the user community? No
   If YES, where? Reference:
6. After giving due considerations to the principles in the P&P document must the proposed characters be entirely in the BMP? N/A
   If YES, is a rationale provided? If YES, reference:
7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? Yes
8. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? No
   If YES, is a rationale for its inclusion provided? If YES, reference:
9. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? No
   If YES, is a rationale for its inclusion provided? If YES, reference:
10. Can any of the proposed character(s) be considered to be similar (in appearance or function) to, or could be confused with, an existing character? No
    If YES, is a rationale for its inclusion provided? If YES, reference:
11. Does the proposal include use of combining characters and/or use of composite sequences? Yes
    If YES, is a rationale for such use provided? Yes
    If YES, reference: See text of proposal
    Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided?
    If YES, reference:
12. Does the proposal contain characters with any special properties such as control function or similar semantics? Yes
    If YES, describe in detail (include attachment if necessary): Virama; see text of proposal
13. Does the proposal contain any Ideographic compatibility characters? No
    If YES, are the equivalent corresponding unified ideographic characters identified? If YES, reference:

• Half-forms: Another method is to use half-forms of letters, if the graphical structure provides such an opportunity.

The above shows the conjunct śra in the word śrī (a Sanskrit honorific). The half-form is produced by dropping the right vertical bar of SHA (U+11827). The conjunct śra is used consistently in New Dogra for representing the cluster śra; the form <SHA, VIRAMA, RA> with visible halanta beneath a full form of SHA does not occur in the available sources.

• Ligatures: There are two types of ligature. In the first, the shapes of individual letters are visible. In the below, the boxed text contains the conjunct sṭa from the word sṭāṃp, a transcription of the English “stamp”.

The second type is an atomic (akhaṇḍa) ligature, whose form may be considered a distinctive letter. Such a ligature occurs in the excerpt below.

The boxed text above contains the atomic ligature kṣa, which occurs in the word akṣaroṃ “in letters” (oblique form of akṣara “letter”). The conjunct kṣa is used consistently in New Dogra for representing the cluster kṣa. Its shape may be influenced by Devanagari kṣa.

• Forms of RA: Special conjunct forms of RA (U+11824), e.g. repha or vattu, do not generally occur. Cluster-initial RA is represented as a full form with halanta, as in rṇa in the word saṃpūrṇa below,

and non-initial RA is treated similarly, as in pra in the word prabhu below.

However, there is one instance in the available sources in which cluster-initial RA is represented as a repha. The occurrence is curious because the word caturbhuja “square”, in which repha occurs, appears three times in proximity, but is spelled differently in all three instances.

In the boxed text at the top, the RA in caturbhuja is written with halanta: <CA, TA, VOWEL SIGN U, RA, VIRAMA, BHA, VOWEL SIGN U, JA> caturbhuja. In the text at bottom left, RA occurs with its inherent vowel intact, without halanta: <CA, TA, VOWEL SIGN U, RA, BHA, VOWEL SIGN U, JA> caturabhuja. At bottom right, ra is represented as repha in caturbhuja.

The usage of repha in Dogra is to be considered anomalous. There is no present need to support this form of RA in encoded representation.

3.1.2.2 Encoded representation of conjuncts

The Devanagari model for the representation of conjuncts is appropriate for Dogra. In encoded text, conjuncts are produced by placing the control character DOGRA SIGN VIRAMA (U+11838) after each non-final consonant in a cluster, i.e. <(C, VIRAMA), C>.

Representations of conjuncts rendered as atomic ligatures are:

kṣa   <U+1180A KA, U+11838 VIRAMA, U+11828 SSA>

sṭa   <U+11829 SA, U+11838 VIRAMA, U+11814 TTA>

Examples of conjuncts rendered using half-forms are:

śra   <U+11827 SHA, U+11838 VIRAMA, U+11824 RA>

Conjuncts displayed using visible halanta are:

pra   <U+1181E PA, U+11838 VIRAMA, U+11824 RA>

rṇa   <U+11824 RA, U+11838 VIRAMA, U+11818 NNA>
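The encoding rule for conjuncts can be sketched in Python as follows. This is only an illustration under the assumption that the code points are those allocated in this proposal; `encode_cluster` is a hypothetical helper, not an existing API.

```python
# Code points as allocated in this proposal (an assumption for illustration).
VIRAMA = 0x11838
CONSONANTS = range(0x1180A, 0x1182C)  # DOGRA LETTER KA..DOGRA LETTER RRA

KA, SSA, SHA, RA = 0x1180A, 0x11828, 0x11827, 0x11824


def encode_cluster(consonants):
    """Return the code point sequence for a consonant cluster.

    A VIRAMA is placed after every consonant except the last one.
    """
    if not consonants:
        raise ValueError("a cluster needs at least one consonant")
    for cp in consonants:
        if cp not in CONSONANTS:
            raise ValueError(f"U+{cp:04X} is not a Dogra consonant")
    sequence = []
    for cp in consonants[:-1]:
        sequence.extend((cp, VIRAMA))
    sequence.append(consonants[-1])
    return sequence


def as_text(code_points):
    """Convert a list of code points to a Python string."""
    return "".join(chr(cp) for cp in code_points)
```

Whether the resulting sequence is displayed as a ligature, a half-form, or with a visible halanta is left to the font.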

3.1.2.3 Modifying conjunct formation

As is the case in Devanagari and other Indic scripts, a ligature may be broken using the generic control character U+200C ZERO WIDTH NON-JOINER (ZWNJ):

kṣa (ligature)          <U+1180A KA, U+11838 VIRAMA, U+11828 SSA>

kṣa (visible virama)    <U+1180A KA, U+11838 VIRAMA, U+200C ZWNJ, U+11828 SSA>

The rendering of a conjunct as a ligature or with visible virama is a matter of font design.
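A small Python sketch of the ZWNJ mechanism, assuming the proposal's code points: it inserts U+200C after every virama that is directly followed by a consonant, which requests the non-ligated form. The function name is hypothetical.

```python
ZWNJ = 0x200C                         # ZERO WIDTH NON-JOINER
VIRAMA = 0x11838                      # DOGRA SIGN VIRAMA (proposal allocation)
CONSONANTS = range(0x1180A, 0x1182C)  # DOGRA LETTER KA..DOGRA LETTER RRA


def break_ligatures(code_points):
    """Insert ZWNJ after each VIRAMA that is directly followed by a consonant."""
    result = []
    for i, cp in enumerate(code_points):
        result.append(cp)
        has_next = i + 1 < len(code_points)
        if cp == VIRAMA and has_next and code_points[i + 1] in CONSONANTS:
            result.append(ZWNJ)
    return result
```

Applying the function twice gives the same result as applying it once, because a virama already followed by ZWNJ is left alone.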

3.1.3 Punctuation

Abbreviation sign: The DOGRA ABBREVIATION SIGN (U+1183A) is used for indicating truncation of text. In the excerpt below, the highlighted text <LA, VOWEL SIGN II, ABBREVIATION SIGN> <VA, ABBREVIATION SIGN> <BHA, VOWEL SIGN AA, ABBREVIATION SIGN> (lī va bhā 23) is an abbreviation for līla vatī bhāga 23.

Danda-s: The daṇḍā and double daṇḍā are commonly used in Dogra documents for marking the ends of sentences and paragraphs. The below excerpt shows usage of the daṇḍā,

and the following shows use of the double daṇḍā.

These daṇḍā-s are not proposed for encoding as separate characters in the ‘Dogra’ block. They resemble U+0964 DEVANAGARI DANDA and U+0965 DEVANAGARI DOUBLE DANDA and may be represented using these characters. The Devanagari daṇḍā-s are to be specified as extensions for Dogra.

Other signs: Other marks of punctuation, such as dashes, are also used.

A hyphen is used in some cases for indicating breaks in a word at end of line (see section 3.1.6).

3.1.4 Digits

Digits are commonly used in Dogra documents (see figure 11). The style of digits varies across written and printed sources. Some digits in Old Dogra sources resemble those of Takri (see figure 4), while those in New Dogra sources are similar to Devanagari. The excerpt below shows digits 1–9 and the number 10 as represented in the Līlavatī, written in New Dogra.

The majority of the digits shown here resemble Devanagari forms; the exception is ‘four’. The form of Dogra ‘four’ does not resemble U+116C4 TAKRI DIGIT FOUR, which is nearly identical to the Devanagari digit four. Information about its origins is not known at this time, but it may be related to U+111D4 SHARADA DIGIT FOUR. The forms of ‘eight’ and ‘nine’ are well-known variant forms of the regular Devanagari digits.

Additional research into Dogra digits is required before they may be considered for encoding. The available sources suggest that there may have been a cultural understanding that Dogra possessed unique digits. For instance, on the bank notes shown in figure 20, the digits written in Devanagari are rather similar, but are nonetheless displayed twice in order to align the display of the currency value with each script used for the inscriptions.

At present, script-specific digits are not proposed for inclusion in the Dogra repertoire. Devanagari digits should be used normatively and should be extended for usage with Dogra. If future research yields distinctive digits for Dogra, there is sufficient space in the block to accommodate their encoding.

3.1.5 Number forms

Fraction signs and currency marks are commonly used in Dogra sources (see figure 12). These may be represented using the fraction and currency signs already encoded in the ‘Common Indic Number Forms’ block (U+A830). These characters are specified as extensions for Dogra. The ‘Common Indic Number Forms’ are described briefly in The Unicode Standard (chapter 22.3, ‘Numerals’, pp. 770–771) and in more detail in the original encoding proposal (L2/07-354).

3.1.6 Linebreaking

Linebreaking occurs after an orthographic syllable. Breaks are generally not indicated in informal sources, but are marked in literary materials using hyphens.
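The notion of breaking after an orthographic syllable can be sketched with a regular expression in Python. The syllable pattern below is an assumption made for illustration (a consonant cluster joined by virama, or an independent vowel, followed by optional signs); it uses the code points of this proposal and is not a normative definition.

```python
import re

# Character classes use the code points allocated in this proposal
# (an assumption; a published standard may assign different values).
CONSONANT = "[\U0001180A-\U0001182B]"    # KA..RRA
INDEPENDENT = "[\U00011800-\U00011809]"  # A..AU
VOWEL_SIGN = "[\U0001182C-\U00011835]"   # AA..AU
FINAL = "[\U00011836\U00011837]"         # ANUSVARA, VISARGA
VIRAMA = "\U00011838"
NUKTA = "\U00011839"
ZWNJ = "\u200C"

# One syllable: a cluster C (VIRAMA C)* with an optional vowel sign or final
# virama, or an independent vowel; either may carry anusvara or visarga.
# Any other character is returned on its own.
SYLLABLE = re.compile(
    f"(?:{CONSONANT}{NUKTA}?"
    f"(?:{VIRAMA}{ZWNJ}?{CONSONANT}{NUKTA}?)*"
    f"(?:{VIRAMA}|{VOWEL_SIGN})?"
    f"|{INDEPENDENT}){FINAL}*"
    "|.",
    re.DOTALL,
)


def syllables(text):
    """Split text into orthographic syllables (illustrative pattern only)."""
    return [m.group(0) for m in SYLLABLE.finditer(text)]


def break_opportunities(text):
    """Return string offsets after which a line break may occur."""
    offsets, position = [], 0
    for syllable in syllables(text)[:-1]:
        position += len(syllable)
        offsets.append(position)
    return offsets
```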

3.1.7 Collation

The primary collating order for Dogra is as follows:

A < AA < I < II < U < UU < E < AI < O < AU <

KA < KHA < GA < GHA < NGA < CA < CHA < JA <

JHA < NYA < TTA < TTHA < DDA < DDHA < NNA < TA <

THA < DA < DHA < NA < PA < PHA < BA < BHA <

MA < YA < RA < LA < VA < SHA < SSA < SA < HA <

RRA < VOWEL SIGN AA < VOWEL SIGN I < VOWEL SIGN II <

VOWEL SIGN U < VOWEL SIGN UU < VOWEL SIGN VOCALIC R < VOWEL SIGN E <

VOWEL SIGN AI < VOWEL SIGN O < VOWEL SIGN AU

The following characters have secondary weights: SIGN ANUSVARA (U+11836), SIGN VISARGA (U+11837), SIGN VIRAMA (U+11838), SIGN NUKTA (U+11839).
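A simplified Python sketch of a sort key based on this ordering. It assumes the proposal's code points, in which the primary order runs from U+11800 to U+11835 in code point order and U+11836 through U+11839 carry secondary weights. It is a two-level approximation for illustration, not an implementation of the Unicode Collation Algorithm.

```python
# Primary order: letters and vowel signs, U+11800..U+11835 (proposal allocation).
PRIMARY_ORDER = list(range(0x11800, 0x11836))
PRIMARY_WEIGHT = {cp: index + 1 for index, cp in enumerate(PRIMARY_ORDER)}

# Characters with secondary weights: anusvara, visarga, virama, nukta.
SECONDARY = {0x11836, 0x11837, 0x11838, 0x11839}


def sort_key(word):
    """Two-level key: primary weights first, secondary marks as a tie-breaker.

    Characters outside the Dogra block get primary weight 0 (an arbitrary
    choice for this sketch).
    """
    primary = tuple(
        PRIMARY_WEIGHT.get(ord(ch), 0) for ch in word if ord(ch) not in SECONDARY
    )
    secondary = tuple(
        (index, ord(ch)) for index, ch in enumerate(word) if ord(ch) in SECONDARY
    )
    return (primary, secondary)
```

With this key, `sorted(words, key=sort_key)` orders a word before the same word with an added anusvara, and both before any word that differs at the primary level.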

4 Character data

4.1 Character properties

The properties for Dogra in the Unicode Character Database format are:

11800;DOGRA LETTER A;Lo;0;L;;;;;N;;;;;
11801;DOGRA LETTER AA;Lo;0;L;;;;;N;;;;;
11802;DOGRA LETTER I;Lo;0;L;;;;;N;;;;;
11804;DOGRA LETTER U;Lo;0;L;;;;;N;;;;;
11806;DOGRA LETTER E;Lo;0;L;;;;;N;;;;;
11807;DOGRA LETTER AI;Lo;0;L;;;;;N;;;;;
11808;DOGRA LETTER O;Lo;0;L;;;;;N;;;;;
1180A;DOGRA LETTER KA;Lo;0;L;;;;;N;;;;;
1180B;DOGRA LETTER KHA;Lo;0;L;;;;;N;;;;;
1180C;DOGRA LETTER GA;Lo;0;L;;;;;N;;;;;
1180D;DOGRA LETTER GHA;Lo;0;L;;;;;N;;;;;
1180E;DOGRA LETTER NGA;Lo;0;L;;;;;N;;;;;
1180F;DOGRA LETTER CA;Lo;0;L;;;;;N;;;;;
11810;DOGRA LETTER CHA;Lo;0;L;;;;;N;;;;;
11811;DOGRA LETTER JA;Lo;0;L;;;;;N;;;;;
11812;DOGRA LETTER JHA;Lo;0;L;;;;;N;;;;;
11813;DOGRA LETTER NYA;Lo;0;L;;;;;N;;;;;
11814;DOGRA LETTER TTA;Lo;0;L;;;;;N;;;;;
11815;DOGRA LETTER TTHA;Lo;0;L;;;;;N;;;;;
11816;DOGRA LETTER DDA;Lo;0;L;;;;;N;;;;;
11817;DOGRA LETTER DDHA;Lo;0;L;;;;;N;;;;;
11818;DOGRA LETTER NNA;Lo;0;L;;;;;N;;;;;
11819;DOGRA LETTER TA;Lo;0;L;;;;;N;;;;;
1181A;DOGRA LETTER THA;Lo;0;L;;;;;N;;;;;
1181B;DOGRA LETTER DA;Lo;0;L;;;;;N;;;;;
1181C;DOGRA LETTER DHA;Lo;0;L;;;;;N;;;;;
1181D;DOGRA LETTER NA;Lo;0;L;;;;;N;;;;;
1181E;DOGRA LETTER PA;Lo;0;L;;;;;N;;;;;
1181F;DOGRA LETTER PHA;Lo;0;L;;;;;N;;;;;
11820;DOGRA LETTER BA;Lo;0;L;;;;;N;;;;;
11821;DOGRA LETTER BHA;Lo;0;L;;;;;N;;;;;
11822;DOGRA LETTER MA;Lo;0;L;;;;;N;;;;;
11823;DOGRA LETTER YA;Lo;0;L;;;;;N;;;;;
11824;DOGRA LETTER RA;Lo;0;L;;;;;N;;;;;
11825;DOGRA LETTER LA;Lo;0;L;;;;;N;;;;;
11826;DOGRA LETTER VA;Lo;0;L;;;;;N;;;;;
11827;DOGRA LETTER SHA;Lo;0;L;;;;;N;;;;;
11828;DOGRA LETTER SSA;Lo;0;L;;;;;N;;;;;
11829;DOGRA LETTER SA;Lo;0;L;;;;;N;;;;;
1182A;DOGRA LETTER HA;Lo;0;L;;;;;N;;;;;
1182B;DOGRA LETTER RRA;Lo;0;L;;;;;N;;;;;
1182C;DOGRA VOWEL SIGN AA;Mc;0;L;;;;;N;;;;;
1182D;DOGRA VOWEL SIGN I;Mc;0;L;;;;;N;;;;;
1182E;DOGRA VOWEL SIGN II;Mc;0;L;;;;;N;;;;;
1182F;DOGRA VOWEL SIGN U;Mn;0;NSM;;;;;N;;;;;
11830;DOGRA VOWEL SIGN UU;Mn;0;NSM;;;;;N;;;;;
11831;DOGRA VOWEL SIGN VOCALIC R;Mn;0;NSM;;;;;N;;;;;
11832;DOGRA VOWEL SIGN E;Mn;0;NSM;;;;;N;;;;;
11833;DOGRA VOWEL SIGN AI;Mn;0;NSM;;;;;N;;;;;
11834;DOGRA VOWEL SIGN O;Mn;0;NSM;;;;;N;;;;;
11835;DOGRA VOWEL SIGN AU;Mn;0;NSM;;;;;N;;;;;
11836;DOGRA SIGN ANUSVARA;Mn;0;NSM;;;;;N;;;;;
11837;DOGRA SIGN VISARGA;Mc;0;L;;;;;N;;;;;
11838;DOGRA SIGN VIRAMA;Mn;9;NSM;;;;;N;;;;;
11839;DOGRA SIGN NUKTA;Mn;7;NSM;;;;;N;;;;;
1183A;DOGRA ABBREVIATION SIGN;Po;0;L;;;;;N;;;;;
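For readers who wish to process such entries, the following Python sketch parses a few lines in the semicolon-separated UnicodeData.txt layout (code point; name; general category; canonical combining class; bidirectional class; further fields). The sample lines are taken from the properties above; the `Record` type and `parse` function are hypothetical names used only for this illustration.

```python
from collections import Counter, namedtuple

# Only the first five UnicodeData.txt fields are kept in this sketch.
Record = namedtuple("Record", "code name general_category combining_class bidi_class")

SAMPLE = """\
1180A;DOGRA LETTER KA;Lo;0;L;;;;;N;;;;;
1182C;DOGRA VOWEL SIGN AA;Mc;0;L;;;;;N;;;;;
1182F;DOGRA VOWEL SIGN U;Mn;0;NSM;;;;;N;;;;;
11838;DOGRA SIGN VIRAMA;Mn;9;NSM;;;;;N;;;;;
11839;DOGRA SIGN NUKTA;Mn;7;NSM;;;;;N;;;;;
"""


def parse(text):
    """Parse UnicodeData.txt-style lines into Record tuples."""
    records = []
    for line in text.splitlines():
        if not line.strip():
            continue
        fields = line.split(";")
        records.append(
            Record(int(fields[0], 16), fields[1], fields[2], int(fields[3]), fields[4])
        )
    return records


records = parse(SAMPLE)
category_counts = Counter(record.general_category for record in records)
```

The combining classes 9 (virama) and 7 (nukta) are the values that matter for canonical reordering; all other Dogra characters listed have class 0.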

4.2 Linebreaking

Linebreaking properties, given in the data format of LineBreak.txt:

11800..1182B;AL # Lo  [44] DOGRA LETTER A..DOGRA LETTER RRA
1182C..1182E;CM # Mc   [3] DOGRA VOWEL SIGN AA..DOGRA VOWEL SIGN II
1182F..11835;CM # Mn   [7] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN AU
11836;CM        # Mn       DOGRA SIGN ANUSVARA
11837;CM        # Mc       DOGRA SIGN VISARGA
11838..11839;CM # Mn   [2] DOGRA SIGN VIRAMA..DOGRA SIGN NUKTA
1183A;AL        # Po       DOGRA ABBREVIATION SIGN

4.3 Syllabic categories

Syllabic categories, given in the format of IndicSyllabicCategory.txt:

# Indic_Syllabic_Category=Bindu
11836         ; Bindu             # Mn       DOGRA SIGN ANUSVARA

# Indic_Syllabic_Category=Visarga
11837         ; Visarga           # Mc       DOGRA SIGN VISARGA

# Indic_Syllabic_Category=Nukta
11839         ; Nukta             # Mn       DOGRA SIGN NUKTA

# Indic_Syllabic_Category=Virama
11838         ; Virama            # Mn       DOGRA SIGN VIRAMA

# Indic_Syllabic_Category=Vowel_Independent
11800..11808  ; Vowel_Independent # Lo   [3] DOGRA LETTER A..DOGRA LETTER AU

# Indic_Syllabic_Category=Vowel_Dependent
1182C..1182E  ; Vowel_Dependent   # Mc   [3] DOGRA VOWEL SIGN AA..DOGRA VOWEL SIGN II
1182F..11835  ; Vowel_Dependent   # Mn   [7] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN AU

# Indic_Syllabic_Category=Consonant
1180A..1182B  ; Consonant         # Lo  [34] DOGRA LETTER KA..DOGRA LETTER RRA

44 Positional categories

Positional data for characters in the format of IndicPositionalCategorytxt

Indic_Positional_Category=Right1182C Right Mc DOGRA VOWEL SIGN A1182E Right Mc DOGRA VOWEL SIGN II11837 Right Mc DOGRA SIGN VISARGA

Indic_Positional_Category=Left1182D Left Mc DOGRA VOWEL SIGN I

Indic_Positional_Category=Top11836 Top Mn DOGRA SIGN ANUSVARA1183211835 Top Mn [4] DOGRA VOWEL SIGN EDOGRA VOWEL SIGN AU

Indic_Positional_Category=Bottom1182F11831 Bottom Mn [3] DOGRA VOWEL SIGN UDOGRA VOWEL SIGN VOCALIC R11838 Bottom Mn DOGRA SIGN VIRAMA11839 Bottom Mn DOGRA SIGN NUKTA

45 Script extensions

The following characters should be extended for usage with the Dogra script in ScriptExtensionstxt

0964 Po DEVANAGARI DANDA0965 Po DEVANAGARI DOUBLE DANDA

0966096F Nd [10] DEVANAGARI DIGIT ZERODEVANAGARI DIGIT NINE

A830A835 No [6] NORTH INDIC FRACTION ONE QUARTERNORTH INDIC FRACTION THREE SIXTEENTHS

A836A837 So [2] NORTH INDIC QUARTER MARKNORTH INDIC PLACEHOLDER MARKA838 Sc NORTH INDIC RUPEE MARKA839 So NORTH INDIC QUANTITY MARK

13

Proposal to encode the Dogra script in Unicode Anshuman Pandey

46 lsquoConfusablersquo characters

Some Dogra characters that resemble characters encoded in other script blocks are

DOGRA LETTER U 11187 SHARADA LETTER UDOGRA LETTER UU 11188 SHARADA LETTER UUDOGRA LETTER O 11684 TAKRI LETTER UDOGRA LETTER KA 0A15 GURMUKHI LETTER KADOGRA LETTER CA 11196 SHARADA LETTER CADOGRA LETTER TTHA 1119C SHARADA LETTER TTHADOGRA LETTER TA 11699 TAKRI LETTER TADOGRA LETTER DHA 11680 TAKRI LETTER ADOGRA LETTER PHA 1169F TAKRI LETTER PHADOGRA LETTER BHA 116A1 TAKRI LETTER BHADOGRA LETTER YA 116A3 TAKRI LETTER YADOGRA LETTER SHA 0936 DEVANAGARI LETTER SHADOGRA LETTER SSA 111B0 SHARADA LETTER SSADOGRA VOWEL SIGN U 0941 DEVANAGARI VOWEL SIGN UDOGRA VOWEL SIGN UU 0942 DEVANAGARI VOWEL SIGN UUDOGRA VOWEL SIGN E 116B2 TAKRI VOWEL SIGN EDOGRA VOWEL SIGN AI 116B3 TAKRI VOWEL SIGN AIDOGRA VOWEL SIGN O 116B4 TAKRI VOWEL SIGN ODOGRA VOWEL SIGN AU 116B5 TAKRI VOWEL SIGN AUDOGRA SIGN ANUSVARA 0902 DEVANAGARI SIGN ANUSVARADOGRA SIGN VIRAMA 094D DEVANAGARI SIGN VIRAMADOGRA SIGN NUKTA 093C DEVANAGARI SIGN NUKTADOGRA ABBREVIATION SIGN 0970 DEVANAGARI ABBREVIATION SIGN

5 References

Bhāskarācārya 1872 Līlavatī Translation into Dogri Jammu Vidya Vilas Press

Grierson George A 1916 The Linguistic Survey of India Vol IX Indo-Aryan Family Central GroupPart I Specimens of Western Hindī and Pantildejābī Calcutta Office of the Superintendent of GovernmentPrinting India

Leitner Gottlieb William 1882 ldquoA Collection of Specimens of Commercial and Other Alphabets andHandwritings as also of Multiplication Tables Current in Various Parts of the Panjab Sind and the NorthWest Provincesrdquo Published as Appendix V of History of Indigenous Education in the Punjab LahoreAnjuman-i-Punjab Press

Pandey Anshuman 2007 ldquoProposal to Encode North Indic Number Forms in ISOIEC 10646rdquo (L207-354)httpwwwunicodeorgL2L200707354-north-indicpdf

mdashmdashmdash 2009 ldquoProposal to Encode the Takri Script in ISOIEC 10646rdquo (L209-424)httpwwwunicodeorgL2L200909424-takripdf

mdashmdashmdash 2015 ldquoPreliminary Proposal to Encode the Dogra Script in Unicoderdquo (L215-213)httpwwwunicodeorgL2L201515213-dograpdf

Staal Fritz 1984 Stamps of Jammu amp Kashmir New York The Collectors Club

von der Lin Carol A ldquoCollection Kashmirrdquo ldquoPhilatelic Glossary Dōgrī-Takrī ScriptsrdquohttpwwwkashmirstampscomDogriGlosshtml

14

Proposal to encode the Dogra script in Unicode Anshuman Pandey

6 Acknowledgments

I would like to thank Christopher Shackle (School of Oriental and African Studies London) for reviewingthis proposal

This project was made possible in part through a Google Research Award granted to Deborah Anderson forthe Script Encoding Initiative and a grant from the United States National Endowment for the Humanities(PR-50205-15) which funds the Universal Scripts Project (part of the Script Encoding Initiative at the Uni-versity of California Berkeley) Any views findings conclusions or recommendations expressed in thispublication do not necessarily reflect those of Google or the National Endowment for the Humanities

15

Dogra
Range: 11800–1184F

Independent vowels
11800  DOGRA LETTER A
11801  DOGRA LETTER AA
11802  DOGRA LETTER I
11803  DOGRA LETTER II
11804  DOGRA LETTER U
11805  DOGRA LETTER UU
11806  DOGRA LETTER E
11807  DOGRA LETTER AI
11808  DOGRA LETTER O
11809  DOGRA LETTER AU

Consonants
1180A  DOGRA LETTER KA
1180B  DOGRA LETTER KHA
1180C  DOGRA LETTER GA
1180D  DOGRA LETTER GHA
1180E  DOGRA LETTER NGA
1180F  DOGRA LETTER CA
11810  DOGRA LETTER CHA
11811  DOGRA LETTER JA
11812  DOGRA LETTER JHA
11813  DOGRA LETTER NYA
11814  DOGRA LETTER TTA
11815  DOGRA LETTER TTHA
11816  DOGRA LETTER DDA
11817  DOGRA LETTER DDHA
11818  DOGRA LETTER NNA
11819  DOGRA LETTER TA
1181A  DOGRA LETTER THA
1181B  DOGRA LETTER DA
1181C  DOGRA LETTER DHA
1181D  DOGRA LETTER NA
1181E  DOGRA LETTER PA
1181F  DOGRA LETTER PHA
11820  DOGRA LETTER BA
11821  DOGRA LETTER BHA
11822  DOGRA LETTER MA
11823  DOGRA LETTER YA
11824  DOGRA LETTER RA
11825  DOGRA LETTER LA
11826  DOGRA LETTER VA
11827  DOGRA LETTER SHA
11828  DOGRA LETTER SSA
11829  DOGRA LETTER SA
1182A  DOGRA LETTER HA
1182B  DOGRA LETTER RRA

Dependent vowel signs
1182C  DOGRA VOWEL SIGN AA
1182D  DOGRA VOWEL SIGN I
1182E  DOGRA VOWEL SIGN II
1182F  DOGRA VOWEL SIGN U
11830  DOGRA VOWEL SIGN UU
11831  DOGRA VOWEL SIGN VOCALIC R
11832  DOGRA VOWEL SIGN E
11833  DOGRA VOWEL SIGN AI
11834  DOGRA VOWEL SIGN O
11835  DOGRA VOWEL SIGN AU

Various signs
11836  DOGRA SIGN ANUSVARA
11837  DOGRA SIGN VISARGA
11838  DOGRA SIGN VIRAMA
11839  DOGRA SIGN NUKTA

Punctuation
1183A  DOGRA ABBREVIATION SIGN

Printed using UniBook™ (http://www.unicode.org/unibook/). Printed: 04-Nov-2015.

Figure 1: Comparison of Old Dogra with Takri and other scripts (from Leitner 1882: Set 2).

Figure 2: Comparison of Old Dogra with Takri and other scripts (from Leitner 1882: Set 2).

Figure 3: Specimen of the Dogri language written in Old Dogra (from Grierson 1916: 760).

Figure 4: Another specimen of the Dogri language written in Old Dogra (from Grierson 1916: 772).

Figure 5: Chart of the Old Dogra script (from Grierson 1916: 641).

Figure 6: Comparison of Gurmukhi, Kangra, and Old Dogra scripts (from Grierson 1916: 642).

Figure 7: Comparison of New Dogra (‘Dogri’) and Devanagari vowel and consonant letters (from Staal 1984: 33).

Figure 8: Comparison of New Dogra (‘Dogri’) and Devanagari vowel signs (from Staal 1984: 34).

Figure 9: The first page of a Dogri translation of the Līlavatī by Bhāskarācārya, a Sanskrit treatise on mathematics, printed in the New Dogra script in 1872 (p. 1). Figure 10 contains the second page of this book.

Figure 10: Another page from the Līlavatī in New Dogra (p. 2).

Figure 11: Page from the Līlavatī showing the use of printed New Dogra digits (p. 3).

Figure 12: Usage of fractions for denoting currency in New Dogra (from Staal 1984: 75). The text appears to be typeset using metal fonts. The fraction signs and currency marks may be represented using characters from the ‘Common Indic Number Forms’ block (U+A830) in Unicode.

Figure 13: Two specimens of a trilingual stamp for expedited mail from Jammu State in Old Dogra, Persian, and Devanagari scripts. The Dogra text reads <U+1180A U+1180A U+11825> <U+11811 U+11824 U+1182F U+11824 U+1182E> kakal jarurī. The Persian reads خط ضروری ḵẖat zarūrī; the Devanagari reads आवश्यक पत्र āvaśyaka patra.

Figure 14: Four bilingual regular postage stamps from Jammu State with text in the Old Dogra and Persian scripts. The set comprises half ‘anna’ stamps and a one ‘anna’ stamp. In all stamps the Dogra text at the top reads <U+11811 U+11822 U+11804> jamu ‘Jammu’ and <U+1180A U+11810 U+11822 U+1182E U+11824> kachamīr (= kaśmīr) ‘Kashmir’. On the half ānnā stamps the first line of the center is Persian ۱۹۲۳ نم انه 1923 [samvat] nim ānah ‘half ānnā’; the second line of the center is Dogra <U+11800 U+1181B U+11800 U+1181D> ada ana ‘half ānnā’. On the one ānnā stamp the first line of the center text is Persian یك انه yak ānah ‘one ānnā’; the second line of the center is Dogra <U+11806 U+1180A U+11800 U+1181D> ekana ‘one ānnā’. In all four stamps the third line of the center is the date 1923 [samvat] written in Devanagari-like digits १९२३. The Persian text at the bottom of the stamp reads قلمرو سرکار جمون و کشمیر qalamrau sarkār jammūn o kaśmīr ‘dominions of the ruler of Jammu and Kashmir’.

Figure 15: The mast of a two rupee non-judicial stamp paper from Kashmir State. Details shown: top left (Devanagari); top right (New Dogra); center (New Dogra, Devanagari, and Urdu).

Figure 16: The mast of a five rupee non-judicial stamp paper from Kashmir state containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.

Figure 17: A one anna stamp from Jammu and Kashmir state with text in New Dogra, Devanagari, and Arabic (Urdu).

Figure 18: Telegraph stamps from Jammu and Kashmir state with text in Dogra, Devanagari, Urdu, and English. The Dogra denominations show usage of Common Indic fractions and the rupee mark.

Figure 19: A state postcard from Jammu and Kashmir with post mark of 19 April 1891. The inscription at top is in New Dogra and the content is in Persian. The postmarks are in English and Dogra. Transcriptions of the Dogra text:

postakarda kalamrau jammu kasmir va tibbataha: “Postcard of the realm of Jammu, Kashmir, and Tibet”
isa tarapha siva pate ke kucha na likho: “Do not write anything on this side except for the address”
jamu, sinagar: “Jammu”, “Srinagar”
bhaisakh (4), 4 vaisakh: “Vaisakh 4”, “4 Vaisakh”

Figure 20: Bank notes (a one rupee banknote and a twenty rupee banknote) from Jammu and Kashmir State containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.

Vowel letters

Letter  Chamba (Takri)  Dogra
A       U+11680         U+11800
AA      U+11681         U+11801
I       U+11682         U+11802
II      U+11683         none
U       U+11684         U+11804
UU      U+11685         none
E       U+11686         U+11806
AI      U+11687         U+11807
O       U+11688         U+11808
AU      U+11689         none

Vowel signs

Sign    Chamba (Takri)  Dogra
A       no dependent form
AA      U+116AD         U+1182C
I       U+116AE         U+1182D
II      U+116AF         U+1182E
U       U+116B0         U+1182F
UU      U+116B1         U+11830
E       U+116B2         U+11832
AI      U+116B3         U+11833
O       U+116B4         U+11834
AU      U+116B5         U+11835

Table 2: Comparison of Chamba and Dogra vowel letters and signs.

Letter  Chamba (Takri)  Dogra
KA      U+1168A         U+1180A
KHA     U+1168B         U+1180B
GA      U+1168C         U+1180C
GHA     U+1168D         U+1180D
NGA     U+1168E         U+1180E
CA      U+1168F         U+1180F
CHA     U+11690         U+11810
JA      U+11691         U+11811
JHA     U+11692         U+11812
NYA     U+11693         U+11813
TTA     U+11694         U+11814
TTHA    U+11695         U+11815
DDA     U+11696         U+11816
DDHA    U+11697         U+11817
NNA     U+11698         U+11818
TA      U+11699         U+11819
THA     U+1169A         U+1181A
DA      U+1169B         U+1181B
DHA     U+1169C         U+1181C
NA      U+1169D         U+1181D
PA      U+1169E         U+1181E
PHA     U+1169F         U+1181F
BA      U+116A0         U+11820
BHA     U+116A1         U+11821
MA      U+116A2         U+11822
YA      U+116A3         U+11823
RA      U+116A4         U+11824
LA      U+116A5         U+11825
VA      U+116A6         U+11826
SHA     U+116A7         U+11827
SSA     none            U+11828
SA      U+116A8         U+11829
HA      U+116A9         U+1182A
RRA     U+116AA         U+1182B

Table 3: Comparison of Chamba and Dogra consonant letters.

ISO/IEC JTC 1/SC 2/WG 2
PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646

Please fill all the sections A, B and C below. Please read Principles and Procedures Document (P & P) from http://std.dkuug.dk/JTC1/SC2/WG2/docs/principles.html for guidelines and details before filling this form. Please ensure you are using the latest Form from http://std.dkuug.dk/JTC1/SC2/WG2/docs/summaryform.html. See also http://std.dkuug.dk/JTC1/SC2/WG2/docs/roadmaps.html for latest Roadmaps.

A. Administrative

1. Title: Proposal to encode the Dogra script in Unicode
2. Requester's name: Anshuman Pandey (Script Encoding Initiative)
3. Requester type (Member body/Liaison/Individual contribution): Liaison contribution
4. Submission date: 4 November 2015
5. Requester's reference (if applicable):
6. Choose one of the following:
   This is a complete proposal: Yes
   (or) More information will be provided later:

B. Technical – General

1. Choose one of the following:
   a. This proposal is for a new script (set of characters): Yes
      Proposed name of script: Dogra
   b. The proposal is for addition of character(s) to an existing block:
      Name of the existing block:
2. Number of characters in proposal: 59
3. Proposed category (select one from below; see section 2.2 of P&P document):
   A-Contemporary; B.1-Specialized (small collection); B.2-Specialized (large collection); C-Major extinct; D-Attested extinct; E-Minor extinct: X; F-Archaic Hieroglyphic or Ideographic; G-Obscure or questionable usage symbols
4. Is a repertoire including character names provided? Yes
   a. If YES, are the names in accordance with the “character naming guidelines” in Annex L of P&P document?
   b. Are the character shapes attached in a legible form suitable for review?
5. Fonts related:
   a. Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the standard? Anshuman Pandey
   b. Identify the party granting a license for use of the font by the editors (include address, e-mail, ftp-site, etc.): Anshuman Pandey
6. References:
   a. Are references (to other character sets, dictionaries, descriptive texts etc.) provided? Yes
   b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? Yes
7. Special encoding issues: Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration etc. (if yes please enclose information)? Yes
8. Additional Information: Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org for such information on other scripts. Also see Unicode Character Database (http://www.unicode.org/reports/tr44/) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard.

Form number: N4502-F (Original 1994-10-14; Revised 1995-01, 1995-04, 1996-04, 1996-08, 1999-03, 2001-05, 2001-09, 2003-11, 2005-01, 2005-09, 2005-10, 2007-03, 2008-05, 2009-11, 2011-03, 2012-01)

C. Technical – Justification

1. Has this proposal for addition of character(s) been submitted before? No
   If YES, explain:
2. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? Yes
   If YES, with whom? Christopher Shackle (SOAS, London)
   If YES, available relevant documents:
3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? No
   Reference: Historical script, currently not used
4. The context of use for the proposed characters (type of use; common or rare): Common
   Reference:
5. Are the proposed characters in current use by the user community? No
   If YES, where? Reference:
6. After giving due considerations to the principles in the P&P document, must the proposed characters be entirely in the BMP? N/A
   If YES, is a rationale provided? If YES, reference:
7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? Yes
8. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? No
   If YES, is a rationale for its inclusion provided? If YES, reference:
9. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? No
   If YES, is a rationale for its inclusion provided? If YES, reference:
10. Can any of the proposed character(s) be considered to be similar (in appearance or function) to, or could be confused with, an existing character? No
    If YES, is a rationale for its inclusion provided? If YES, reference:
11. Does the proposal include use of combining characters and/or use of composite sequences? Yes
    If YES, is a rationale for such use provided? Yes
    If YES, reference: See text of proposal
    Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided? If YES, reference:
12. Does the proposal contain characters with any special properties such as control function or similar semantics? Yes
    If YES, describe in detail (include attachment if necessary): Virama; see text of proposal
13. Does the proposal contain any Ideographic compatibility characters? No
    If YES, are the equivalent corresponding unified ideographic characters identified? If YES, reference:

However, there is one instance in the available sources in which cluster-initial ra is represented as a repha. The occurrence is curious because the word caturbhuja “square”, in which repha occurs, appears three times in proximity but is spelled differently in all three instances.

In the boxed text at the top, the ra in caturbhuja is written with halanta: <U+1180F U+11819 U+1182F U+11824 U+11838 U+11821 U+1182F U+11811> caturbhuja. In the text at bottom left, ra occurs with its inherent vowel intact, without halanta: <U+1180F U+11819 U+1182F U+11824 U+11821 U+1182F U+11811> caturabhuja. At bottom right, ra is represented as repha: caturbhuja.

The usage of repha in Dogra is to be considered anomalous. There is no present need to support this form of ra in encoded representation.

3.1.2.2 Encoded representation of conjuncts

The Devanagari model for the representation of conjuncts is appropriate for Dogra. In encoded text, conjuncts are produced by placing the control character U+11838 DOGRA SIGN VIRAMA after each non-final consonant in a cluster, i.e. <(C, VIRAMA), C>, with the pair (C, VIRAMA) repeated for every non-final consonant.

Representations of conjuncts rendered as atomic ligatures are:

kṣa  <U+1180A KA, U+11838 VIRAMA, U+11828 SSA>

sṭa  <U+11829 SA, U+11838 VIRAMA, U+11814 TTA>

Examples of conjuncts rendered using half-forms are:

śra  <U+11827 SHA, U+11838 VIRAMA, U+11824 RA>

Conjuncts displayed using visible halanta are:

pra  <U+1181E PA, U+11838 VIRAMA, U+11824 RA>

rṇa  <U+11824 RA, U+11838 VIRAMA, U+11818 NNA>
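The virama-based model for conjuncts can be illustrated with a short Python sketch. It is only an illustration of the sequence described in the text: the function names encode_conjunct and code_points are hypothetical, and the code points are those proposed in this document.

```python
# Code points as proposed in this document's names list.
VIRAMA = "\U00011838"  # DOGRA SIGN VIRAMA
KA = "\U0001180A"      # DOGRA LETTER KA
SSA = "\U00011828"     # DOGRA LETTER SSA
SHA = "\U00011827"     # DOGRA LETTER SHA
RA = "\U00011824"      # DOGRA LETTER RA


def encode_conjunct(consonants):
    """Return the encoded cluster: VIRAMA follows every consonant but the last."""
    if not consonants:
        raise ValueError("a cluster needs at least one consonant")
    return VIRAMA.join(consonants)


def code_points(text):
    """Render a string as space-separated U+XXXX notation for inspection."""
    return " ".join(f"U+{ord(ch):04X}" for ch in text)


ksa = encode_conjunct([KA, SSA])
print(code_points(ksa))  # U+1180A U+11838 U+11828
```

Whether the resulting sequence is displayed as a ligature, a half-form, or with a visible halanta is decided by the font, not by the encoded text.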

3.1.2.3 Modifying conjunct formation

As is the case in Devanagari and other Indic scripts, a ligature may be broken using the generic control character U+200C ZERO WIDTH NON-JOINER (ZWNJ):

kṣa (ligature)        <U+1180A KA, U+11838 VIRAMA, U+11828 SSA>

kṣa (visible virama)  <U+1180A KA, U+11838 VIRAMA, U+200C ZWNJ, U+11828 SSA>

The rendering of a conjunct as a ligature or with visible virama is a matter of font design.
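As a hedged illustration of breaking ligatures with U+200C, the sketch below inserts a ZERO WIDTH NON-JOINER after every virama that is directly followed by a Dogra consonant. The helper name break_ligatures is hypothetical, and the consonant range U+1180A..U+1182B is the one proposed in this document.

```python
VIRAMA = "\U00011838"  # DOGRA SIGN VIRAMA
ZWNJ = "\u200C"        # ZERO WIDTH NON-JOINER
KA = "\U0001180A"      # DOGRA LETTER KA
SSA = "\U00011828"     # DOGRA LETTER SSA


def is_dogra_consonant(ch):
    """True for the proposed consonant range U+1180A..U+1182B."""
    return 0x1180A <= ord(ch) <= 0x1182B


def break_ligatures(text):
    """Insert ZWNJ between a VIRAMA and a directly following consonant."""
    out = []
    for i, ch in enumerate(text):
        out.append(ch)
        following = text[i + 1] if i + 1 < len(text) else ""
        if ch == VIRAMA and following and is_dogra_consonant(following):
            out.append(ZWNJ)
    return "".join(out)


broken = break_ligatures(KA + VIRAMA + SSA)
print([f"U+{ord(ch):04X}" for ch in broken])
```

Because a virama already followed by ZWNJ is not followed by a consonant, applying the function twice leaves the text unchanged.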

3.1.3 Punctuation

Abbreviation sign: U+1183A DOGRA ABBREVIATION SIGN is used for indicating truncation of text. In the excerpt below, the highlighted text <U+11825 U+1182E U+1183A> <U+11826 U+1183A> <U+11821 U+1182C U+1183A> (lī va bhā 23) is an abbreviation for līlavatī bhāga 23.

Danda-s: The daṇḍā and double daṇḍā are commonly used in Dogra documents for marking the ends of sentences and paragraphs. The excerpt below shows usage of the daṇḍā, and the following shows use of the double daṇḍā.

These daṇḍā-s are not proposed for encoding as separate characters in the ‘Dogra’ block. They resemble U+0964 DEVANAGARI DANDA and U+0965 DEVANAGARI DOUBLE DANDA and may be represented using these characters. The Devanagari daṇḍā-s are to be specified as extensions for Dogra.

Other signs: Other marks of punctuation, such as dashes, are also used. A hyphen is used in some cases for indicating breaks in a word at end of line (see section 3.1.6).

3.1.4 Digits

Digits are commonly used in Dogra documents (see figure 11). The style of digits varies across written and printed sources. Some digits in Old Dogra sources resemble those of Takri (see figure 4), while those in New Dogra sources are similar to Devanagari. The excerpt below shows digits 1–9 and the number 10 as represented in the Līlavatī, written in New Dogra.

The majority of the digits shown here resemble Devanagari forms; the exception is ‘four’. The form of Dogra ‘four’ does not resemble U+116C4 TAKRI DIGIT FOUR, which is nearly identical to the Devanagari digit four. Information about its origins is not known at this time, but it may be related to U+111D4 SHARADA DIGIT FOUR. The forms of ‘eight’ and ‘nine’ are well-known variant forms of the regular Devanagari digits.

Additional research into Dogra digits is required before they may be considered for encoding. The available sources suggest that there may have been a cultural understanding that Dogra possessed unique digits. For instance, on the bank notes shown in figure 20, the digits written in Devanagari are rather similar, but are nonetheless displayed twice in order to align the display of the currency value with each script used for the inscriptions.

At present, script-specific digits are not proposed for inclusion in the Dogra repertoire. Devanagari digits should be used normatively and should be extended for usage with Dogra. If future research yields distinctive digits for Dogra, there is sufficient space in the block to accommodate their encoding.

3.1.5 Number forms

Fraction signs and currency marks are commonly used in Dogra sources (see figure 12). These may be represented using the fraction and currency signs already encoded in the ‘Common Indic Number Forms’ block (U+A830). These characters are specified as extensions for Dogra. The ‘Common Indic Number Forms’ are described briefly in The Unicode Standard (chapter 22.3, ‘Numerals’, pp. 770–771) and in more detail in the original encoding proposal (L2/07-354).

3.1.6 Linebreaking

Linebreaking occurs after an orthographic syllable. Breaks are generally not indicated in informal sources, but are marked in literary materials using hyphens.

3.1.7 Collation

The primary collating order for Dogra is as follows (DOGRA LETTER and DOGRA VOWEL SIGN names abbreviated):

A (U+11800) < AA < I < II < U < UU < E < AI < O < AU (U+11809) <

KA (U+1180A) < KHA < GA < GHA < NGA < CA < CHA < JA <

JHA < NYA < TTA < TTHA < DDA < DDHA < NNA < TA <

THA < DA < DHA < NA < PA < PHA < BA < BHA <

MA < YA < RA < LA < VA < SHA < SSA < SA < HA <

RRA (U+1182B) < SIGN AA (U+1182C) < SIGN I < SIGN II <

SIGN U < SIGN UU < SIGN VOCALIC R < SIGN E <

SIGN AI < SIGN O < SIGN AU (U+11835)

The following characters have secondary weights: U+11836 ANUSVARA, U+11837 VISARGA, U+11838 VIRAMA, U+11839 NUKTA.
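The two-level ordering can be approximated with a simple sort key, sketched below in Python. This is a simplified illustration and not an implementation of the Unicode Collation Algorithm: it assumes that the primary order coincides with code point order over U+11800..U+11835 and that the four signs U+11836..U+11839 only break ties. The name dogra_sort_key is hypothetical.

```python
PRIMARY_FIRST = 0x11800  # DOGRA LETTER A
PRIMARY_LAST = 0x11835   # DOGRA VOWEL SIGN AU
SECONDARY = {0x11836, 0x11837, 0x11838, 0x11839}  # anusvara, visarga, virama, nukta


def dogra_sort_key(text):
    """Build a (primary, secondary) key; other characters are ignored."""
    primary = []
    secondary = []
    for ch in text:
        cp = ord(ch)
        if PRIMARY_FIRST <= cp <= PRIMARY_LAST:
            primary.append(cp - PRIMARY_FIRST)
        elif cp in SECONDARY:
            # Record where the sign occurs so it only matters on primary ties.
            secondary.append((len(primary), cp))
    return (tuple(primary), tuple(secondary))


KA = "\U0001180A"
KHA = "\U0001180B"
ANUSVARA = "\U00011836"

words = [KHA, KA + ANUSVARA, KA]
ordered = sorted(words, key=dogra_sort_key)
```

With this key, KA sorts before KA followed by ANUSVARA, and both sort before KHA, because the sign is consulted only when the primary weights are equal.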

4 Character data

4.1 Character properties

The properties for Dogra in the Unicode Character Database format are:

11800;DOGRA LETTER A;Lo;0;L;;;;;N;;;;;
11801;DOGRA LETTER AA;Lo;0;L;;;;;N;;;;;
11802;DOGRA LETTER I;Lo;0;L;;;;;N;;;;;
11804;DOGRA LETTER U;Lo;0;L;;;;;N;;;;;
11806;DOGRA LETTER E;Lo;0;L;;;;;N;;;;;
11807;DOGRA LETTER AI;Lo;0;L;;;;;N;;;;;
11808;DOGRA LETTER O;Lo;0;L;;;;;N;;;;;
1180A;DOGRA LETTER KA;Lo;0;L;;;;;N;;;;;
1180B;DOGRA LETTER KHA;Lo;0;L;;;;;N;;;;;
1180C;DOGRA LETTER GA;Lo;0;L;;;;;N;;;;;
1180D;DOGRA LETTER GHA;Lo;0;L;;;;;N;;;;;
1180E;DOGRA LETTER NGA;Lo;0;L;;;;;N;;;;;
1180F;DOGRA LETTER CA;Lo;0;L;;;;;N;;;;;
11810;DOGRA LETTER CHA;Lo;0;L;;;;;N;;;;;
11811;DOGRA LETTER JA;Lo;0;L;;;;;N;;;;;
11812;DOGRA LETTER JHA;Lo;0;L;;;;;N;;;;;
11813;DOGRA LETTER NYA;Lo;0;L;;;;;N;;;;;
11814;DOGRA LETTER TTA;Lo;0;L;;;;;N;;;;;
11815;DOGRA LETTER TTHA;Lo;0;L;;;;;N;;;;;
11816;DOGRA LETTER DDA;Lo;0;L;;;;;N;;;;;
11817;DOGRA LETTER DDHA;Lo;0;L;;;;;N;;;;;
11818;DOGRA LETTER NNA;Lo;0;L;;;;;N;;;;;
11819;DOGRA LETTER TA;Lo;0;L;;;;;N;;;;;
1181A;DOGRA LETTER THA;Lo;0;L;;;;;N;;;;;
1181B;DOGRA LETTER DA;Lo;0;L;;;;;N;;;;;
1181C;DOGRA LETTER DHA;Lo;0;L;;;;;N;;;;;
1181D;DOGRA LETTER NA;Lo;0;L;;;;;N;;;;;
1181E;DOGRA LETTER PA;Lo;0;L;;;;;N;;;;;
1181F;DOGRA LETTER PHA;Lo;0;L;;;;;N;;;;;
11820;DOGRA LETTER BA;Lo;0;L;;;;;N;;;;;
11821;DOGRA LETTER BHA;Lo;0;L;;;;;N;;;;;
11822;DOGRA LETTER MA;Lo;0;L;;;;;N;;;;;
11823;DOGRA LETTER YA;Lo;0;L;;;;;N;;;;;
11824;DOGRA LETTER RA;Lo;0;L;;;;;N;;;;;
11825;DOGRA LETTER LA;Lo;0;L;;;;;N;;;;;
11826;DOGRA LETTER VA;Lo;0;L;;;;;N;;;;;
11827;DOGRA LETTER SHA;Lo;0;L;;;;;N;;;;;
11828;DOGRA LETTER SSA;Lo;0;L;;;;;N;;;;;
11829;DOGRA LETTER SA;Lo;0;L;;;;;N;;;;;
1182A;DOGRA LETTER HA;Lo;0;L;;;;;N;;;;;
1182B;DOGRA LETTER RRA;Lo;0;L;;;;;N;;;;;
1182C;DOGRA VOWEL SIGN AA;Mc;0;L;;;;;N;;;;;
1182D;DOGRA VOWEL SIGN I;Mc;0;L;;;;;N;;;;;
1182E;DOGRA VOWEL SIGN II;Mc;0;L;;;;;N;;;;;
1182F;DOGRA VOWEL SIGN U;Mn;0;NSM;;;;;N;;;;;
11830;DOGRA VOWEL SIGN UU;Mn;0;NSM;;;;;N;;;;;
11831;DOGRA VOWEL SIGN VOCALIC R;Mn;0;NSM;;;;;N;;;;;
11832;DOGRA VOWEL SIGN E;Mn;0;NSM;;;;;N;;;;;
11833;DOGRA VOWEL SIGN AI;Mn;0;NSM;;;;;N;;;;;
11834;DOGRA VOWEL SIGN O;Mn;0;NSM;;;;;N;;;;;
11835;DOGRA VOWEL SIGN AU;Mn;0;NSM;;;;;N;;;;;
11836;DOGRA SIGN ANUSVARA;Mn;0;NSM;;;;;N;;;;;
11837;DOGRA SIGN VISARGA;Mc;0;L;;;;;N;;;;;
11838;DOGRA SIGN VIRAMA;Mn;9;NSM;;;;;N;;;;;
11839;DOGRA SIGN NUKTA;Mn;7;NSM;;;;;N;;;;;
1183A;DOGRA ABBREVIATION SIGN;Po;0;L;;;;;N;;;;;
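Records in the UnicodeData.txt format consist of 15 semicolon-separated fields, of which the first five carry the code point, name, general category, canonical combining class, and bidirectional class. The following Python sketch reads those five fields from one record; the type CharRecord and the function parse_unicode_data_line are hypothetical names chosen for illustration, and the sample record is the virama entry with its separators written out.

```python
from typing import NamedTuple


class CharRecord(NamedTuple):
    code_point: int
    name: str
    general_category: str
    combining_class: int
    bidi_class: str


def parse_unicode_data_line(line):
    """Parse the first five fields of a UnicodeData.txt-style record."""
    fields = line.strip().split(";")
    if len(fields) != 15:
        raise ValueError(f"expected 15 fields, found {len(fields)}")
    return CharRecord(
        code_point=int(fields[0], 16),
        name=fields[1],
        general_category=fields[2],
        combining_class=int(fields[3]),
        bidi_class=fields[4],
    )


virama = parse_unicode_data_line("11838;DOGRA SIGN VIRAMA;Mn;9;NSM;;;;;N;;;;;")
print(virama.name, virama.combining_class)
```

The combining classes 9 (virama) and 7 (nukta) are the only non-zero values in the Dogra data, which matches the conventions used for other Indic scripts.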

4.2 Linebreaking

Linebreaking properties, given in the data format of LineBreak.txt:

11800..1182B;AL # Lo  [44] DOGRA LETTER A..DOGRA LETTER RRA
1182C..1182E;CM # Mc   [3] DOGRA VOWEL SIGN AA..DOGRA VOWEL SIGN II
1182F..11835;CM # Mn   [7] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN AU
11836;CM        # Mn       DOGRA SIGN ANUSVARA
11837;CM        # Mc       DOGRA SIGN VISARGA
11838..11839;CM # Mn   [2] DOGRA SIGN VIRAMA..DOGRA SIGN NUKTA
1183A;AL        # Po       DOGRA ABBREVIATION SIGN

4.3 Syllabic categories

Syllabic categories, given in the format of IndicSyllabicCategory.txt:

# Indic_Syllabic_Category=Bindu
11836        ; Bindu             # Mn       DOGRA SIGN ANUSVARA

# Indic_Syllabic_Category=Visarga
11837        ; Visarga           # Mc       DOGRA SIGN VISARGA

# Indic_Syllabic_Category=Nukta
11839        ; Nukta             # Mn       DOGRA SIGN NUKTA

# Indic_Syllabic_Category=Virama
11838        ; Virama            # Mn       DOGRA SIGN VIRAMA

# Indic_Syllabic_Category=Vowel_Independent
11800..11809 ; Vowel_Independent # Lo  [10] DOGRA LETTER A..DOGRA LETTER AU

# Indic_Syllabic_Category=Vowel_Dependent
1182C..1182E ; Vowel_Dependent   # Mc   [3] DOGRA VOWEL SIGN AA..DOGRA VOWEL SIGN II
1182F..11835 ; Vowel_Dependent   # Mn   [7] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN AU

# Indic_Syllabic_Category=Consonant
1180A..1182B ; Consonant         # Lo  [34] DOGRA LETTER KA..DOGRA LETTER RRA
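Property files such as IndicSyllabicCategory.txt list a code point or a range (first..last), a semicolon, the property value, and a trailing comment introduced by '#'. A minimal Python sketch for reading such a line follows; the function name parse_property_line is hypothetical, and the sample lines are the Dogra consonant and virama entries with separators written out.

```python
def parse_property_line(line):
    """Return (range_of_code_points, value), or None for blank/comment lines."""
    data = line.split("#", 1)[0].strip()  # drop the trailing comment
    if not data:
        return None
    cp_field, value = (part.strip() for part in data.split(";"))
    if ".." in cp_field:
        first, last = cp_field.split("..")
    else:
        first = last = cp_field
    # The upper bound of the data file is inclusive; Python ranges are not.
    return range(int(first, 16), int(last, 16) + 1), value


consonants, category = parse_property_line(
    "1180A..1182B ; Consonant # Lo [34] DOGRA LETTER KA..DOGRA LETTER RRA"
)
print(category, len(consonants))
```

Comparing the length of each parsed range with the bracketed count in the comment is a quick way to catch range or count slips in hand-written property data.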

4.4 Positional categories

Positional data for characters, in the format of IndicPositionalCategory.txt:

# Indic_Positional_Category=Right
1182C        ; Right  # Mc       DOGRA VOWEL SIGN AA
1182E        ; Right  # Mc       DOGRA VOWEL SIGN II
11837        ; Right  # Mc       DOGRA SIGN VISARGA

# Indic_Positional_Category=Left
1182D        ; Left   # Mc       DOGRA VOWEL SIGN I

# Indic_Positional_Category=Top
11836        ; Top    # Mn       DOGRA SIGN ANUSVARA
11832..11835 ; Top    # Mn   [4] DOGRA VOWEL SIGN E..DOGRA VOWEL SIGN AU

# Indic_Positional_Category=Bottom
1182F..11831 ; Bottom # Mn   [3] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN VOCALIC R
11838        ; Bottom # Mn       DOGRA SIGN VIRAMA
11839        ; Bottom # Mn       DOGRA SIGN NUKTA

4.5 Script extensions

The following characters should be extended for usage with the Dogra script in ScriptExtensions.txt:

0964       # Po       DEVANAGARI DANDA
0965       # Po       DEVANAGARI DOUBLE DANDA

0966..096F # Nd  [10] DEVANAGARI DIGIT ZERO..DEVANAGARI DIGIT NINE

A830..A835 # No   [6] NORTH INDIC FRACTION ONE QUARTER..NORTH INDIC FRACTION THREE SIXTEENTHS

A836..A837 # So   [2] NORTH INDIC QUARTER MARK..NORTH INDIC PLACEHOLDER MARK
A838       # Sc       NORTH INDIC RUPEE MARK
A839       # So       NORTH INDIC QUANTITY MARK
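A hedged sketch of how the proposed repertoire and its script extensions might be used to check text: the Python below flags characters that belong neither to the proposed Dogra range (U+11800..U+1183A) nor to the extension ranges listed in this section. The names is_dogra_char and foreign_chars are hypothetical, and other characters that may legitimately occur in Dogra text, such as the hyphen or U+200C, are deliberately left out of this simplified check.

```python
DOGRA = range(0x11800, 0x1183B)  # proposed characters 11800..1183A
EXTENSIONS = [
    range(0x0964, 0x0966),  # DEVANAGARI DANDA, DOUBLE DANDA
    range(0x0966, 0x0970),  # DEVANAGARI DIGIT ZERO..NINE
    range(0xA830, 0xA83A),  # NORTH INDIC fractions and marks A830..A839
]


def is_dogra_char(ch):
    """True if ch is in the proposed block or one of the extension ranges."""
    cp = ord(ch)
    return cp in DOGRA or any(cp in r for r in EXTENSIONS)


def foreign_chars(text):
    """List non-space characters that fall outside the Dogra repertoire."""
    return [ch for ch in text if not ch.isspace() and not is_dogra_char(ch)]


sample = "\U0001180A\U0001182C \u0964"  # KA + VOWEL SIGN AA, space, danda
print(foreign_chars(sample))            # []
```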


4.6 ‘Confusable’ characters

Some Dogra characters that resemble characters encoded in other script blocks are:

DOGRA LETTER U            11187  SHARADA LETTER U
DOGRA LETTER UU           11188  SHARADA LETTER UU
DOGRA LETTER O            11684  TAKRI LETTER U
DOGRA LETTER KA           0A15   GURMUKHI LETTER KA
DOGRA LETTER CA           11196  SHARADA LETTER CA
DOGRA LETTER TTHA         1119C  SHARADA LETTER TTHA
DOGRA LETTER TA           11699  TAKRI LETTER TA
DOGRA LETTER DHA          11680  TAKRI LETTER A
DOGRA LETTER PHA          1169F  TAKRI LETTER PHA
DOGRA LETTER BHA          116A1  TAKRI LETTER BHA
DOGRA LETTER YA           116A3  TAKRI LETTER YA
DOGRA LETTER SHA          0936   DEVANAGARI LETTER SHA
DOGRA LETTER SSA          111B0  SHARADA LETTER SSA
DOGRA VOWEL SIGN U        0941   DEVANAGARI VOWEL SIGN U
DOGRA VOWEL SIGN UU       0942   DEVANAGARI VOWEL SIGN UU
DOGRA VOWEL SIGN E        116B2  TAKRI VOWEL SIGN E
DOGRA VOWEL SIGN AI       116B3  TAKRI VOWEL SIGN AI
DOGRA VOWEL SIGN O        116B4  TAKRI VOWEL SIGN O
DOGRA VOWEL SIGN AU       116B5  TAKRI VOWEL SIGN AU
DOGRA SIGN ANUSVARA       0902   DEVANAGARI SIGN ANUSVARA
DOGRA SIGN VIRAMA         094D   DEVANAGARI SIGN VIRAMA
DOGRA SIGN NUKTA          093C   DEVANAGARI SIGN NUKTA
DOGRA ABBREVIATION SIGN   0970   DEVANAGARI ABBREVIATION SIGN

5 References

Bhāskarācārya. 1872. Līlavatī. Translation into Dogri. Jammu: Vidya Vilas Press.

Grierson, George A. 1916. The Linguistic Survey of India. Vol. IX. Indo-Aryan Family. Central Group. Part I. Specimens of Western Hindī and Pañjābī. Calcutta: Office of the Superintendent of Government Printing, India.

Leitner, Gottlieb William. 1882. “A Collection of Specimens of Commercial and Other Alphabets and Handwritings as also of Multiplication Tables Current in Various Parts of the Panjab, Sind and the North West Provinces”. Published as Appendix V of History of Indigenous Education in the Punjab. Lahore: Anjuman-i-Punjab Press.

Pandey, Anshuman. 2007. “Proposal to Encode North Indic Number Forms in ISO/IEC 10646” (L2/07-354). http://www.unicode.org/L2/L2007/07354-north-indic.pdf

Pandey, Anshuman. 2009. “Proposal to Encode the Takri Script in ISO/IEC 10646” (L2/09-424). http://www.unicode.org/L2/L2009/09424-takri.pdf

Pandey, Anshuman. 2015. “Preliminary Proposal to Encode the Dogra Script in Unicode” (L2/15-213). http://www.unicode.org/L2/L2015/15213-dogra.pdf

Staal, Fritz. 1984. Stamps of Jammu & Kashmir. New York: The Collectors Club.

von der Lin, Carol A. “Collection Kashmir”: “Philatelic Glossary: Dōgrī-Takrī Scripts”. http://www.kashmirstamps.com/DogriGloss.html

14

Proposal to encode the Dogra script in Unicode Anshuman Pandey

6 Acknowledgments

I would like to thank Christopher Shackle (School of Oriental and African Studies London) for reviewingthis proposal

This project was made possible in part through a Google Research Award granted to Deborah Anderson forthe Script Encoding Initiative and a grant from the United States National Endowment for the Humanities(PR-50205-15) which funds the Universal Scripts Project (part of the Script Encoding Initiative at the Uni-versity of California Berkeley) Any views findings conclusions or recommendations expressed in thispublication do not necessarily reflect those of Google or the National Endowment for the Humanities

15

Printed using UniBooktrade(httpwwwunicodeorgunibook)

Printed 04-Nov-2015 1

1184FDogra11800

1180 1181 1182 1183 1184

a

71681

i

71683

u

71685

e

71687

o

71689

71690

71691

71692

71693

71694

71695

71696

71697

71698

71699

71700

71701

71702

71703

71704

71705

71706

71707

71708

71709

71710

71711

71712

71713

71714

71715

71716

71717

71718

71719

71720

71721

71722

71723

$71724

$ i

$71726

$ u

$ 71728

$ r

$ e

$ 71731

$ o

$ 71733

$ 71734

$71735

$ 71736

$ 71737

71738

11800

11801

11802

11803

11804

11805

11806

11807

11808

11809

1180A

1180B

1180C

1180D

1180E

1180F

11810

11811

11812

11813

11814

11815

11816

11817

11818

11819

1181A

1181B

1181C

1181D

1181E

1181F

11820

11821

11822

11823

11824

11825

11826

11827

11828

11829

1182A

1182B

1182C

1182D

1182E

1182F

11830

11831

11832

11833

11834

11835

11836

11837

11838

11839

1183A

0

1

2

3

4

5

6

7

8

9

A

B

C

D

E

F

Printed using UniBooktrade(httpwwwunicodeorgunibook)

Printed 04-Nov-2015 2

1183ADogra11800

Punctuation1183A 71738 DOGRA ABBREVIATION SIGN

Independent vowels11800 a DOGRA LETTER A11801 71681 DOGRA LETTER AA11802 i DOGRA LETTER I11803 71683 DOGRA LETTER II11804 u DOGRA LETTER U11805 71685 DOGRA LETTER UU11806 e DOGRA LETTER E11807 71687 DOGRA LETTER AI11808 o DOGRA LETTER O11809 71689 DOGRA LETTER AU

Consonants1180A 71690 DOGRA LETTER KA1180B 71691 DOGRA LETTER KHA1180C 71692 DOGRA LETTER GA1180D 71693 DOGRA LETTER GHA1180E 71694 DOGRA LETTER NGA1180F 71695 DOGRA LETTER CA11810 71696 DOGRA LETTER CHA11811 71697 DOGRA LETTER JA11812 71698 DOGRA LETTER JHA11813 71699 DOGRA LETTER NYA11814 71700 DOGRA LETTER TTA11815 71701 DOGRA LETTER TTHA11816 71702 DOGRA LETTER DDA11817 71703 DOGRA LETTER DDHA11818 71704 DOGRA LETTER NNA11819 71705 DOGRA LETTER TA1181A 71706 DOGRA LETTER THA1181B 71707 DOGRA LETTER DA1181C 71708 DOGRA LETTER DHA1181D 71709 DOGRA LETTER NA1181E 71710 DOGRA LETTER PA1181F 71711 DOGRA LETTER PHA11820 71712 DOGRA LETTER BA11821 71713 DOGRA LETTER BHA11822 71714 DOGRA LETTER MA11823 71715 DOGRA LETTER YA11824 71716 DOGRA LETTER RA11825 71717 DOGRA LETTER LA11826 71718 DOGRA LETTER VA11827 71719 DOGRA LETTER SHA11828 71720 DOGRA LETTER SSA11829 71721 DOGRA LETTER SA1182A 71722 DOGRA LETTER HA1182B 71723 DOGRA LETTER RRA

Dependent vowel signs1182C $71724 DOGRA VOWEL SIGN AA1182D $ i DOGRA VOWEL SIGN I1182E $71726 DOGRA VOWEL SIGN II1182F $ u DOGRA VOWEL SIGN U11830 $ 71728 DOGRA VOWEL SIGN UU11831 $ r DOGRA VOWEL SIGN VOCALIC R11832 $ e DOGRA VOWEL SIGN E11833 $ 71731 DOGRA VOWEL SIGN AI11834 $ o DOGRA VOWEL SIGN O11835 $ 71733 DOGRA VOWEL SIGN AU

Various signs11836 $ 71734 DOGRA SIGN ANUSVARA11837 $71735 DOGRA SIGN VISARGA11838 $ 71736 DOGRA SIGN VIRAMA11839 $ 71737 DOGRA SIGN NUKTA

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 1 Comparison of Old Dogra with Takri and other scripts (from Leitner 1882 Set 2)

18

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 2 Comparison of Old Dogra with Takri and other scripts (from Leitner 1882 Set 2)

19

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 3 Specimen of the Dogri language written in Old Dogra (from Grierson 1916 760)

20

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 4 Another specimen of the Dogri language written in Old Dogra (fromGrierson 1916 772)

21

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 5 Chart of the Old Dogra script (from Grierson 1916 641)

22

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 6 Comparison of Gurmukhi Kangra and Old Dogra scripts (from Grierson 1916 642)

23

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 7 Comparison of New Dogra (lsquoDogrirsquo) and Devanagari vowel and consonant letters (fromStaal 1984 33)

Figure 8 Comparison of New Dogra (lsquoDogrirsquo) and Devanagari vowel signs (from Staal 1984 34)

24

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 9 The first page of a Dogri translation of the Līlavatī by Bhāskarācārya a Sanskrit treastiseon mathematics printed in the New Dogra script in 1872 (p 1) Figure 10 contains the second pageof this book

25

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 10 Another page from the Līlavatī in New Dogra (p 2)

26

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 11 Page from the Līlavatī showing the use of printed New Dogra digits (p 3)

27

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 12 Usage of fractions for denoting currency in New Dogra (from Staal 1984 75) The textappears to be typeset using metal fonts The fraction signs are currency marks may be representedusing characters from the lsquoCommon Indic Number Formsrsquo block (U+A830) in Unicode

28

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 13 Two specimens of a trilingual stamp for expedited mail from Jammu State in Old DograPersian and Devanagari scripts The Dogra text reads 716907169071717 7169771716717277171671726 kakal jarurī The Persian readsضروری خط ḵẖat zarūrī the Devanagari reads āvaśyaka patraआव यक प

Figure 14 Four bilingual regular postage stamps from Jammu State with text in the Old Dogra andPersian scripts Stamps and are half lsquoannarsquo stamps and is a one lsquoannarsquo stamp In all stampsthe Dogra text at the top reads 716977171471684 jamu lsquoJammursquo and 7169071696717147172671716 kachamīr (= kaśmīr) lsquoKashmirrsquoIn stamps and the first line of the center is Persian انه نم ۱۹۲۳ 1923 [samvat] nim ānah lsquohalfānnārsquo the second line of the center is Dogra71680717077168071709 ada ana lsquohalf ānnārsquo In the first line of thecenter text is Persian انه یك yak ānah lsquoone ānnārsquo the second line of the center is Dogra71686716907168071709 ekana lsquoone ānnārsquo In all four stamps the third line of the center is the date 1923 [samvat] written inDevanagari-like digits १९२३ The Persian text at the bottom of the stamp reads جمون سرکار قلمروکشمیر و qalamrau sarkār jammūn o kaśmīr lsquodominions of the ruler of Jammu and Kashmirrsquo

Detail of top left (Devanagari)

Detail of top right (New Dogra)

Detail of center (New Dogra, Devanagari, and Urdu)

Figure 15: The mast of a two rupee non-judicial stamp paper from Kashmir State.

Figure 16: The mast of a five rupee non-judicial stamp paper from Kashmir State containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.

Figure 17: A one anna stamp from Jammu and Kashmir State with text in New Dogra, Devanagari, and Arabic (Urdu).

Figure 18: Telegraph stamps from Jammu and Kashmir State with text in Dogra, Devanagari, Urdu, and English. The Dogra denominations show usage of Common Indic fractions and the rupee mark.

postakarda kalamrau jammu kasmir va tibbataha “Postcard of the realm of Jammu, Kashmir, and Tibet”

isa tarapha siva pate ke kucha na likho “Do not write anything on this side except for the address”

jamu sinagar “Jammu” “Srinagar”

bhaisakh (4 ) 4 vaisakh “Vaisakh 4” “4 Vaisakh”

Figure 19: A state postcard from Jammu and Kashmir with post mark of 19 April 1891. The inscription at top is in New Dogra and the content is in Persian. The postmarks are in English and Dogra.

One rupee banknote

Twenty rupee banknote

Figure 20: Bank notes from Jammu and Kashmir State containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.

Vowel letters

Letter  Chamba (Takri block)  Dogra (proposed)
A       U+11680               U+11800
AA      U+11681               U+11801
I       U+11682               U+11802
II      U+11683               (none)
U       U+11684               U+11804
UU      U+11685               (none)
E       U+11686               U+11806
AI      U+11687               U+11807
O       U+11688               U+11808
AU      U+11689               (none)

Vowel signs

Sign    Chamba (Takri block)  Dogra (proposed)
A       no dependent form
AA      U+116AD               U+1182C
I       U+116AE               U+1182D
II      U+116AF               U+1182E
U       U+116B0               U+1182F
UU      U+116B1               U+11830
E       U+116B2               U+11832
AI      U+116B3               U+11833
O       U+116B4               U+11834
AU      U+116B5               U+11835

Table 2: Comparison of Chamba and Dogra vowel letters and signs

Letter  Chamba (Takri block)  Dogra (proposed)
KA      U+1168A               U+1180A
KHA     U+1168B               U+1180B
GA      U+1168C               U+1180C
GHA     U+1168D               U+1180D
NGA     U+1168E               U+1180E
CA      U+1168F               U+1180F
CHA     U+11690               U+11810
JA      U+11691               U+11811
JHA     U+11692               U+11812
NYA     U+11693               U+11813
TTA     U+11694               U+11814
TTHA    U+11695               U+11815
DDA     U+11696               U+11816
DDHA    U+11697               U+11817
NNA     U+11698               U+11818
TA      U+11699               U+11819
THA     U+1169A               U+1181A
DA      U+1169B               U+1181B
DHA     U+1169C               U+1181C
NA      U+1169D               U+1181D
PA      U+1169E               U+1181E
PHA     U+1169F               U+1181F
BA      U+116A0               U+11820
BHA     U+116A1               U+11821
MA      U+116A2               U+11822
YA      U+116A3               U+11823
RA      U+116A4               U+11824
LA      U+116A5               U+11825
VA      U+116A6               U+11826
SHA     U+116A7               U+11827
SSA     (none)                U+11828
SA      U+116A8               U+11829
HA      U+116A9               U+1182A
RRA     U+116AA               U+1182B

Table 3: Comparison of Chamba and Dogra consonant letters

ISO/IEC JTC 1/SC 2/WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646 [1]

Please fill all the sections A, B and C below. Please read Principles and Procedures Document (P & P) from http://std.dkuug.dk/JTC1/SC2/WG2/docs/principles.html for guidelines and details before filling this form. Please ensure you are using the latest Form from http://std.dkuug.dk/JTC1/SC2/WG2/docs/summaryform.html. See also http://std.dkuug.dk/JTC1/SC2/WG2/docs/roadmaps.html for latest Roadmaps.

[1] Form number: N4502-F (Original 1994-10-14; Revised 1995-01, 1995-04, 1996-04, 1996-08, 1999-03, 2001-05, 2001-09, 2003-11, 2005-01, 2005-09, 2005-10, 2007-03, 2008-05, 2009-11, 2011-03, 2012-01)

A. Administrative

1. Title: Proposal to encode the Dogra script in Unicode
2. Requester's name: Anshuman Pandey, Script Encoding Initiative
3. Requester type (Member body/Liaison/Individual contribution): Liaison contribution
4. Submission date: 4 November 2015
5. Requester's reference (if applicable):
6. Choose one of the following:
   This is a complete proposal: Yes
   (or) More information will be provided later:

B. Technical - General

1. Choose one of the following:
   a. This proposal is for a new script (set of characters): Yes
      Proposed name of script: Dogra
   b. The proposal is for addition of character(s) to an existing block:
      Name of the existing block:
2. Number of characters in proposal: 59
3. Proposed category (select one from below - see section 2.2 of P&P document):
   A-Contemporary
   B.1-Specialized (small collection)
   B.2-Specialized (large collection)
   C-Major extinct
   D-Attested extinct
   E-Minor extinct: X
   F-Archaic Hieroglyphic or Ideographic
   G-Obscure or questionable usage symbols
4. Is a repertoire including character names provided? Yes
   a. If YES, are the names in accordance with the “character naming guidelines” in Annex L of P&P document?
   b. Are the character shapes attached in a legible form suitable for review?
5. Fonts related:
   a. Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the standard? Anshuman Pandey
   b. Identify the party granting a license for use of the font by the editors (include address, e-mail, ftp-site, etc.): Anshuman Pandey
6. References:
   a. Are references (to other character sets, dictionaries, descriptive texts, etc.) provided? Yes
   b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? Yes
7. Special encoding issues: Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration, etc. (if yes, please enclose information)? Yes
8. Additional Information: Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths, etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org for such information on other scripts. Also see Unicode Character Database (http://www.unicode.org/reports/tr44/) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard.

C. Technical - Justification

1. Has this proposal for addition of character(s) been submitted before? No
   If YES, explain:
2. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? Yes
   If YES, with whom? Christopher Shackle (SOAS, London)
   If YES, available relevant documents:
3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? No
   Reference: Historical script, currently not used
4. The context of use for the proposed characters (type of use; common or rare): Common
   Reference:
5. Are the proposed characters in current use by the user community? No
   If YES, where? Reference:
6. After giving due considerations to the principles in the P&P document, must the proposed characters be entirely in the BMP? N/A
   If YES, is a rationale provided? If YES, reference:
7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? Yes
8. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? No
   If YES, is a rationale for its inclusion provided? If YES, reference:
9. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? No
   If YES, is a rationale for its inclusion provided? If YES, reference:
10. Can any of the proposed character(s) be considered to be similar (in appearance or function) to, or could be confused with, an existing character? No
    If YES, is a rationale for its inclusion provided? If YES, reference:
11. Does the proposal include use of combining characters and/or use of composite sequences? Yes
    If YES, is a rationale for such use provided? Yes
    If YES, reference: See text of proposal
    Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided?
    If YES, reference:
12. Does the proposal contain characters with any special properties such as control function or similar semantics? Yes
    If YES, describe in detail (include attachment if necessary): Virama; see text of proposal
13. Does the proposal contain any Ideographic compatibility characters? No
    If YES, are the equivalent corresponding unified ideographic characters identified? If YES, reference:

3.12.3 Modifying conjunct formation

As is the case in Devanagari and other Indic scripts, a ligature may be broken using the generic control character U+200C ZERO WIDTH NON-JOINER:

kṣa (ligature): <U+1180A DOGRA LETTER KA, U+11838 DOGRA SIGN VIRAMA, U+11828 DOGRA LETTER SSA>

kṣa (visible virama): <U+1180A DOGRA LETTER KA, U+11838 DOGRA SIGN VIRAMA, U+200C ZERO WIDTH NON-JOINER, U+11828 DOGRA LETTER SSA>

The rendering of a conjunct as a ligature or with a visible virama is a matter of font design.
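The two encodings of kṣa can be sketched in Python as follows. This is only an illustration: the code points are the ones assigned in section 4.1 of this proposal (they may differ from those in the published Unicode Standard), and the helper names `conjunct` and `as_code_points` are hypothetical.

```python
# Code points follow the assignments in this proposal (section 4.1);
# they are assumptions for illustration, not values from a released standard.
KA = "\U0001180A"      # DOGRA LETTER KA
SSA = "\U00011828"     # DOGRA LETTER SSA
VIRAMA = "\U00011838"  # DOGRA SIGN VIRAMA (proposal value)
ZWNJ = "\u200C"        # ZERO WIDTH NON-JOINER


def conjunct(first: str, second: str, ligate: bool = True) -> str:
    """Return the code point sequence for a two-consonant cluster.

    With ligate=False a ZWNJ is placed after the virama, which requests
    the visible-virama form instead of a ligature.
    """
    joiner = "" if ligate else ZWNJ
    return first + VIRAMA + joiner + second


def as_code_points(text: str) -> str:
    """Format a string as a space-separated list of U+XXXX values."""
    return " ".join(f"U+{ord(ch):04X}" for ch in text)


ksa_ligature = conjunct(KA, SSA)
ksa_visible_virama = conjunct(KA, SSA, ligate=False)
print(as_code_points(ksa_ligature))
print(as_code_points(ksa_visible_virama))
```

Whether the first sequence actually renders as a ligature still depends on the font, as noted above; the ZWNJ only rules the ligature out.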

3.13 Punctuation

Abbreviation sign: The U+1183A DOGRA ABBREVIATION SIGN is used for indicating truncation of text. In the excerpt below, the highlighted text <U+11825, U+1182E, U+1183A> <U+11826, U+1183A> <U+11821, U+1182C, U+1183A> (lī va bhā 23) is an abbreviation for līla vatī bhāga 23:

Danda-s: The daṇḍā and double daṇḍā are commonly used in Dogra documents for marking the ends of sentences and paragraphs. The excerpt below shows usage of the daṇḍā:

and the following shows use of the double daṇḍā:

These daṇḍā-s are not proposed for encoding as separate characters in the ‘Dogra’ block. They resemble U+0964 DEVANAGARI DANDA and U+0965 DEVANAGARI DOUBLE DANDA and may be represented using these characters. The Devanagari daṇḍā-s are to be specified as extensions for Dogra.

Other signs: Other marks of punctuation, such as dashes, are also used.

A hyphen is used in some cases for indicating breaks in a word at end of line (see section 3.16).

3.14 Digits

Digits are commonly used in Dogra documents (see figure 11). The style of digits varies across written and printed sources. Some digits in Old Dogra sources resemble those of Takri (see figure 4), while those in New Dogra sources are similar to Devanagari. The excerpt below shows digits 1–9 and the number 10 as represented in the Līlavatī, written in New Dogra:

The majority of the digits shown here resemble Devanagari forms; the exception is ‘four’. The form of Dogra ‘four’ does not resemble U+116C4 TAKRI DIGIT FOUR, which is nearly identical to the Devanagari digit four. Information about its origins is not known at this time, but it may be related to U+111D4 SHARADA DIGIT FOUR. The forms of ‘eight’ and ‘nine’ are well-known variant forms of the regular Devanagari digits.

Additional research into Dogra digits is required before they may be considered for encoding. The available sources suggest that there may have been a cultural understanding that Dogra possessed unique digits. For instance, on the bank notes shown in figure 20, the digits written in Devanagari are rather similar, but are nonetheless displayed twice in order to align the display of the currency value with each script used for the inscriptions.

At present, script-specific digits are not proposed for inclusion in the Dogra repertoire. Devanagari digits should be used normatively and should be extended for usage with Dogra. If future research yields distinctive digits for Dogra, there is sufficient space in the block to accommodate their encoding.
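Because Devanagari digits carry the Unicode decimal-digit property, numbers in Dogra text encoded this way can be handled with ordinary library calls. The following Python sketch illustrates this with the samvat date 1923 from figure 14; the helper `to_devanagari_digits` is a hypothetical name introduced for the example.

```python
import unicodedata

# The samvat date from figure 14, written with DEVANAGARI DIGIT ONE, NINE, TWO, THREE.
date_on_stamp = "\u0967\u096F\u0968\u0969"

# int() accepts any characters that have the Unicode decimal-digit property.
year = int(date_on_stamp)


def to_devanagari_digits(value: int) -> str:
    """Render a non-negative integer using U+0966..U+096F."""
    return "".join(chr(0x0966 + int(d)) for d in str(value))


print(year)
print(to_devanagari_digits(10))
print([unicodedata.digit(ch) for ch in date_on_stamp])
```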

3.15 Number forms

Fraction signs and currency marks are commonly used in Dogra sources (see figure 12). These may be represented using the fraction and currency signs already encoded in the ‘Common Indic Number Forms’ block (U+A830). These characters are specified as extensions for Dogra. The ‘Common Indic Number Forms’ are described briefly in The Unicode Standard (chapter 22.3, ‘Numerals’, pp. 770–771) and in more detail in the original encoding proposal (L2/07-354).
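The fraction signs in this block have numeric values in the Unicode Character Database, so their values can be read programmatically. The Python sketch below sums a run of signs and converts the result to annas. It assumes that adjacent signs combine additively and that the rupee is divided into 16 annas; both are simplifications made for the example, and `fraction_value` is a hypothetical helper.

```python
import unicodedata
from fractions import Fraction

ANNAS_PER_RUPEE = 16  # assumed traditional division of the rupee


def fraction_value(signs: str) -> Fraction:
    """Sum the numeric values of a run of North Indic fraction signs.

    Only U+A830..U+A835 are accepted; additive combination of adjacent
    signs is an assumption made for this sketch.
    """
    total = Fraction(0)
    for ch in signs:
        if not 0xA830 <= ord(ch) <= 0xA835:
            raise ValueError(f"not a North Indic fraction sign: U+{ord(ch):04X}")
        total += Fraction(unicodedata.numeric(ch))
    return total


# NORTH INDIC FRACTION ONE HALF followed by NORTH INDIC FRACTION ONE SIXTEENTH
half_and_sixteenth = "\uA831\uA833"
amount = fraction_value(half_and_sixteenth)
print(amount, "rupee =", amount * ANNAS_PER_RUPEE, "annas")
```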

3.16 Linebreaking

Linebreaking occurs after an orthographic syllable. Breaks are generally not indicated in informal sources, but are marked in literary materials using hyphens.
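A rough way to locate break opportunities is to segment text into orthographic syllables with a regular expression. The Python sketch below does this under several assumptions: it uses the code points assigned in this proposal, it models a syllable as a consonant cluster (or an independent vowel) followed by optional vowel signs and anusvara or visarga, and it ignores ZWNJ and non-Dogra text. It is an illustration of the rule stated above, not a full line breaking implementation.

```python
import re
from typing import List

# Character classes, using the code points assigned in this proposal.
CONSONANT = "[\U0001180A-\U0001182B]"
INDEPENDENT_VOWEL = "[\U00011800-\U00011809]"
VOWEL_SIGN = "[\U0001182C-\U00011835]"
FINAL_SIGN = "[\U00011836\U00011837]"  # anusvara, visarga
VIRAMA = "\U00011838"
NUKTA = "\U00011839"

# Simplified orthographic syllable: C(+nukta) joined by viramas, or an
# independent vowel, followed by any vowel signs and final signs.
SYLLABLE = re.compile(
    f"(?:{CONSONANT}{NUKTA}?(?:{VIRAMA}{CONSONANT}{NUKTA}?)*{VIRAMA}?"
    f"|{INDEPENDENT_VOWEL})"
    f"{VOWEL_SIGN}*{FINAL_SIGN}*"
)


def break_opportunities(text: str) -> List[int]:
    """Return the indexes in `text` at which a line may be broken."""
    positions = []
    pos = 0
    while pos < len(text):
        match = SYLLABLE.match(text, pos)
        # Characters outside the Dogra classes are treated as one-character units.
        pos = match.end() if match else pos + 1
        if pos < len(text):
            positions.append(pos)
    return positions


# "jarurī" from figure 13: JA, RA + U, RA + II
jaruri = "\U00011811\U00011824\U0001182F\U00011824\U0001182E"
print(break_opportunities(jaruri))
```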


3.17 Collation

The primary collating order for Dogra is as follows:

U+11800 < U+11801 < U+11802 < U+11803 < U+11804 < U+11805 < U+11806 < U+11807 < U+11808 < U+11809 <

U+1180A < U+1180B < U+1180C < U+1180D < U+1180E < U+1180F < U+11810 < U+11811 <

U+11812 < U+11813 < U+11814 < U+11815 < U+11816 < U+11817 < U+11818 < U+11819 <

U+1181A < U+1181B < U+1181C < U+1181D < U+1181E < U+1181F < U+11820 < U+11821 <

U+11822 < U+11823 < U+11824 < U+11825 < U+11826 < U+11827 < U+11828 < U+11829 < U+1182A <

U+1182B < U+1182C < U+1182D < U+1182E <

U+1182F < U+11830 < U+11831 < U+11832 <

U+11833 < U+11834 < U+11835

The following characters have secondary weights: U+11836 DOGRA SIGN ANUSVARA, U+11837 DOGRA SIGN VISARGA, U+11838 DOGRA SIGN VIRAMA, U+11839 DOGRA SIGN NUKTA.
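The two-level ordering can be approximated with a simple sort key. The Python sketch below is a deliberately simplified model, not the Unicode Collation Algorithm: it assumes the proposal's code points, treats the primary order as ascending code point order over U+11800..U+11835, and uses the four signs with secondary weights only to break ties. The function name `sort_key` is hypothetical.

```python
PRIMARY_FIRST, PRIMARY_LAST = 0x11800, 0x11835
SECONDARY = {0x11836, 0x11837, 0x11838, 0x11839}  # anusvara, visarga, virama, nukta


def sort_key(word: str):
    """Build a (primary, secondary) key for a Dogra string.

    Primary: code points of letters and vowel signs, in order.
    Secondary: (position, code point) of each sign with a secondary weight,
    consulted only when the primary keys are equal.
    """
    primary = []
    secondary = []
    for ch in word:
        cp = ord(ch)
        if PRIMARY_FIRST <= cp <= PRIMARY_LAST:
            primary.append(cp)
        elif cp in SECONDARY:
            secondary.append((len(primary), cp))
    return (primary, secondary)


KA, GA, MA = "\U0001180A", "\U0001180C", "\U00011822"
ANUSVARA = "\U00011836"

kam_ga = KA + ANUSVARA + GA  # KA, ANUSVARA, GA
ka_ma = KA + MA              # KA, MA

# Plain code point order puts ka_ma first because ANUSVARA sorts after MA;
# the two-level key ignores ANUSVARA at the primary level, so GA < MA decides.
print(sorted([ka_ma, kam_ga]) == [ka_ma, kam_ga])
print(sorted([ka_ma, kam_ga], key=sort_key) == [kam_ga, ka_ma])
```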

4 Character data

4.1 Character properties

The properties for Dogra in the Unicode Character Database format are:

11800;DOGRA LETTER A;Lo;0;L;;;;;N;;;;;
11801;DOGRA LETTER AA;Lo;0;L;;;;;N;;;;;
11802;DOGRA LETTER I;Lo;0;L;;;;;N;;;;;
11804;DOGRA LETTER U;Lo;0;L;;;;;N;;;;;
11806;DOGRA LETTER E;Lo;0;L;;;;;N;;;;;
11807;DOGRA LETTER AI;Lo;0;L;;;;;N;;;;;
11808;DOGRA LETTER O;Lo;0;L;;;;;N;;;;;
1180A;DOGRA LETTER KA;Lo;0;L;;;;;N;;;;;
1180B;DOGRA LETTER KHA;Lo;0;L;;;;;N;;;;;
1180C;DOGRA LETTER GA;Lo;0;L;;;;;N;;;;;
1180D;DOGRA LETTER GHA;Lo;0;L;;;;;N;;;;;
1180E;DOGRA LETTER NGA;Lo;0;L;;;;;N;;;;;
1180F;DOGRA LETTER CA;Lo;0;L;;;;;N;;;;;
11810;DOGRA LETTER CHA;Lo;0;L;;;;;N;;;;;
11811;DOGRA LETTER JA;Lo;0;L;;;;;N;;;;;
11812;DOGRA LETTER JHA;Lo;0;L;;;;;N;;;;;
11813;DOGRA LETTER NYA;Lo;0;L;;;;;N;;;;;
11814;DOGRA LETTER TTA;Lo;0;L;;;;;N;;;;;
11815;DOGRA LETTER TTHA;Lo;0;L;;;;;N;;;;;
11816;DOGRA LETTER DDA;Lo;0;L;;;;;N;;;;;
11817;DOGRA LETTER DDHA;Lo;0;L;;;;;N;;;;;
11818;DOGRA LETTER NNA;Lo;0;L;;;;;N;;;;;
11819;DOGRA LETTER TA;Lo;0;L;;;;;N;;;;;
1181A;DOGRA LETTER THA;Lo;0;L;;;;;N;;;;;
1181B;DOGRA LETTER DA;Lo;0;L;;;;;N;;;;;
1181C;DOGRA LETTER DHA;Lo;0;L;;;;;N;;;;;
1181D;DOGRA LETTER NA;Lo;0;L;;;;;N;;;;;
1181E;DOGRA LETTER PA;Lo;0;L;;;;;N;;;;;
1181F;DOGRA LETTER PHA;Lo;0;L;;;;;N;;;;;
11820;DOGRA LETTER BA;Lo;0;L;;;;;N;;;;;
11821;DOGRA LETTER BHA;Lo;0;L;;;;;N;;;;;
11822;DOGRA LETTER MA;Lo;0;L;;;;;N;;;;;
11823;DOGRA LETTER YA;Lo;0;L;;;;;N;;;;;
11824;DOGRA LETTER RA;Lo;0;L;;;;;N;;;;;
11825;DOGRA LETTER LA;Lo;0;L;;;;;N;;;;;
11826;DOGRA LETTER VA;Lo;0;L;;;;;N;;;;;
11827;DOGRA LETTER SHA;Lo;0;L;;;;;N;;;;;
11828;DOGRA LETTER SSA;Lo;0;L;;;;;N;;;;;
11829;DOGRA LETTER SA;Lo;0;L;;;;;N;;;;;
1182A;DOGRA LETTER HA;Lo;0;L;;;;;N;;;;;
1182B;DOGRA LETTER RRA;Lo;0;L;;;;;N;;;;;
1182C;DOGRA VOWEL SIGN AA;Mc;0;L;;;;;N;;;;;
1182D;DOGRA VOWEL SIGN I;Mc;0;L;;;;;N;;;;;
1182E;DOGRA VOWEL SIGN II;Mc;0;L;;;;;N;;;;;
1182F;DOGRA VOWEL SIGN U;Mn;0;NSM;;;;;N;;;;;
11830;DOGRA VOWEL SIGN UU;Mn;0;NSM;;;;;N;;;;;
11831;DOGRA VOWEL SIGN VOCALIC R;Mn;0;NSM;;;;;N;;;;;
11832;DOGRA VOWEL SIGN E;Mn;0;NSM;;;;;N;;;;;
11833;DOGRA VOWEL SIGN AI;Mn;0;NSM;;;;;N;;;;;
11834;DOGRA VOWEL SIGN O;Mn;0;NSM;;;;;N;;;;;
11835;DOGRA VOWEL SIGN AU;Mn;0;NSM;;;;;N;;;;;
11836;DOGRA SIGN ANUSVARA;Mn;0;NSM;;;;;N;;;;;
11837;DOGRA SIGN VISARGA;Mc;0;L;;;;;N;;;;;
11838;DOGRA SIGN VIRAMA;Mn;9;NSM;;;;;N;;;;;
11839;DOGRA SIGN NUKTA;Mn;7;NSM;;;;;N;;;;;
1183A;DOGRA ABBREVIATION SIGN;Po;0;L;;;;;N;;;;;
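Records in the UnicodeData.txt format are semicolon-delimited, with the code point, name, General_Category, Canonical_Combining_Class, and Bidi_Class in the first five fields. The Python sketch below parses a few sample records whose values are taken from the list above; the `CharRecord` type and `parse` function are hypothetical names used for illustration.

```python
from typing import List, NamedTuple


class CharRecord(NamedTuple):
    code_point: int
    name: str
    general_category: str
    combining_class: int
    bidi_class: str


# Sample records in UnicodeData.txt layout, using this proposal's values.
SAMPLE = """\
1180A;DOGRA LETTER KA;Lo;0;L;;;;;N;;;;;
1182F;DOGRA VOWEL SIGN U;Mn;0;NSM;;;;;N;;;;;
11838;DOGRA SIGN VIRAMA;Mn;9;NSM;;;;;N;;;;;
11839;DOGRA SIGN NUKTA;Mn;7;NSM;;;;;N;;;;;
"""


def parse(data: str) -> List[CharRecord]:
    """Parse UnicodeData-style lines into CharRecord tuples."""
    records = []
    for line in data.splitlines():
        if not line.strip():
            continue
        fields = line.split(";")
        records.append(
            CharRecord(int(fields[0], 16), fields[1], fields[2],
                       int(fields[3]), fields[4])
        )
    return records


records = parse(SAMPLE)
for rec in records:
    print(f"U+{rec.code_point:04X} {rec.name}: ccc={rec.combining_class}")
```

The nonzero combining classes (9 for the virama, 7 for the nukta) are the values that affect canonical reordering of these marks.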

4.2 Linebreaking

Linebreaking properties, given in the data format of LineBreak.txt:

11800..1182B;AL # Lo [44] DOGRA LETTER A..DOGRA LETTER RRA
1182C..1182E;CM # Mc  [3] DOGRA VOWEL SIGN AA..DOGRA VOWEL SIGN II
1182F..11835;CM # Mn  [7] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN AU
11836;CM # Mn      DOGRA SIGN ANUSVARA
11837;CM # Mc      DOGRA SIGN VISARGA
11838..11839;CM # Mn  [2] DOGRA SIGN VIRAMA..DOGRA SIGN NUKTA
1183A;AL # Po      DOGRA ABBREVIATION SIGN

4.3 Syllabic categories

Syllabic categories, given in the format of IndicSyllabicCategory.txt:

# Indic_Syllabic_Category=Bindu
11836 ; Bindu # Mn DOGRA SIGN ANUSVARA

# Indic_Syllabic_Category=Visarga
11837 ; Visarga # Mc DOGRA SIGN VISARGA

# Indic_Syllabic_Category=Nukta
11839 ; Nukta # Mn DOGRA SIGN NUKTA

# Indic_Syllabic_Category=Virama
11838 ; Virama # Mn DOGRA SIGN VIRAMA

# Indic_Syllabic_Category=Vowel_Independent
11800..11808 ; Vowel_Independent # Lo [3] DOGRA LETTER A..DOGRA LETTER AU

# Indic_Syllabic_Category=Vowel_Dependent
1182C..1182E ; Vowel_Dependent # Mc [3] DOGRA VOWEL SIGN AA..DOGRA VOWEL SIGN II
1182F..11835 ; Vowel_Dependent # Mn [7] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN AU

# Indic_Syllabic_Category=Consonant
1180A..1182B ; Consonant # Lo [34] DOGRA LETTER KA..DOGRA LETTER RRA

4.4 Positional categories

Positional data for characters, in the format of IndicPositionalCategory.txt:

# Indic_Positional_Category=Right
1182C ; Right # Mc DOGRA VOWEL SIGN AA
1182E ; Right # Mc DOGRA VOWEL SIGN II
11837 ; Right # Mc DOGRA SIGN VISARGA

# Indic_Positional_Category=Left
1182D ; Left # Mc DOGRA VOWEL SIGN I

# Indic_Positional_Category=Top
11836 ; Top # Mn DOGRA SIGN ANUSVARA
11832..11835 ; Top # Mn [4] DOGRA VOWEL SIGN E..DOGRA VOWEL SIGN AU

# Indic_Positional_Category=Bottom
1182F..11831 ; Bottom # Mn [3] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN VOCALIC R
11838 ; Bottom # Mn DOGRA SIGN VIRAMA
11839 ; Bottom # Mn DOGRA SIGN NUKTA

4.5 Script extensions

The following characters should be extended for usage with the Dogra script in ScriptExtensions.txt:

0964        Po       DEVANAGARI DANDA
0965        Po       DEVANAGARI DOUBLE DANDA
0966..096F  Nd [10]  DEVANAGARI DIGIT ZERO..DEVANAGARI DIGIT NINE
A830..A835  No  [6]  NORTH INDIC FRACTION ONE QUARTER..NORTH INDIC FRACTION THREE SIXTEENTHS
A836..A837  So  [2]  NORTH INDIC QUARTER MARK..NORTH INDIC PLACEHOLDER MARK
A838        Sc       NORTH INDIC RUPEE MARK
A839        So       NORTH INDIC QUANTITY MARK
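One practical consequence of these extensions is that a validator for Dogra text should accept the listed Devanagari and Common Indic Number Forms characters alongside the Dogra block itself. The Python sketch below shows such a check; the block range 11800..1184F and the extension ranges come from this proposal, while the function name `usable_with_dogra` is a hypothetical choice.

```python
# Proposed Dogra block (11800..1184F) and the characters this proposal
# extends for use with Dogra.
DOGRA_BLOCK = range(0x11800, 0x11850)
EXTENDED_RANGES = [
    range(0x0964, 0x0966),  # DEVANAGARI DANDA, DOUBLE DANDA
    range(0x0966, 0x0970),  # DEVANAGARI DIGIT ZERO..NINE
    range(0xA830, 0xA83A),  # NORTH INDIC fractions and marks
]


def usable_with_dogra(ch: str) -> bool:
    """Return True if `ch` is in the Dogra block or one of its extensions."""
    cp = ord(ch)
    return cp in DOGRA_BLOCK or any(cp in r for r in EXTENDED_RANGES)


for sample in ("\U0001180A", "\u0964", "\u0967", "\uA838", "\u0970", "A"):
    print(f"U+{ord(sample):04X}", usable_with_dogra(sample))
```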

4.6 ‘Confusable’ characters

Some Dogra characters that resemble characters encoded in other script blocks are:

DOGRA LETTER U            11187  SHARADA LETTER U
DOGRA LETTER UU           11188  SHARADA LETTER UU
DOGRA LETTER O            11684  TAKRI LETTER U
DOGRA LETTER KA           0A15   GURMUKHI LETTER KA
DOGRA LETTER CA           11196  SHARADA LETTER CA
DOGRA LETTER TTHA         1119C  SHARADA LETTER TTHA
DOGRA LETTER TA           11699  TAKRI LETTER TA
DOGRA LETTER DHA          11680  TAKRI LETTER A
DOGRA LETTER PHA          1169F  TAKRI LETTER PHA
DOGRA LETTER BHA          116A1  TAKRI LETTER BHA
DOGRA LETTER YA           116A3  TAKRI LETTER YA
DOGRA LETTER SHA          0936   DEVANAGARI LETTER SHA
DOGRA LETTER SSA          111B0  SHARADA LETTER SSA
DOGRA VOWEL SIGN U        0941   DEVANAGARI VOWEL SIGN U
DOGRA VOWEL SIGN UU       0942   DEVANAGARI VOWEL SIGN UU
DOGRA VOWEL SIGN E        116B2  TAKRI VOWEL SIGN E
DOGRA VOWEL SIGN AI       116B3  TAKRI VOWEL SIGN AI
DOGRA VOWEL SIGN O        116B4  TAKRI VOWEL SIGN O
DOGRA VOWEL SIGN AU       116B5  TAKRI VOWEL SIGN AU
DOGRA SIGN ANUSVARA       0902   DEVANAGARI SIGN ANUSVARA
DOGRA SIGN VIRAMA         094D   DEVANAGARI SIGN VIRAMA
DOGRA SIGN NUKTA          093C   DEVANAGARI SIGN NUKTA
DOGRA ABBREVIATION SIGN   0970   DEVANAGARI ABBREVIATION SIGN

5 References

Bhāskarācārya. 1872. Līlavatī. Translation into Dogri. Jammu: Vidya Vilas Press.

Grierson, George A. 1916. The Linguistic Survey of India. Vol. IX. Indo-Aryan Family. Central Group. Part I. Specimens of Western Hindī and Pañjābī. Calcutta: Office of the Superintendent of Government Printing, India.

Leitner, Gottlieb William. 1882. “A Collection of Specimens of Commercial and Other Alphabets and Handwritings as also of Multiplication Tables Current in Various Parts of the Panjab, Sind and the North West Provinces”. Published as Appendix V of History of Indigenous Education in the Punjab. Lahore: Anjuman-i-Punjab Press.

Pandey, Anshuman. 2007. “Proposal to Encode North Indic Number Forms in ISO/IEC 10646” (L2/07-354). http://www.unicode.org/L2/L2007/07354-north-indic.pdf

Pandey, Anshuman. 2009. “Proposal to Encode the Takri Script in ISO/IEC 10646” (L2/09-424). http://www.unicode.org/L2/L2009/09424-takri.pdf

Pandey, Anshuman. 2015. “Preliminary Proposal to Encode the Dogra Script in Unicode” (L2/15-213). http://www.unicode.org/L2/L2015/15213-dogra.pdf

Staal, Fritz. 1984. Stamps of Jammu & Kashmir. New York: The Collectors Club.

von der Lin, Carol A. “Collection Kashmir”. “Philatelic Glossary Dōgrī-Takrī Scripts”. httpwwwkashmirstampscomDogriGlosshtml

6 Acknowledgments

I would like to thank Christopher Shackle (School of Oriental and African Studies, London) for reviewing this proposal.

This project was made possible in part through a Google Research Award, granted to Deborah Anderson for the Script Encoding Initiative, and a grant from the United States National Endowment for the Humanities (PR-50205-15), which funds the Universal Scripts Project (part of the Script Encoding Initiative at the University of California, Berkeley). Any views, findings, conclusions or recommendations expressed in this publication do not necessarily reflect those of Google or the National Endowment for the Humanities.

[Code chart: Dogra, range 11800–1184F. Printed using UniBook™ (http://www.unicode.org/unibook/), 04-Nov-2015. The names list for 11800–1183A follows.]

Punctuation
1183A  DOGRA ABBREVIATION SIGN

Independent vowels
11800  DOGRA LETTER A
11801  DOGRA LETTER AA
11802  DOGRA LETTER I
11803  DOGRA LETTER II
11804  DOGRA LETTER U
11805  DOGRA LETTER UU
11806  DOGRA LETTER E
11807  DOGRA LETTER AI
11808  DOGRA LETTER O
11809  DOGRA LETTER AU

Consonants
1180A  DOGRA LETTER KA
1180B  DOGRA LETTER KHA
1180C  DOGRA LETTER GA
1180D  DOGRA LETTER GHA
1180E  DOGRA LETTER NGA
1180F  DOGRA LETTER CA
11810  DOGRA LETTER CHA
11811  DOGRA LETTER JA
11812  DOGRA LETTER JHA
11813  DOGRA LETTER NYA
11814  DOGRA LETTER TTA
11815  DOGRA LETTER TTHA
11816  DOGRA LETTER DDA
11817  DOGRA LETTER DDHA
11818  DOGRA LETTER NNA
11819  DOGRA LETTER TA
1181A  DOGRA LETTER THA
1181B  DOGRA LETTER DA
1181C  DOGRA LETTER DHA
1181D  DOGRA LETTER NA
1181E  DOGRA LETTER PA
1181F  DOGRA LETTER PHA
11820  DOGRA LETTER BA
11821  DOGRA LETTER BHA
11822  DOGRA LETTER MA
11823  DOGRA LETTER YA
11824  DOGRA LETTER RA
11825  DOGRA LETTER LA
11826  DOGRA LETTER VA
11827  DOGRA LETTER SHA
11828  DOGRA LETTER SSA
11829  DOGRA LETTER SA
1182A  DOGRA LETTER HA
1182B  DOGRA LETTER RRA

Dependent vowel signs
1182C  DOGRA VOWEL SIGN AA
1182D  DOGRA VOWEL SIGN I
1182E  DOGRA VOWEL SIGN II
1182F  DOGRA VOWEL SIGN U
11830  DOGRA VOWEL SIGN UU
11831  DOGRA VOWEL SIGN VOCALIC R
11832  DOGRA VOWEL SIGN E
11833  DOGRA VOWEL SIGN AI
11834  DOGRA VOWEL SIGN O
11835  DOGRA VOWEL SIGN AU

Various signs
11836  DOGRA SIGN ANUSVARA
11837  DOGRA SIGN VISARGA
11838  DOGRA SIGN VIRAMA
11839  DOGRA SIGN NUKTA

Figure 1: Comparison of Old Dogra with Takri and other scripts (from Leitner 1882: Set 2).

Figure 2: Comparison of Old Dogra with Takri and other scripts (from Leitner 1882: Set 2).

Figure 3: Specimen of the Dogri language written in Old Dogra (from Grierson 1916: 760).

Figure 4: Another specimen of the Dogri language written in Old Dogra (from Grierson 1916: 772).

Figure 5: Chart of the Old Dogra script (from Grierson 1916: 641).

Figure 6: Comparison of Gurmukhi, Kangra, and Old Dogra scripts (from Grierson 1916: 642).

Figure 7: Comparison of New Dogra (‘Dogri’) and Devanagari vowel and consonant letters (from Staal 1984: 33).

Figure 8: Comparison of New Dogra (‘Dogri’) and Devanagari vowel signs (from Staal 1984: 34).


12 Does the proposal contain characters with any special properties such as control function or similar semantics Yes If YES describe in detail (include attachment if necessary) Virama

see text of proposal 13 Does the proposal contain any Ideographic compatibility characters No If YES are the equivalent corresponding unified ideographic characters identified If YES reference


A hyphen is used in some cases to indicate a break in a word at the end of a line (see section 3.16).

3.14 Digits

Digits are commonly used in Dogra documents (see figure 11). The style of digits varies across written and printed sources. Some digits in Old Dogra sources resemble those of Takri (see figure 4), while those in New Dogra sources are similar to Devanagari. The excerpt below shows digits 1–9 and the number 10 as represented in the Līlavatī, written in New Dogra.

[Excerpt from the Līlavatī showing the digits 1–9 and the number 10]

The majority of the digits shown here resemble Devanagari forms; the exception is ‘four’. The form of Dogra ‘four’ does not resemble U+116C4 TAKRI DIGIT FOUR, which is nearly identical to the Devanagari digit four. Its origins are not known at this time, but it may be related to U+111D4 SHARADA DIGIT FOUR. The forms of ‘eight’ and ‘nine’ are well-known variants of the regular Devanagari digits.

Additional research into Dogra digits is required before they may be considered for encoding. The available sources suggest that there may have been a cultural understanding that Dogra possessed unique digits. For instance, on the bank notes shown in figure 20, the digits written in Devanagari are rather similar, but are nonetheless displayed twice in order to align the display of the currency value with each script used for the inscriptions.

At present, script-specific digits are not proposed for inclusion in the Dogra repertoire. Devanagari digits should be used normatively and should be extended for usage with Dogra. If future research yields distinctive digits for Dogra, there is sufficient space in the block to accommodate their encoding.
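The recommendation to use Devanagari digits with Dogra can be illustrated with a minimal Python sketch. The helper name `to_devanagari_digits` is hypothetical and not part of this proposal; it only maps each decimal digit onto U+0966 through U+096F.

```python
# Hypothetical helper (not part of the proposal): render a non-negative
# integer with the Devanagari digits U+0966..U+096F, which this proposal
# recommends for use with Dogra text.
DEVANAGARI_DIGIT_ZERO = 0x0966


def to_devanagari_digits(n: int) -> str:
    if n < 0:
        raise ValueError("this sketch handles non-negative integers only")
    # str(n) yields ASCII digits; shift each one into the Devanagari range.
    return "".join(chr(DEVANAGARI_DIGIT_ZERO + int(d)) for d in str(n))
```

For example, the samvat year 1923 shown on the stamps in figure 14 would be produced by `to_devanagari_digits(1923)`.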

3.15 Number forms

Fraction signs and currency marks are commonly used in Dogra sources (see figure 12). These may be represented using the fraction and currency signs already encoded in the ‘Common Indic Number Forms’ block (U+A830). These characters are specified as extensions for Dogra. The ‘Common Indic Number Forms’ are described briefly in The Unicode Standard (chapter 22.3, ‘Numerals’, pp. 770–771) and in more detail in the original encoding proposal (L2/07-354).

3.16 Linebreaking

Linebreaking occurs after an orthographic syllable. Breaks are generally not indicated in informal sources, but are marked in literary materials using hyphens.


3.17 Collation

The primary collating order for Dogra is as follows:

U+11800 < U+11801 < U+11802 < U+11803 < U+11804 < U+11805 < U+11806 < U+11807 < U+11808 < U+11809 <

U+1180A < U+1180B < U+1180C < U+1180D < U+1180E < U+1180F < U+11810 < U+11811 <

U+11812 < U+11813 < U+11814 < U+11815 < U+11816 < U+11817 < U+11818 < U+11819 <

U+1181A < U+1181B < U+1181C < U+1181D < U+1181E < U+1181F < U+11820 < U+11821 <

U+11822 < U+11823 < U+11824 < U+11825 < U+11826 < U+11827 < U+11828 < U+11829 < U+1182A <

U+1182B < U+1182C < U+1182D < U+1182E <

U+1182F < U+11830 < U+11831 < U+11832 <

U+11833 < U+11834 < U+11835

The following characters have secondary weights: U+11836, U+11837, U+11838, U+11839.
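The ordering above can be sketched as a two-level sort key in Python. This is an illustration under stated assumptions, not a conformant collation implementation: primary weights are taken to follow code point order for U+11800 through U+11835, the four signs U+11836 through U+11839 contribute only secondary weights, and all other characters are ignored. The function name `dogra_sort_key` is hypothetical.

```python
# Assumption: primary order equals code point order for U+11800..U+11835;
# U+11836..U+11839 (anusvara, visarga, virama, nukta) are secondary only.
PRIMARY = range(0x11800, 0x11836)
SECONDARY = range(0x11836, 0x1183A)


def dogra_sort_key(text: str):
    primary = []
    secondary = []
    for ch in text:
        cp = ord(ch)
        if cp in PRIMARY:
            primary.append(cp - PRIMARY.start + 1)
        elif cp in SECONDARY:
            # Record where the sign occurs so that it only breaks ties
            # between strings whose primary weights are identical.
            secondary.append((len(primary), cp - SECONDARY.start + 1))
        # Characters outside the Dogra repertoire are ignored in this sketch.
    return (tuple(primary), tuple(secondary))
```

Passing this function as `key=` to `sorted` compares all primary weights first and consults the secondary signs only on a tie, so a string containing an anusvara sorts next to the same string without it rather than after every letter.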

4 Character data

4.1 Character properties

The properties for Dogra in the Unicode Character Database format are:

11800;DOGRA LETTER A;Lo;0;L;;;;;N;;;;;
11801;DOGRA LETTER AA;Lo;0;L;;;;;N;;;;;
11802;DOGRA LETTER I;Lo;0;L;;;;;N;;;;;
11804;DOGRA LETTER U;Lo;0;L;;;;;N;;;;;
11806;DOGRA LETTER E;Lo;0;L;;;;;N;;;;;
11807;DOGRA LETTER AI;Lo;0;L;;;;;N;;;;;
11808;DOGRA LETTER O;Lo;0;L;;;;;N;;;;;
1180A;DOGRA LETTER KA;Lo;0;L;;;;;N;;;;;
1180B;DOGRA LETTER KHA;Lo;0;L;;;;;N;;;;;
1180C;DOGRA LETTER GA;Lo;0;L;;;;;N;;;;;
1180D;DOGRA LETTER GHA;Lo;0;L;;;;;N;;;;;
1180E;DOGRA LETTER NGA;Lo;0;L;;;;;N;;;;;
1180F;DOGRA LETTER CA;Lo;0;L;;;;;N;;;;;
11810;DOGRA LETTER CHA;Lo;0;L;;;;;N;;;;;
11811;DOGRA LETTER JA;Lo;0;L;;;;;N;;;;;
11812;DOGRA LETTER JHA;Lo;0;L;;;;;N;;;;;
11813;DOGRA LETTER NYA;Lo;0;L;;;;;N;;;;;
11814;DOGRA LETTER TTA;Lo;0;L;;;;;N;;;;;
11815;DOGRA LETTER TTHA;Lo;0;L;;;;;N;;;;;
11816;DOGRA LETTER DDA;Lo;0;L;;;;;N;;;;;
11817;DOGRA LETTER DDHA;Lo;0;L;;;;;N;;;;;
11818;DOGRA LETTER NNA;Lo;0;L;;;;;N;;;;;
11819;DOGRA LETTER TA;Lo;0;L;;;;;N;;;;;
1181A;DOGRA LETTER THA;Lo;0;L;;;;;N;;;;;
1181B;DOGRA LETTER DA;Lo;0;L;;;;;N;;;;;
1181C;DOGRA LETTER DHA;Lo;0;L;;;;;N;;;;;
1181D;DOGRA LETTER NA;Lo;0;L;;;;;N;;;;;
1181E;DOGRA LETTER PA;Lo;0;L;;;;;N;;;;;
1181F;DOGRA LETTER PHA;Lo;0;L;;;;;N;;;;;
11820;DOGRA LETTER BA;Lo;0;L;;;;;N;;;;;
11821;DOGRA LETTER BHA;Lo;0;L;;;;;N;;;;;
11822;DOGRA LETTER MA;Lo;0;L;;;;;N;;;;;
11823;DOGRA LETTER YA;Lo;0;L;;;;;N;;;;;
11824;DOGRA LETTER RA;Lo;0;L;;;;;N;;;;;
11825;DOGRA LETTER LA;Lo;0;L;;;;;N;;;;;
11826;DOGRA LETTER VA;Lo;0;L;;;;;N;;;;;
11827;DOGRA LETTER SHA;Lo;0;L;;;;;N;;;;;
11828;DOGRA LETTER SSA;Lo;0;L;;;;;N;;;;;
11829;DOGRA LETTER SA;Lo;0;L;;;;;N;;;;;
1182A;DOGRA LETTER HA;Lo;0;L;;;;;N;;;;;
1182B;DOGRA LETTER RRA;Lo;0;L;;;;;N;;;;;
1182C;DOGRA VOWEL SIGN AA;Mc;0;L;;;;;N;;;;;
1182D;DOGRA VOWEL SIGN I;Mc;0;L;;;;;N;;;;;
1182E;DOGRA VOWEL SIGN II;Mc;0;L;;;;;N;;;;;
1182F;DOGRA VOWEL SIGN U;Mn;0;NSM;;;;;N;;;;;
11830;DOGRA VOWEL SIGN UU;Mn;0;NSM;;;;;N;;;;;
11831;DOGRA VOWEL SIGN VOCALIC R;Mn;0;NSM;;;;;N;;;;;
11832;DOGRA VOWEL SIGN E;Mn;0;NSM;;;;;N;;;;;
11833;DOGRA VOWEL SIGN AI;Mn;0;NSM;;;;;N;;;;;
11834;DOGRA VOWEL SIGN O;Mn;0;NSM;;;;;N;;;;;
11835;DOGRA VOWEL SIGN AU;Mn;0;NSM;;;;;N;;;;;
11836;DOGRA SIGN ANUSVARA;Mn;0;NSM;;;;;N;;;;;
11837;DOGRA SIGN VISARGA;Mc;0;L;;;;;N;;;;;
11838;DOGRA SIGN VIRAMA;Mn;9;NSM;;;;;N;;;;;
11839;DOGRA SIGN NUKTA;Mn;7;NSM;;;;;N;;;;;
1183A;DOGRA ABBREVIATION SIGN;Po;0;L;;;;;N;;;;;
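Records of this kind can be read mechanically. The following Python sketch assumes the standard semicolon-delimited, 15-field layout of UnicodeData.txt and extracts only the first five fields; the names `UcdRecord` and `parse_unicode_data_line` are hypothetical, and the three sample lines are taken from the properties listed for Dogra in this section.

```python
from typing import NamedTuple


class UcdRecord(NamedTuple):
    code_point: int
    name: str
    general_category: str
    combining_class: int
    bidi_class: str


def parse_unicode_data_line(line: str) -> UcdRecord:
    # Assumption: UnicodeData.txt layout, 15 fields separated by ';'.
    fields = line.strip().split(";")
    if len(fields) != 15:
        raise ValueError(f"expected 15 fields, got {len(fields)}")
    return UcdRecord(
        code_point=int(fields[0], 16),
        name=fields[1],
        general_category=fields[2],
        combining_class=int(fields[3]),
        bidi_class=fields[4],
    )


SAMPLE_LINES = [
    "11800;DOGRA LETTER A;Lo;0;L;;;;;N;;;;;",
    "1182F;DOGRA VOWEL SIGN U;Mn;0;NSM;;;;;N;;;;;",
    "11838;DOGRA SIGN VIRAMA;Mn;9;NSM;;;;;N;;;;;",
]
records = [parse_unicode_data_line(line) for line in SAMPLE_LINES]
```

The virama record is the one to watch in practice: its canonical combining class of 9 and the nukta's class of 7 are the only non-zero values in the repertoire.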

4.2 Linebreaking

Linebreaking properties, given in the data format of LineBreak.txt:

11800..1182B;AL # Lo [44] DOGRA LETTER A..DOGRA LETTER RRA
1182C..1182E;CM # Mc  [3] DOGRA VOWEL SIGN AA..DOGRA VOWEL SIGN II
1182F..11835;CM # Mn  [7] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN AU
11836;CM        # Mn      DOGRA SIGN ANUSVARA
11837;CM        # Mc      DOGRA SIGN VISARGA
11838..11839;CM # Mn  [2] DOGRA SIGN VIRAMA..DOGRA SIGN NUKTA
1183A;AL        # Po      DOGRA ABBREVIATION SIGN

4.3 Syllabic categories

Syllabic categories, given in the format of IndicSyllabicCategory.txt:

# Indic_Syllabic_Category=Bindu
11836        ; Bindu             # Mn      DOGRA SIGN ANUSVARA

# Indic_Syllabic_Category=Visarga
11837        ; Visarga           # Mc      DOGRA SIGN VISARGA

# Indic_Syllabic_Category=Nukta
11839        ; Nukta             # Mn      DOGRA SIGN NUKTA

# Indic_Syllabic_Category=Virama
11838        ; Virama            # Mn      DOGRA SIGN VIRAMA

# Indic_Syllabic_Category=Vowel_Independent
11800..11808 ; Vowel_Independent # Lo  [3] DOGRA LETTER A..DOGRA LETTER AU

# Indic_Syllabic_Category=Vowel_Dependent
1182C..1182E ; Vowel_Dependent   # Mc  [3] DOGRA VOWEL SIGN AA..DOGRA VOWEL SIGN II
1182F..11835 ; Vowel_Dependent   # Mn  [7] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN AU

# Indic_Syllabic_Category=Consonant
1180A..1182B ; Consonant         # Lo [34] DOGRA LETTER KA..DOGRA LETTER RRA

4.4 Positional categories

Positional data for characters, in the format of IndicPositionalCategory.txt:

# Indic_Positional_Category=Right
1182C        ; Right  # Mc     DOGRA VOWEL SIGN AA
1182E        ; Right  # Mc     DOGRA VOWEL SIGN II
11837        ; Right  # Mc     DOGRA SIGN VISARGA

# Indic_Positional_Category=Left
1182D        ; Left   # Mc     DOGRA VOWEL SIGN I

# Indic_Positional_Category=Top
11836        ; Top    # Mn     DOGRA SIGN ANUSVARA
11832..11835 ; Top    # Mn [4] DOGRA VOWEL SIGN E..DOGRA VOWEL SIGN AU

# Indic_Positional_Category=Bottom
1182F..11831 ; Bottom # Mn [3] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN VOCALIC R
11838        ; Bottom # Mn     DOGRA SIGN VIRAMA
11839        ; Bottom # Mn     DOGRA SIGN NUKTA
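A rendering or validation tool might consult these values through a simple range table. The Python sketch below transcribes the positional data from this section; the table name `POSITIONAL_RANGES` and the function `positional_category` are hypothetical, and characters without a listed position return `None`.

```python
# Ranges transcribed from the positional data in section 4.4.
# Each entry is (first code point, last code point, category).
POSITIONAL_RANGES = [
    (0x1182C, 0x1182C, "Right"),   # VOWEL SIGN AA
    (0x1182D, 0x1182D, "Left"),    # VOWEL SIGN I
    (0x1182E, 0x1182E, "Right"),   # VOWEL SIGN II
    (0x1182F, 0x11831, "Bottom"),  # VOWEL SIGN U..VOCALIC R
    (0x11832, 0x11835, "Top"),     # VOWEL SIGN E..AU
    (0x11836, 0x11836, "Top"),     # SIGN ANUSVARA
    (0x11837, 0x11837, "Right"),   # SIGN VISARGA
    (0x11838, 0x11839, "Bottom"),  # SIGN VIRAMA, SIGN NUKTA
]


def positional_category(ch: str):
    """Return the positional category of a single character, or None."""
    cp = ord(ch)
    for first, last, category in POSITIONAL_RANGES:
        if first <= cp <= last:
            return category
    return None
```

Only U+1182D is positioned to the left of its base, so it is the one sign whose visual order differs from its logical order.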

4.5 Script extensions

The following characters should be extended for usage with the Dogra script in ScriptExtensions.txt:

0964        # Po      DEVANAGARI DANDA
0965        # Po      DEVANAGARI DOUBLE DANDA

0966..096F  # Nd [10] DEVANAGARI DIGIT ZERO..DEVANAGARI DIGIT NINE

A830..A835  # No  [6] NORTH INDIC FRACTION ONE QUARTER..NORTH INDIC FRACTION THREE SIXTEENTHS

A836..A837  # So  [2] NORTH INDIC QUARTER MARK..NORTH INDIC PLACEHOLDER MARK
A838        # Sc      NORTH INDIC RUPEE MARK
A839        # So      NORTH INDIC QUANTITY MARK
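The effect of these extensions can be sketched as a membership test in Python. This is an illustration only: the function name `usable_with_dogra` is hypothetical, and the Dogra range is taken to be the proposed repertoire U+11800 through U+1183A rather than any final block allocation.

```python
# Proposed Dogra repertoire (assumption: U+11800..U+1183A as listed in
# this proposal) plus the characters extended for use with Dogra.
DOGRA_REPERTOIRE = range(0x11800, 0x1183B)
DOGRA_EXTENSIONS = [
    range(0x0964, 0x0966),  # DEVANAGARI DANDA, DOUBLE DANDA
    range(0x0966, 0x0970),  # DEVANAGARI DIGIT ZERO..NINE
    range(0xA830, 0xA83A),  # Common Indic Number Forms, A830..A839
]


def usable_with_dogra(ch: str) -> bool:
    """True if the character is Dogra or is extended for use with Dogra."""
    cp = ord(ch)
    if cp in DOGRA_REPERTOIRE:
        return True
    return any(cp in ext for ext in DOGRA_EXTENSIONS)
```

A check of this kind treats a Devanagari digit or danda inside Dogra text as expected, while still flagging an ordinary Devanagari letter as a script change.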


4.6 ‘Confusable’ characters

Some Dogra characters that resemble characters encoded in other script blocks are:

DOGRA LETTER U             11187 SHARADA LETTER U
DOGRA LETTER UU            11188 SHARADA LETTER UU
DOGRA LETTER O             11684 TAKRI LETTER U
DOGRA LETTER KA            0A15  GURMUKHI LETTER KA
DOGRA LETTER CA            11196 SHARADA LETTER CA
DOGRA LETTER TTHA          1119C SHARADA LETTER TTHA
DOGRA LETTER TA            11699 TAKRI LETTER TA
DOGRA LETTER DHA           11680 TAKRI LETTER A
DOGRA LETTER PHA           1169F TAKRI LETTER PHA
DOGRA LETTER BHA           116A1 TAKRI LETTER BHA
DOGRA LETTER YA            116A3 TAKRI LETTER YA
DOGRA LETTER SHA           0936  DEVANAGARI LETTER SHA
DOGRA LETTER SSA           111B0 SHARADA LETTER SSA
DOGRA VOWEL SIGN U         0941  DEVANAGARI VOWEL SIGN U
DOGRA VOWEL SIGN UU        0942  DEVANAGARI VOWEL SIGN UU
DOGRA VOWEL SIGN E         116B2 TAKRI VOWEL SIGN E
DOGRA VOWEL SIGN AI        116B3 TAKRI VOWEL SIGN AI
DOGRA VOWEL SIGN O         116B4 TAKRI VOWEL SIGN O
DOGRA VOWEL SIGN AU        116B5 TAKRI VOWEL SIGN AU
DOGRA SIGN ANUSVARA        0902  DEVANAGARI SIGN ANUSVARA
DOGRA SIGN VIRAMA          094D  DEVANAGARI SIGN VIRAMA
DOGRA SIGN NUKTA           093C  DEVANAGARI SIGN NUKTA
DOGRA ABBREVIATION SIGN    0970  DEVANAGARI ABBREVIATION SIGN
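A list of this kind could feed a simple lookup that flags look-alike characters. The Python sketch below uses only five rows from the list above, with Dogra code points taken from the proposed names list; `CONFUSABLE_WITH` and `find_confusables` are hypothetical names, and the table is deliberately incomplete.

```python
# A few rows from the 'confusable' list, keyed by proposed Dogra code point.
# Assumption: this partial table is for illustration only.
CONFUSABLE_WITH = {
    0x1180A: (0x0A15, "GURMUKHI LETTER KA"),            # DOGRA LETTER KA
    0x11827: (0x0936, "DEVANAGARI LETTER SHA"),         # DOGRA LETTER SHA
    0x11838: (0x094D, "DEVANAGARI SIGN VIRAMA"),        # DOGRA SIGN VIRAMA
    0x11839: (0x093C, "DEVANAGARI SIGN NUKTA"),         # DOGRA SIGN NUKTA
    0x1183A: (0x0970, "DEVANAGARI ABBREVIATION SIGN"),  # DOGRA ABBREVIATION SIGN
}


def find_confusables(text: str):
    """List (index, Dogra code point, look-alike code point, look-alike name)."""
    hits = []
    for index, ch in enumerate(text):
        entry = CONFUSABLE_WITH.get(ord(ch))
        if entry is not None:
            other_cp, other_name = entry
            hits.append((index, f"U+{ord(ch):04X}", f"U+{other_cp:04X}", other_name))
    return hits
```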

5 References

Bhāskarācārya. 1872. Līlavatī. Translation into Dogri. Jammu: Vidya Vilas Press.

Grierson, George A. 1916. The Linguistic Survey of India. Vol. IX. Indo-Aryan Family. Central Group. Part I. Specimens of Western Hindī and Pañjābī. Calcutta: Office of the Superintendent of Government Printing, India.

Leitner, Gottlieb William. 1882. “A Collection of Specimens of Commercial and Other Alphabets and Handwritings as also of Multiplication Tables Current in Various Parts of the Panjab, Sind and the North West Provinces”. Published as Appendix V of History of Indigenous Education in the Punjab. Lahore: Anjuman-i-Punjab Press.

Pandey, Anshuman. 2007. “Proposal to Encode North Indic Number Forms in ISO/IEC 10646” (L2/07-354). http://www.unicode.org/L2/L2007/07354-north-indic.pdf

Pandey, Anshuman. 2009. “Proposal to Encode the Takri Script in ISO/IEC 10646” (L2/09-424). http://www.unicode.org/L2/L2009/09424-takri.pdf

Pandey, Anshuman. 2015. “Preliminary Proposal to Encode the Dogra Script in Unicode” (L2/15-213). http://www.unicode.org/L2/L2015/15213-dogra.pdf

Staal, Fritz. 1984. Stamps of Jammu & Kashmir. New York: The Collectors Club.

von der Lin, Carol A. “Collection Kashmir”. “Philatelic Glossary: Dōgrī-Takrī Scripts”. http://www.kashmirstamps.com/DogriGloss.html


6 Acknowledgments

I would like to thank Christopher Shackle (School of Oriental and African Studies, London) for reviewing this proposal.

This project was made possible in part through a Google Research Award, granted to Deborah Anderson for the Script Encoding Initiative, and a grant from the United States National Endowment for the Humanities (PR-50205-15), which funds the Universal Scripts Project (part of the Script Encoding Initiative at the University of California, Berkeley). Any views, findings, conclusions or recommendations expressed in this publication do not necessarily reflect those of Google or the National Endowment for the Humanities.

[Code chart: Dogra, range 11800–1184F. The chart grid shows the proposed characters 11800–1183A. Printed using UniBook™ (http://www.unicode.org/unibook/), 04-Nov-2015.]

Dogra names list, 11800–1183A

Independent vowels
11800 DOGRA LETTER A
11801 DOGRA LETTER AA
11802 DOGRA LETTER I
11803 DOGRA LETTER II
11804 DOGRA LETTER U
11805 DOGRA LETTER UU
11806 DOGRA LETTER E
11807 DOGRA LETTER AI
11808 DOGRA LETTER O
11809 DOGRA LETTER AU

Consonants
1180A DOGRA LETTER KA
1180B DOGRA LETTER KHA
1180C DOGRA LETTER GA
1180D DOGRA LETTER GHA
1180E DOGRA LETTER NGA
1180F DOGRA LETTER CA
11810 DOGRA LETTER CHA
11811 DOGRA LETTER JA
11812 DOGRA LETTER JHA
11813 DOGRA LETTER NYA
11814 DOGRA LETTER TTA
11815 DOGRA LETTER TTHA
11816 DOGRA LETTER DDA
11817 DOGRA LETTER DDHA
11818 DOGRA LETTER NNA
11819 DOGRA LETTER TA
1181A DOGRA LETTER THA
1181B DOGRA LETTER DA
1181C DOGRA LETTER DHA
1181D DOGRA LETTER NA
1181E DOGRA LETTER PA
1181F DOGRA LETTER PHA
11820 DOGRA LETTER BA
11821 DOGRA LETTER BHA
11822 DOGRA LETTER MA
11823 DOGRA LETTER YA
11824 DOGRA LETTER RA
11825 DOGRA LETTER LA
11826 DOGRA LETTER VA
11827 DOGRA LETTER SHA
11828 DOGRA LETTER SSA
11829 DOGRA LETTER SA
1182A DOGRA LETTER HA
1182B DOGRA LETTER RRA

Dependent vowel signs
1182C DOGRA VOWEL SIGN AA
1182D DOGRA VOWEL SIGN I
1182E DOGRA VOWEL SIGN II
1182F DOGRA VOWEL SIGN U
11830 DOGRA VOWEL SIGN UU
11831 DOGRA VOWEL SIGN VOCALIC R
11832 DOGRA VOWEL SIGN E
11833 DOGRA VOWEL SIGN AI
11834 DOGRA VOWEL SIGN O
11835 DOGRA VOWEL SIGN AU

Various signs
11836 DOGRA SIGN ANUSVARA
11837 DOGRA SIGN VISARGA
11838 DOGRA SIGN VIRAMA
11839 DOGRA SIGN NUKTA

Punctuation
1183A DOGRA ABBREVIATION SIGN

Figure 1: Comparison of Old Dogra with Takri and other scripts (from Leitner 1882: Set 2).

Figure 2: Comparison of Old Dogra with Takri and other scripts (from Leitner 1882: Set 2).

Figure 3: Specimen of the Dogri language written in Old Dogra (from Grierson 1916: 760).

Figure 4: Another specimen of the Dogri language written in Old Dogra (from Grierson 1916: 772).

Figure 5: Chart of the Old Dogra script (from Grierson 1916: 641).

Figure 6: Comparison of Gurmukhi, Kangra, and Old Dogra scripts (from Grierson 1916: 642).

Figure 7: Comparison of New Dogra (‘Dogri’) and Devanagari vowel and consonant letters (from Staal 1984: 33).

Figure 8: Comparison of New Dogra (‘Dogri’) and Devanagari vowel signs (from Staal 1984: 34).

Figure 9: The first page of a Dogri translation of the Līlavatī by Bhāskarācārya, a Sanskrit treatise on mathematics, printed in the New Dogra script in 1872 (p. 1). Figure 10 contains the second page of this book.

Figure 10: Another page from the Līlavatī in New Dogra (p. 2).

Figure 11: Page from the Līlavatī showing the use of printed New Dogra digits (p. 3).

Figure 12: Usage of fractions for denoting currency in New Dogra (from Staal 1984: 75). The text appears to be typeset using metal fonts. The fraction signs and currency marks may be represented using characters from the ‘Common Indic Number Forms’ block (U+A830) in Unicode.

Figure 13: Two specimens of a trilingual stamp for expedited mail from Jammu State in Old Dogra, Persian, and Devanagari scripts. The Dogra text reads <U+1180A U+1180A U+11825> <U+11811 U+11824 U+1182F U+11824 U+1182E> kakal jarurī. The Persian reads خط ضروری ḵẖat zarūrī; the Devanagari reads आवश्यक पत्र āvaśyaka patra.

Figure 14: Four bilingual regular postage stamps from Jammu State with text in the Old Dogra and Persian scripts. The stamps include half ‘anna’ stamps and a one ‘anna’ stamp. In all stamps, the Dogra text at the top reads <U+11811 U+11822 U+11804> jamu ‘Jammu’ and <U+1180A U+11810 U+11822 U+1182E U+11824> kachamīr (= kaśmīr) ‘Kashmir’. In the half ‘anna’ stamps, the first line of the center is Persian ۱۹۲۳ نم انه 1923 [samvat] nim ānah ‘half ānnā’; the second line of the center is Dogra <U+11800 U+1181B U+11800 U+1181D> ada ana ‘half ānnā’. In the one ‘anna’ stamp, the first line of the center text is Persian یك انه yak ānah ‘one ānnā’; the second line of the center is Dogra <U+11806 U+1180A U+11800 U+1181D> ekana ‘one ānnā’. In all four stamps, the third line of the center is the date 1923 [samvat] written in Devanagari-like digits १९२३. The Persian text at the bottom of the stamp reads قلمرو سرکار جمون و کشمیر qalamrau sarkār jammūn o kaśmīr ‘dominions of the ruler of Jammu and Kashmir’.

Detail of top left (Devanagari)

Detail of top right (New Dogra)

Detail of center (New Dogra, Devanagari, and Urdu)

Figure 15: The mast of a two rupee non-judicial stamp paper from Kashmir State.

Figure 16: The mast of a five rupee non-judicial stamp paper from Kashmir state containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.

Figure 17: A one anna stamp from Jammu and Kashmir state with text in New Dogra, Devanagari, and Arabic (Urdu).

Figure 18: Telegraph stamps from Jammu and Kashmir state with text in Dogra, Devanagari, Urdu, and English. The Dogra denominations show usage of Common Indic fractions and the rupee mark.

postakarda kalamrau jammu kasmir va tibbataha
“Postcard of the realm of Jammu, Kashmir, and Tibet”

isa tarapha siva pate ke kucha na likho
“Do not write anything on this side except for the address”

jamu “Jammu”; sinagar “Srinagar”

bhaisakh (4) “Vaisakh 4”; 4 vaisakh “4 Vaisakh”

Figure 19: A state postcard from Jammu and Kashmir with post mark of 19 April 1891. The inscription at top is in New Dogra and the content is in Persian. The postmarks are in English and Dogra.

One rupee banknote

Twenty rupee banknote

Figure 20: Bank notes from Jammu and Kashmir State containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.

Vowel letters

Letter  Chamba (Takri)  Dogra
A       U+11680         U+11800
AA      U+11681         U+11801
I       U+11682         U+11802
II      U+11683         (none)
U       U+11684         U+11804
UU      U+11685         (none)
E       U+11686         U+11806
AI      U+11687         U+11807
O       U+11688         U+11808
AU      U+11689         (none)

Vowel signs

Sign    Chamba (Takri)  Dogra
A       no dependent form
AA      U+116AD         U+1182C
I       U+116AE         U+1182D
II      U+116AF         U+1182E
U       U+116B0         U+1182F
UU      U+116B1         U+11830
E       U+116B2         U+11832
AI      U+116B3         U+11833
O       U+116B4         U+11834
AU      U+116B5         U+11835

Table 2: Comparison of Chamba and Dogra vowel letters and signs

Letter  Chamba (Takri)  Dogra
KA      U+1168A         U+1180A
KHA     U+1168B         U+1180B
GA      U+1168C         U+1180C
GHA     U+1168D         U+1180D
NGA     U+1168E         U+1180E
CA      U+1168F         U+1180F
CHA     U+11690         U+11810
JA      U+11691         U+11811
JHA     U+11692         U+11812
NYA     U+11693         U+11813
TTA     U+11694         U+11814
TTHA    U+11695         U+11815
DDA     U+11696         U+11816
DDHA    U+11697         U+11817
NNA     U+11698         U+11818
TA      U+11699         U+11819
THA     U+1169A         U+1181A
DA      U+1169B         U+1181B
DHA     U+1169C         U+1181C
NA      U+1169D         U+1181D
PA      U+1169E         U+1181E
PHA     U+1169F         U+1181F
BA      U+116A0         U+11820
BHA     U+116A1         U+11821
MA      U+116A2         U+11822
YA      U+116A3         U+11823
RA      U+116A4         U+11824
LA      U+116A5         U+11825
VA      U+116A6         U+11826
SHA     U+116A7         U+11827
SSA     (none)          U+11828
SA      U+116A8         U+11829
HA      U+116A9         U+1182A
RRA     U+116AA         U+1182B

Table 3: Comparison of Chamba and Dogra consonant letters

ISO/IEC JTC 1/SC 2/WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646

Please fill all the sections A, B and C below. Please read Principles and Procedures Document (P & P) from http://std.dkuug.dk/JTC1/SC2/WG2/docs/principles.html for guidelines and details before filling this form. Please ensure you are using the latest Form from http://std.dkuug.dk/JTC1/SC2/WG2/docs/summaryform.html. See also http://std.dkuug.dk/JTC1/SC2/WG2/docs/roadmaps.html for latest Roadmaps.

Form number: N4502-F (Original 1994-10-14; Revised 1995-01, 1995-04, 1996-04, 1996-08, 1999-03, 2001-05, 2001-09, 2003-11, 2005-01, 2005-09, 2005-10, 2007-03, 2008-05, 2009-11, 2011-03, 2012-01)

A. Administrative

1. Title: Proposal to encode the Dogra script in Unicode
2. Requester's name: Anshuman Pandey, Script Encoding Initiative
3. Requester type (Member body/Liaison/Individual contribution): Liaison contribution
4. Submission date: 4 November 2015
5. Requester's reference (if applicable):
6. Choose one of the following:
   This is a complete proposal: Yes
   (or) More information will be provided later:

B. Technical – General

1. Choose one of the following:
   a. This proposal is for a new script (set of characters): Yes
      Proposed name of script: Dogra
   b. The proposal is for addition of character(s) to an existing block:
      Name of the existing block:
2. Number of characters in proposal: 59
3. Proposed category (select one from below; see section 2.2 of P&P document):
   A-Contemporary
   B.1-Specialized (small collection)
   B.2-Specialized (large collection)
   C-Major extinct
   D-Attested extinct
   E-Minor extinct: X
   F-Archaic Hieroglyphic or Ideographic
   G-Obscure or questionable usage symbols
4. Is a repertoire including character names provided? Yes
   a. If YES, are the names in accordance with the “character naming guidelines” in Annex L of P&P document?
   b. Are the character shapes attached in a legible form suitable for review?
5. Fonts related:
   a. Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the standard? Anshuman Pandey
   b. Identify the party granting a license for use of the font by the editors (include address, e-mail, ftp-site, etc.): Anshuman Pandey
6. References:
   a. Are references (to other character sets, dictionaries, descriptive texts etc.) provided? Yes
   b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? Yes
7. Special encoding issues: Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration etc. (if yes please enclose information)? Yes
8. Additional Information: Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org for such information on other scripts. Also see Unicode Character Database (http://www.unicode.org/reports/tr44/) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard.

C. Technical - Justification

1. Has this proposal for addition of character(s) been submitted before? No
   If YES, explain:
2. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? Yes
   If YES, with whom? Christopher Shackle (SOAS, London)
   If YES, available relevant documents:
3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? No
   Reference: Historical script, currently not used
4. The context of use for the proposed characters (type of use; common or rare): Common
   Reference:
5. Are the proposed characters in current use by the user community? No
   If YES, where? Reference:
6. After giving due considerations to the principles in the P&P document must the proposed characters be entirely in the BMP? N/A
   If YES, is a rationale provided?
   If YES, reference:
7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? Yes
8. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
9. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
10. Can any of the proposed character(s) be considered to be similar (in appearance or function) to, or could be confused with, an existing character? No
    If YES, is a rationale for its inclusion provided?
    If YES, reference:
11. Does the proposal include use of combining characters and/or use of composite sequences? Yes
    If YES, is a rationale for such use provided? Yes
    If YES, reference: See text of proposal
    Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided?
    If YES, reference:
12. Does the proposal contain characters with any special properties such as control function or similar semantics? Yes
    If YES, describe in detail (include attachment if necessary): Virama; see text of proposal
13. Does the proposal contain any Ideographic compatibility characters? No
    If YES, are the equivalent corresponding unified ideographic characters identified?
    If YES, reference:

Proposal to encode the Dogra script in Unicode Anshuman Pandey

317 Collation

The primary collating order for Dogra is as follows

71680 lt 71681 lt 71682 lt 71683 lt 71684 lt 71685 lt 71686 lt 71687 lt 71688 lt 71689 lt

71690 lt 71691 lt 71692 lt 71693 lt 71694 lt 71695 lt 71696 lt 71697 lt

71698 lt 71699 lt 71700 lt 71701 lt 71702 lt 71703 lt 71704 lt 71705 lt

71706 lt 71707 lt 71708 lt 71709 lt 71710 lt 71711 lt 71712 lt 71713 lt

71714 lt 71715 lt 71716 lt 71717 lt 71718 lt 71719 lt 71720 lt 71721 lt 71722 lt

71723 lt 71724 lt 71725 lt 71726 lt

71727 lt 71728 lt 71729 lt 71730 lt

71731 lt 71732 lt 71733

The following characters have secondary weights 71734 71735 71736 71737

4 Character data

41 Character properties

The properties for Dogra in the Unicode Character Database format are

11800DOGRA LETTER ALo0LN11801DOGRA LETTER AALo0LN11802DOGRA LETTER ILo0LN11804DOGRA LETTER ULo0LN11806DOGRA LETTER ELo0LN11807DOGRA LETTER AILo0LN11808DOGRA LETTER OLo0LN1180ADOGRA LETTER KALo0LN1180BDOGRA LETTER KHALo0LN1180CDOGRA LETTER GALo0LN1180DDOGRA LETTER GHALo0LN1180EDOGRA LETTER NGALo0LN1180FDOGRA LETTER CALo0LN11810DOGRA LETTER CHALo0LN11811DOGRA LETTER JALo0LN11812DOGRA LETTER JHALo0LN11813DOGRA LETTER NYALo0LN

11

Proposal to encode the Dogra script in Unicode Anshuman Pandey

11814DOGRA LETTER TTALo0LN11815DOGRA LETTER TTHALo0LN11816DOGRA LETTER DDALo0LN11817DOGRA LETTER DDHALo0LN11818DOGRA LETTER NNALo0LN11819DOGRA LETTER TALo0LN1181ADOGRA LETTER THALo0LN1181BDOGRA LETTER DALo0LN1181CDOGRA LETTER DHALo0LN1181DDOGRA LETTER NALo0LN1181EDOGRA LETTER PALo0LN1181FDOGRA LETTER PHALo0LN11820DOGRA LETTER BALo0LN11821DOGRA LETTER BHALo0LN11822DOGRA LETTER MALo0LN11823DOGRA LETTER YALo0LN11824DOGRA LETTER RALo0LN11825DOGRA LETTER LALo0LN11826DOGRA LETTER VALo0LN11827DOGRA LETTER SHALo0LN11828DOGRA LETTER SSALo0LN11829DOGRA LETTER SALo0LN1182ADOGRA LETTER HALo0LN1182BDOGRA LETTER RRALo0LN1182CDOGRA VOWEL SIGN AAMc0LN1182DDOGRA VOWEL SIGN IMc0LN1182EDOGRA VOWEL SIGN IIMc0LN1182FDOGRA VOWEL SIGN UMn0NSMN11830DOGRA VOWEL SIGN UUMn0NSMN11831DOGRA VOWEL SIGN VOCALIC RMn0NSMN11832DOGRA VOWEL SIGN EMn0NSMN11833DOGRA VOWEL SIGN AIMn0NSMN11834DOGRA VOWEL SIGN OMn0NSMN11835DOGRA VOWEL SIGN AUMn0NSMN11836DOGRA SIGN ANUSVARAMn0NSMN11837DOGRA SIGN VISARGAMc0LN11838DOGRA SIGN VIRAMAMn9NSMN11839DOGRA SIGN NUKTAMn7NSMN1183ADOGRA ABBREVIATION SIGNPo0LN

42 Linebreaking

Linebreaking properties given in the data format of LineBreaktxt

118001182B AL Lo [44] DOGRA LETTER A DOGRA LETTER RRA1182C1182E CM Mc [3] DOGRA VOWEL SIGN AA DOGRA VOWEL SIGN II1182F11835 CM Mn [7] DOGRA VOWEL SIGN U DOGRA VOWEL SIGN AU11836 CM Mn DOGRA SIGN ANUSVARA11837 CM Mc DOGRA SIGN VISARGA1183811839 CM Mn [2] DOGRA SIGN VIRAMA DOGRA SIGN NUKTA1183A AL Po DOGRA ABBREVIATION SIGN

43 Syllabic categories

Syllabic categories given in the format of IndicSyllabicCategorytxt

Indic_Syllabic_Category=Bindu11836 Bindu Mn DOGRA SIGN ANUSVARA

12

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Indic_Syllabic_Category=Visarga11837 Visarga Mc DOGRA SIGN VISARGA

Indic_Syllabic_Category=Nukta11839 Nukta Mn DOGRA SIGN NUKTA

Indic_Syllabic_Category=Virama11838 Virama Mn DOGRA SIGN VIRAMA

Indic_Syllabic_Category=Vowel_Independent1180011808 Vowel_Independent Lo [3] DOGRA LETTER ADOGRA LETTER AU

Indic_Syllabic_Category=Vowel_Dependent1182C1182E Vowel_Dependent Mc [3] DOGRA VOWEL SIGN AADOGRA VOWEL SIGN I1182F11835 Vowel_Dependent Mn [7] DOGRA VOWEL SIGN UDOGRA VOWEL SIGN AU

Indic_Syllabic_Category=Consonant1180A1182B Consonant Lo [34] DOGRA LETTER KADOGRA LETTER RRA

44 Positional categories

Positional data for characters in the format of IndicPositionalCategorytxt

Indic_Positional_Category=Right1182C Right Mc DOGRA VOWEL SIGN A1182E Right Mc DOGRA VOWEL SIGN II11837 Right Mc DOGRA SIGN VISARGA

Indic_Positional_Category=Left1182D Left Mc DOGRA VOWEL SIGN I

Indic_Positional_Category=Top11836 Top Mn DOGRA SIGN ANUSVARA1183211835 Top Mn [4] DOGRA VOWEL SIGN EDOGRA VOWEL SIGN AU

Indic_Positional_Category=Bottom1182F11831 Bottom Mn [3] DOGRA VOWEL SIGN UDOGRA VOWEL SIGN VOCALIC R11838 Bottom Mn DOGRA SIGN VIRAMA11839 Bottom Mn DOGRA SIGN NUKTA

45 Script extensions

The following characters should be extended for usage with the Dogra script in ScriptExtensionstxt

0964 Po DEVANAGARI DANDA0965 Po DEVANAGARI DOUBLE DANDA

0966096F Nd [10] DEVANAGARI DIGIT ZERODEVANAGARI DIGIT NINE

A830A835 No [6] NORTH INDIC FRACTION ONE QUARTERNORTH INDIC FRACTION THREE SIXTEENTHS

A836A837 So [2] NORTH INDIC QUARTER MARKNORTH INDIC PLACEHOLDER MARKA838 Sc NORTH INDIC RUPEE MARKA839 So NORTH INDIC QUANTITY MARK


4.6 ‘Confusable’ characters

Some Dogra characters that resemble characters encoded in other script blocks are:

DOGRA LETTER U: 11187 SHARADA LETTER U
DOGRA LETTER UU: 11188 SHARADA LETTER UU
DOGRA LETTER O: 11684 TAKRI LETTER U
DOGRA LETTER KA: 0A15 GURMUKHI LETTER KA
DOGRA LETTER CA: 11196 SHARADA LETTER CA
DOGRA LETTER TTHA: 1119C SHARADA LETTER TTHA
DOGRA LETTER TA: 11699 TAKRI LETTER TA
DOGRA LETTER DHA: 11680 TAKRI LETTER A
DOGRA LETTER PHA: 1169F TAKRI LETTER PHA
DOGRA LETTER BHA: 116A1 TAKRI LETTER BHA
DOGRA LETTER YA: 116A3 TAKRI LETTER YA
DOGRA LETTER SHA: 0936 DEVANAGARI LETTER SHA
DOGRA LETTER SSA: 111B0 SHARADA LETTER SSA
DOGRA VOWEL SIGN U: 0941 DEVANAGARI VOWEL SIGN U
DOGRA VOWEL SIGN UU: 0942 DEVANAGARI VOWEL SIGN UU
DOGRA VOWEL SIGN E: 116B2 TAKRI VOWEL SIGN E
DOGRA VOWEL SIGN AI: 116B3 TAKRI VOWEL SIGN AI
DOGRA VOWEL SIGN O: 116B4 TAKRI VOWEL SIGN O
DOGRA VOWEL SIGN AU: 116B5 TAKRI VOWEL SIGN AU
DOGRA SIGN ANUSVARA: 0902 DEVANAGARI SIGN ANUSVARA
DOGRA SIGN VIRAMA: 094D DEVANAGARI SIGN VIRAMA
DOGRA SIGN NUKTA: 093C DEVANAGARI SIGN NUKTA
DOGRA ABBREVIATION SIGN: 0970 DEVANAGARI ABBREVIATION SIGN
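The pairs listed above could feed a simple lookalike check. The Python sketch below is only an illustration: `CONFUSABLE` and `skeleton` are hypothetical names, the Dogra code points are those proposed in this document, and the mapping is not taken from any published confusables data.

```python
# Proposed Dogra code point -> visually similar character in another block,
# transcribed from the list in section 4.6.
CONFUSABLE = {
    0x11804: 0x11187,  # LETTER U      ~ SHARADA LETTER U
    0x11805: 0x11188,  # LETTER UU     ~ SHARADA LETTER UU
    0x11808: 0x11684,  # LETTER O      ~ TAKRI LETTER U
    0x1180A: 0x0A15,   # LETTER KA     ~ GURMUKHI LETTER KA
    0x1180F: 0x11196,  # LETTER CA     ~ SHARADA LETTER CA
    0x11815: 0x1119C,  # LETTER TTHA   ~ SHARADA LETTER TTHA
    0x11819: 0x11699,  # LETTER TA     ~ TAKRI LETTER TA
    0x1181C: 0x11680,  # LETTER DHA    ~ TAKRI LETTER A
    0x1181F: 0x1169F,  # LETTER PHA    ~ TAKRI LETTER PHA
    0x11821: 0x116A1,  # LETTER BHA    ~ TAKRI LETTER BHA
    0x11823: 0x116A3,  # LETTER YA     ~ TAKRI LETTER YA
    0x11827: 0x0936,   # LETTER SHA    ~ DEVANAGARI LETTER SHA
    0x11828: 0x111B0,  # LETTER SSA    ~ SHARADA LETTER SSA
    0x1182F: 0x0941,   # VOWEL SIGN U  ~ DEVANAGARI VOWEL SIGN U
    0x11830: 0x0942,   # VOWEL SIGN UU ~ DEVANAGARI VOWEL SIGN UU
    0x11832: 0x116B2,  # VOWEL SIGN E  ~ TAKRI VOWEL SIGN E
    0x11833: 0x116B3,  # VOWEL SIGN AI ~ TAKRI VOWEL SIGN AI
    0x11834: 0x116B4,  # VOWEL SIGN O  ~ TAKRI VOWEL SIGN O
    0x11835: 0x116B5,  # VOWEL SIGN AU ~ TAKRI VOWEL SIGN AU
    0x11836: 0x0902,   # SIGN ANUSVARA ~ DEVANAGARI SIGN ANUSVARA
    0x11838: 0x094D,   # SIGN VIRAMA   ~ DEVANAGARI SIGN VIRAMA
    0x11839: 0x093C,   # SIGN NUKTA    ~ DEVANAGARI SIGN NUKTA
    0x1183A: 0x0970,   # ABBREVIATION SIGN ~ DEVANAGARI ABBREVIATION SIGN
}


def skeleton(text):
    """Replace each listed Dogra character by its lookalike; keep the rest."""
    return "".join(chr(CONFUSABLE.get(ord(ch), ord(ch))) for ch in text)
```

Two strings whose skeletons are equal may be hard to tell apart visually, which is the usual motivation for recording such pairs.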

5 References

Bhāskarācārya. 1872. Līlavatī. Translation into Dogri. Jammu: Vidya Vilas Press.

Grierson, George A. 1916. The Linguistic Survey of India. Vol. IX. Indo-Aryan Family. Central Group. Part I. Specimens of Western Hindī and Pañjābī. Calcutta: Office of the Superintendent of Government Printing, India.

Leitner, Gottlieb William. 1882. “A Collection of Specimens of Commercial and Other Alphabets and Handwritings as also of Multiplication Tables Current in Various Parts of the Panjab, Sind and the North West Provinces”. Published as Appendix V of History of Indigenous Education in the Punjab. Lahore: Anjuman-i-Punjab Press.

Pandey, Anshuman. 2007. “Proposal to Encode North Indic Number Forms in ISO/IEC 10646” (L2/07-354). http://www.unicode.org/L2/L2007/07354-north-indic.pdf

Pandey, Anshuman. 2009. “Proposal to Encode the Takri Script in ISO/IEC 10646” (L2/09-424). http://www.unicode.org/L2/L2009/09424-takri.pdf

Pandey, Anshuman. 2015. “Preliminary Proposal to Encode the Dogra Script in Unicode” (L2/15-213). http://www.unicode.org/L2/L2015/15213-dogra.pdf

Staal, Fritz. 1984. Stamps of Jammu & Kashmir. New York: The Collectors Club.

von der Lin, Carol A. “Collection Kashmir”. “Philatelic Glossary: Dōgrī-Takrī Scripts”. http://www.kashmirstamps.com/DogriGloss.html


6 Acknowledgments

I would like to thank Christopher Shackle (School of Oriental and African Studies, London) for reviewing this proposal.

This project was made possible in part through a Google Research Award, granted to Deborah Anderson for the Script Encoding Initiative, and a grant from the United States National Endowment for the Humanities (PR-50205-15), which funds the Universal Scripts Project (part of the Script Encoding Initiative at the University of California, Berkeley). Any views, findings, conclusions, or recommendations expressed in this publication do not necessarily reflect those of Google or the National Endowment for the Humanities.

Code chart: Dogra, range 11800–1184F, columns 1180–1184 (printed using UniBook™, http://www.unicode.org/unibook/, 04-Nov-2015).

Names list: Dogra, range 11800–1183A

Independent vowels
11800 DOGRA LETTER A
11801 DOGRA LETTER AA
11802 DOGRA LETTER I
11803 DOGRA LETTER II
11804 DOGRA LETTER U
11805 DOGRA LETTER UU
11806 DOGRA LETTER E
11807 DOGRA LETTER AI
11808 DOGRA LETTER O
11809 DOGRA LETTER AU

Consonants
1180A DOGRA LETTER KA
1180B DOGRA LETTER KHA
1180C DOGRA LETTER GA
1180D DOGRA LETTER GHA
1180E DOGRA LETTER NGA
1180F DOGRA LETTER CA
11810 DOGRA LETTER CHA
11811 DOGRA LETTER JA
11812 DOGRA LETTER JHA
11813 DOGRA LETTER NYA
11814 DOGRA LETTER TTA
11815 DOGRA LETTER TTHA
11816 DOGRA LETTER DDA
11817 DOGRA LETTER DDHA
11818 DOGRA LETTER NNA
11819 DOGRA LETTER TA
1181A DOGRA LETTER THA
1181B DOGRA LETTER DA
1181C DOGRA LETTER DHA
1181D DOGRA LETTER NA
1181E DOGRA LETTER PA
1181F DOGRA LETTER PHA
11820 DOGRA LETTER BA
11821 DOGRA LETTER BHA
11822 DOGRA LETTER MA
11823 DOGRA LETTER YA
11824 DOGRA LETTER RA
11825 DOGRA LETTER LA
11826 DOGRA LETTER VA
11827 DOGRA LETTER SHA
11828 DOGRA LETTER SSA
11829 DOGRA LETTER SA
1182A DOGRA LETTER HA
1182B DOGRA LETTER RRA

Dependent vowel signs
1182C DOGRA VOWEL SIGN AA
1182D DOGRA VOWEL SIGN I
1182E DOGRA VOWEL SIGN II
1182F DOGRA VOWEL SIGN U
11830 DOGRA VOWEL SIGN UU
11831 DOGRA VOWEL SIGN VOCALIC R
11832 DOGRA VOWEL SIGN E
11833 DOGRA VOWEL SIGN AI
11834 DOGRA VOWEL SIGN O
11835 DOGRA VOWEL SIGN AU

Various signs
11836 DOGRA SIGN ANUSVARA
11837 DOGRA SIGN VISARGA
11838 DOGRA SIGN VIRAMA
11839 DOGRA SIGN NUKTA

Punctuation
1183A DOGRA ABBREVIATION SIGN

Figure 1: Comparison of Old Dogra with Takri and other scripts (from Leitner 1882: Set 2).

Figure 2: Comparison of Old Dogra with Takri and other scripts (from Leitner 1882: Set 2).

Figure 3: Specimen of the Dogri language written in Old Dogra (from Grierson 1916: 760).

Figure 4: Another specimen of the Dogri language written in Old Dogra (from Grierson 1916: 772).

Figure 5: Chart of the Old Dogra script (from Grierson 1916: 641).

Figure 6: Comparison of Gurmukhi, Kangra, and Old Dogra scripts (from Grierson 1916: 642).

Figure 7: Comparison of New Dogra (‘Dogri’) and Devanagari vowel and consonant letters (from Staal 1984: 33).

Figure 8: Comparison of New Dogra (‘Dogri’) and Devanagari vowel signs (from Staal 1984: 34).

Figure 9: The first page of a Dogri translation of the Līlavatī by Bhāskarācārya, a Sanskrit treatise on mathematics, printed in the New Dogra script in 1872 (p. 1). Figure 10 contains the second page of this book.

Figure 10: Another page from the Līlavatī in New Dogra (p. 2).

Figure 11: Page from the Līlavatī showing the use of printed New Dogra digits (p. 3).

Figure 12: Usage of fractions for denoting currency in New Dogra (from Staal 1984: 75). The text appears to be typeset using metal fonts. The fraction signs and currency marks may be represented using characters from the ‘Common Indic Number Forms’ block (U+A830) in Unicode.

Figure 13: Two specimens of a trilingual stamp for expedited mail from Jammu State in Old Dogra, Persian, and Devanagari scripts. The Dogra text reads <1180A 1180A 11825> <11811 11824 1182F 11824 1182E> kakal jarurī. The Persian reads خط ضروری ḵẖat zarūrī; the Devanagari reads आवश्यक पत्र āvaśyaka patra.

Figure 14: Four bilingual regular postage stamps from Jammu State with text in the Old Dogra and Persian scripts. Stamps [...] and [...] are half ‘anna’ stamps and [...] is a one ‘anna’ stamp. In all stamps, the Dogra text at the top reads <11811 11822 11804> jamu ‘Jammu’ and <1180A 11810 11822 1182E 11824> kachamīr (= kaśmīr) ‘Kashmir’. In stamps [...] and [...], the first line of the center is Persian: ۱۹۲۳ نم انه 1923 [samvat] nim ānah ‘half ānnā’; the second line of the center is Dogra: <11800 1181B 11800 1181D> ada ana ‘half ānnā’. In [...], the first line of the center text is Persian: یك انه yak ānah ‘one ānnā’; the second line of the center is Dogra: <11806 1180A 11800 1181D> ekana ‘one ānnā’. In all four stamps, the third line of the center is the date 1923 [samvat] written in Devanagari-like digits: १९२३. The Persian text at the bottom of the stamp reads قلمرو سرکار جمون و کشمیر qalamrau sarkār jammūn o kaśmīr ‘dominions of the ruler of Jammu and Kashmir’.

Detail of top left (Devanagari)

Detail of top right (New Dogra)

Detail of center (New Dogra, Devanagari, and Urdu)

Figure 15: The mast of a two rupee non-judicial stamp paper from Kashmir State.

Figure 16: The mast of a five rupee non-judicial stamp paper from Kashmir State containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.

Figure 17: A one anna stamp from Jammu and Kashmir State with text in New Dogra, Devanagari, and Arabic (Urdu).

Figure 18: Telegraph stamps from Jammu and Kashmir State with text in Dogra, Devanagari, Urdu, and English. The Dogra denominations show usage of Common Indic fractions and the rupee mark.

postakarda kalamrau jammu kasmir va tibbataha: “Postcard of the realm of Jammu, Kashmir, and Tibet”

isa tarapha siva pate ke kucha na likho: “Do not write anything on this side except for the address”

jamu, sinagar: “Jammu”, “Srinagar”

bhaisakh (4 ), 4 vaisakh: “Vaisakh 4”, “4 Vaisakh”

Figure 19: A state postcard from Jammu and Kashmir with postmark of 19 April 1891. The inscription at top is in New Dogra and the content is in Persian. The postmarks are in English and Dogra.

One rupee banknote

Twenty rupee banknote

Figure 20: Bank notes from Jammu and Kashmir State containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.

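Vowel letters

Letter   Chamba (Takri block)   Dogra
A        U+11680                U+11800
AA       U+11681                U+11801
I        U+11682                U+11802
II       U+11683                (none)
U        U+11684                U+11804
UU       U+11685                (none)
E        U+11686                U+11806
AI       U+11687                U+11807
O        U+11688                U+11808
AU       U+11689                (none)

Vowel signs

Sign     Chamba (Takri block)   Dogra
A        no dependent form
AA       U+116AD                U+1182C
I        U+116AE                U+1182D
II       U+116AF                U+1182E
U        U+116B0                U+1182F
UU       U+116B1                U+11830
E        U+116B2                U+11832
AI       U+116B3                U+11833
O        U+116B4                U+11834
AU       U+116B5                U+11835

Table 2: Comparison of Chamba and Dogra vowel letters and signs. Glyphs are identified here by code point.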


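Letter   Chamba (Takri block)   Dogra
KA       U+1168A                U+1180A
KHA      U+1168B                U+1180B
GA       U+1168C                U+1180C
GHA      U+1168D                U+1180D
NGA      U+1168E                U+1180E
CA       U+1168F                U+1180F
CHA      U+11690                U+11810
JA       U+11691                U+11811
JHA      U+11692                U+11812
NYA      U+11693                U+11813
TTA      U+11694                U+11814
TTHA     U+11695                U+11815
DDA      U+11696                U+11816
DDHA     U+11697                U+11817
NNA      U+11698                U+11818
TA       U+11699                U+11819
THA      U+1169A                U+1181A
DA       U+1169B                U+1181B
DHA      U+1169C                U+1181C
NA       U+1169D                U+1181D
PA       U+1169E                U+1181E
PHA      U+1169F                U+1181F
BA       U+116A0                U+11820
BHA      U+116A1                U+11821
MA       U+116A2                U+11822
YA       U+116A3                U+11823
RA       U+116A4                U+11824
LA       U+116A5                U+11825
VA       U+116A6                U+11826
SHA      U+116A7                U+11827
SSA      (none)                 U+11828
SA       U+116A8                U+11829
HA       U+116A9                U+1182A
RRA      U+116AA                U+1182B

Table 3: Comparison of Chamba and Dogra consonant letters. Glyphs are identified here by code point.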


ISO/IEC JTC 1/SC 2/WG 2
PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646

Please fill all the sections A, B and C below. Please read Principles and Procedures Document (P & P) from http://std.dkuug.dk/JTC1/SC2/WG2/docs/principles.html for guidelines and details before filling this form. Please ensure you are using the latest Form from http://std.dkuug.dk/JTC1/SC2/WG2/docs/summaryform.html. See also http://std.dkuug.dk/JTC1/SC2/WG2/docs/roadmaps.html for latest Roadmaps.

A. Administrative

1. Title: Proposal to encode the Dogra script in Unicode
2. Requester's name: Anshuman Pandey, Script Encoding Initiative
3. Requester type (Member body/Liaison/Individual contribution): Liaison contribution
4. Submission date: 4 November 2015
5. Requester's reference (if applicable):
6. Choose one of the following:
   This is a complete proposal: Yes
   (or) More information will be provided later:

B. Technical - General

1. Choose one of the following:
   a. This proposal is for a new script (set of characters): Yes
      Proposed name of script: Dogra
   b. The proposal is for addition of character(s) to an existing block:
      Name of the existing block:
2. Number of characters in proposal: 59
3. Proposed category (select one from below - see section 2.2 of P&P document):
   A-Contemporary
   B.1-Specialized (small collection)
   B.2-Specialized (large collection)
   C-Major extinct
   D-Attested extinct
   E-Minor extinct X
   F-Archaic Hieroglyphic or Ideographic
   G-Obscure or questionable usage symbols
4. Is a repertoire including character names provided? Yes
   a. If YES, are the names in accordance with the “character naming guidelines” in Annex L of P&P document?
   b. Are the character shapes attached in a legible form suitable for review?
5. Fonts related:
   a. Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the standard? Anshuman Pandey
   b. Identify the party granting a license for use of the font by the editors (include address, e-mail, ftp-site, etc.): Anshuman Pandey
6. References:
   a. Are references (to other character sets, dictionaries, descriptive texts, etc.) provided? Yes
   b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? Yes
7. Special encoding issues: Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration, etc. (if yes, please enclose information)? Yes
8. Additional Information: Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths, etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org for such information on other scripts. Also see Unicode Character Database (http://www.unicode.org/reports/tr44/) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard.

Form number: N4502-F (Original 1994-10-14; Revised 1995-01, 1995-04, 1996-04, 1996-08, 1999-03, 2001-05, 2001-09, 2003-11, 2005-01, 2005-09, 2005-10, 2007-03, 2008-05, 2009-11, 2011-03, 2012-01)

C. Technical - Justification

1. Has this proposal for addition of character(s) been submitted before? No
   If YES, explain:
2. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? Yes
   If YES, with whom? Christopher Shackle (SOAS, London)
   If YES, available relevant documents:
3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? No
   Reference: Historical script, currently not used
4. The context of use for the proposed characters (type of use; common or rare): Common
   Reference:
5. Are the proposed characters in current use by the user community? No
   If YES, where? Reference:
6. After giving due considerations to the principles in the P&P document, must the proposed characters be entirely in the BMP? N/A
   If YES, is a rationale provided?
   If YES, reference:
7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? Yes
8. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
9. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
10. Can any of the proposed character(s) be considered to be similar (in appearance or function) to, or could be confused with, an existing character? No
    If YES, is a rationale for its inclusion provided?
    If YES, reference:
11. Does the proposal include use of combining characters and/or use of composite sequences? Yes
    If YES, is a rationale for such use provided? Yes
    If YES, reference: See text of proposal
    Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided?
    If YES, reference:
12. Does the proposal contain characters with any special properties such as control function or similar semantics? Yes
    If YES, describe in detail (include attachment if necessary): Virama; see text of proposal
13. Does the proposal contain any Ideographic compatibility characters? No
    If YES, are the equivalent corresponding unified ideographic characters identified?
    If YES, reference:

Proposal to encode the Dogra script in Unicode Anshuman Pandey

11814DOGRA LETTER TTALo0LN11815DOGRA LETTER TTHALo0LN11816DOGRA LETTER DDALo0LN11817DOGRA LETTER DDHALo0LN11818DOGRA LETTER NNALo0LN11819DOGRA LETTER TALo0LN1181ADOGRA LETTER THALo0LN1181BDOGRA LETTER DALo0LN1181CDOGRA LETTER DHALo0LN1181DDOGRA LETTER NALo0LN1181EDOGRA LETTER PALo0LN1181FDOGRA LETTER PHALo0LN11820DOGRA LETTER BALo0LN11821DOGRA LETTER BHALo0LN11822DOGRA LETTER MALo0LN11823DOGRA LETTER YALo0LN11824DOGRA LETTER RALo0LN11825DOGRA LETTER LALo0LN11826DOGRA LETTER VALo0LN11827DOGRA LETTER SHALo0LN11828DOGRA LETTER SSALo0LN11829DOGRA LETTER SALo0LN1182ADOGRA LETTER HALo0LN1182BDOGRA LETTER RRALo0LN1182CDOGRA VOWEL SIGN AAMc0LN1182DDOGRA VOWEL SIGN IMc0LN1182EDOGRA VOWEL SIGN IIMc0LN1182FDOGRA VOWEL SIGN UMn0NSMN11830DOGRA VOWEL SIGN UUMn0NSMN11831DOGRA VOWEL SIGN VOCALIC RMn0NSMN11832DOGRA VOWEL SIGN EMn0NSMN11833DOGRA VOWEL SIGN AIMn0NSMN11834DOGRA VOWEL SIGN OMn0NSMN11835DOGRA VOWEL SIGN AUMn0NSMN11836DOGRA SIGN ANUSVARAMn0NSMN11837DOGRA SIGN VISARGAMc0LN11838DOGRA SIGN VIRAMAMn9NSMN11839DOGRA SIGN NUKTAMn7NSMN1183ADOGRA ABBREVIATION SIGNPo0LN

42 Linebreaking

Linebreaking properties given in the data format of LineBreaktxt

118001182B AL Lo [44] DOGRA LETTER A DOGRA LETTER RRA1182C1182E CM Mc [3] DOGRA VOWEL SIGN AA DOGRA VOWEL SIGN II1182F11835 CM Mn [7] DOGRA VOWEL SIGN U DOGRA VOWEL SIGN AU11836 CM Mn DOGRA SIGN ANUSVARA11837 CM Mc DOGRA SIGN VISARGA1183811839 CM Mn [2] DOGRA SIGN VIRAMA DOGRA SIGN NUKTA1183A AL Po DOGRA ABBREVIATION SIGN

43 Syllabic categories

Syllabic categories given in the format of IndicSyllabicCategorytxt

Indic_Syllabic_Category=Bindu11836 Bindu Mn DOGRA SIGN ANUSVARA

12

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Indic_Syllabic_Category=Visarga11837 Visarga Mc DOGRA SIGN VISARGA

Indic_Syllabic_Category=Nukta11839 Nukta Mn DOGRA SIGN NUKTA

Indic_Syllabic_Category=Virama11838 Virama Mn DOGRA SIGN VIRAMA

Indic_Syllabic_Category=Vowel_Independent1180011808 Vowel_Independent Lo [3] DOGRA LETTER ADOGRA LETTER AU

Indic_Syllabic_Category=Vowel_Dependent1182C1182E Vowel_Dependent Mc [3] DOGRA VOWEL SIGN AADOGRA VOWEL SIGN I1182F11835 Vowel_Dependent Mn [7] DOGRA VOWEL SIGN UDOGRA VOWEL SIGN AU

Indic_Syllabic_Category=Consonant1180A1182B Consonant Lo [34] DOGRA LETTER KADOGRA LETTER RRA

44 Positional categories

Positional data for characters in the format of IndicPositionalCategorytxt

Indic_Positional_Category=Right1182C Right Mc DOGRA VOWEL SIGN A1182E Right Mc DOGRA VOWEL SIGN II11837 Right Mc DOGRA SIGN VISARGA

Indic_Positional_Category=Left1182D Left Mc DOGRA VOWEL SIGN I

Indic_Positional_Category=Top11836 Top Mn DOGRA SIGN ANUSVARA1183211835 Top Mn [4] DOGRA VOWEL SIGN EDOGRA VOWEL SIGN AU

Indic_Positional_Category=Bottom1182F11831 Bottom Mn [3] DOGRA VOWEL SIGN UDOGRA VOWEL SIGN VOCALIC R11838 Bottom Mn DOGRA SIGN VIRAMA11839 Bottom Mn DOGRA SIGN NUKTA

45 Script extensions

The following characters should be extended for usage with the Dogra script in ScriptExtensionstxt

0964 Po DEVANAGARI DANDA0965 Po DEVANAGARI DOUBLE DANDA

0966096F Nd [10] DEVANAGARI DIGIT ZERODEVANAGARI DIGIT NINE

A830A835 No [6] NORTH INDIC FRACTION ONE QUARTERNORTH INDIC FRACTION THREE SIXTEENTHS

A836A837 So [2] NORTH INDIC QUARTER MARKNORTH INDIC PLACEHOLDER MARKA838 Sc NORTH INDIC RUPEE MARKA839 So NORTH INDIC QUANTITY MARK

13

Proposal to encode the Dogra script in Unicode Anshuman Pandey

46 lsquoConfusablersquo characters

Some Dogra characters that resemble characters encoded in other script blocks are

DOGRA LETTER U 11187 SHARADA LETTER UDOGRA LETTER UU 11188 SHARADA LETTER UUDOGRA LETTER O 11684 TAKRI LETTER UDOGRA LETTER KA 0A15 GURMUKHI LETTER KADOGRA LETTER CA 11196 SHARADA LETTER CADOGRA LETTER TTHA 1119C SHARADA LETTER TTHADOGRA LETTER TA 11699 TAKRI LETTER TADOGRA LETTER DHA 11680 TAKRI LETTER ADOGRA LETTER PHA 1169F TAKRI LETTER PHADOGRA LETTER BHA 116A1 TAKRI LETTER BHADOGRA LETTER YA 116A3 TAKRI LETTER YADOGRA LETTER SHA 0936 DEVANAGARI LETTER SHADOGRA LETTER SSA 111B0 SHARADA LETTER SSADOGRA VOWEL SIGN U 0941 DEVANAGARI VOWEL SIGN UDOGRA VOWEL SIGN UU 0942 DEVANAGARI VOWEL SIGN UUDOGRA VOWEL SIGN E 116B2 TAKRI VOWEL SIGN EDOGRA VOWEL SIGN AI 116B3 TAKRI VOWEL SIGN AIDOGRA VOWEL SIGN O 116B4 TAKRI VOWEL SIGN ODOGRA VOWEL SIGN AU 116B5 TAKRI VOWEL SIGN AUDOGRA SIGN ANUSVARA 0902 DEVANAGARI SIGN ANUSVARADOGRA SIGN VIRAMA 094D DEVANAGARI SIGN VIRAMADOGRA SIGN NUKTA 093C DEVANAGARI SIGN NUKTADOGRA ABBREVIATION SIGN 0970 DEVANAGARI ABBREVIATION SIGN

5 References

Bhāskarācārya 1872 Līlavatī Translation into Dogri Jammu Vidya Vilas Press

Grierson George A 1916 The Linguistic Survey of India Vol IX Indo-Aryan Family Central GroupPart I Specimens of Western Hindī and Pantildejābī Calcutta Office of the Superintendent of GovernmentPrinting India

Leitner Gottlieb William 1882 ldquoA Collection of Specimens of Commercial and Other Alphabets andHandwritings as also of Multiplication Tables Current in Various Parts of the Panjab Sind and the NorthWest Provincesrdquo Published as Appendix V of History of Indigenous Education in the Punjab LahoreAnjuman-i-Punjab Press

Pandey Anshuman 2007 ldquoProposal to Encode North Indic Number Forms in ISOIEC 10646rdquo (L207-354)httpwwwunicodeorgL2L200707354-north-indicpdf

mdashmdashmdash 2009 ldquoProposal to Encode the Takri Script in ISOIEC 10646rdquo (L209-424)httpwwwunicodeorgL2L200909424-takripdf

mdashmdashmdash 2015 ldquoPreliminary Proposal to Encode the Dogra Script in Unicoderdquo (L215-213)httpwwwunicodeorgL2L201515213-dograpdf

Staal Fritz 1984 Stamps of Jammu amp Kashmir New York The Collectors Club

von der Lin Carol A ldquoCollection Kashmirrdquo ldquoPhilatelic Glossary Dōgrī-Takrī ScriptsrdquohttpwwwkashmirstampscomDogriGlosshtml

14

Proposal to encode the Dogra script in Unicode Anshuman Pandey

6 Acknowledgments

I would like to thank Christopher Shackle (School of Oriental and African Studies London) for reviewingthis proposal

This project was made possible in part through a Google Research Award granted to Deborah Anderson forthe Script Encoding Initiative and a grant from the United States National Endowment for the Humanities(PR-50205-15) which funds the Universal Scripts Project (part of the Script Encoding Initiative at the Uni-versity of California Berkeley) Any views findings conclusions or recommendations expressed in thispublication do not necessarily reflect those of Google or the National Endowment for the Humanities

15

Printed using UniBooktrade(httpwwwunicodeorgunibook)

Printed 04-Nov-2015 1

1184FDogra11800

1180 1181 1182 1183 1184

a

71681

i

71683

u

71685

e

71687

o

71689

71690

71691

71692

71693

71694

71695

71696

71697

71698

71699

71700

71701

71702

71703

71704

71705

71706

71707

71708

71709

71710

71711

71712

71713

71714

71715

71716

71717

71718

71719

71720

71721

71722

71723

$71724

$ i

$71726

$ u

$ 71728

$ r

$ e

$ 71731

$ o

$ 71733

$ 71734

$71735

$ 71736

$ 71737

71738

11800

11801

11802

11803

11804

11805

11806

11807

11808

11809

1180A

1180B

1180C

1180D

1180E

1180F

11810

11811

11812

11813

11814

11815

11816

11817

11818

11819

1181A

1181B

1181C

1181D

1181E

1181F

11820

11821

11822

11823

11824

11825

11826

11827

11828

11829

1182A

1182B

1182C

1182D

1182E

1182F

11830

11831

11832

11833

11834

11835

11836

11837

11838

11839

1183A

0

1

2

3

4

5

6

7

8

9

A

B

C

D

E

F

Printed using UniBooktrade(httpwwwunicodeorgunibook)

Printed 04-Nov-2015 2

1183ADogra11800

Punctuation1183A 71738 DOGRA ABBREVIATION SIGN

Independent vowels11800 a DOGRA LETTER A11801 71681 DOGRA LETTER AA11802 i DOGRA LETTER I11803 71683 DOGRA LETTER II11804 u DOGRA LETTER U11805 71685 DOGRA LETTER UU11806 e DOGRA LETTER E11807 71687 DOGRA LETTER AI11808 o DOGRA LETTER O11809 71689 DOGRA LETTER AU

Consonants1180A 71690 DOGRA LETTER KA1180B 71691 DOGRA LETTER KHA1180C 71692 DOGRA LETTER GA1180D 71693 DOGRA LETTER GHA1180E 71694 DOGRA LETTER NGA1180F 71695 DOGRA LETTER CA11810 71696 DOGRA LETTER CHA11811 71697 DOGRA LETTER JA11812 71698 DOGRA LETTER JHA11813 71699 DOGRA LETTER NYA11814 71700 DOGRA LETTER TTA11815 71701 DOGRA LETTER TTHA11816 71702 DOGRA LETTER DDA11817 71703 DOGRA LETTER DDHA11818 71704 DOGRA LETTER NNA11819 71705 DOGRA LETTER TA1181A 71706 DOGRA LETTER THA1181B 71707 DOGRA LETTER DA1181C 71708 DOGRA LETTER DHA1181D 71709 DOGRA LETTER NA1181E 71710 DOGRA LETTER PA1181F 71711 DOGRA LETTER PHA11820 71712 DOGRA LETTER BA11821 71713 DOGRA LETTER BHA11822 71714 DOGRA LETTER MA11823 71715 DOGRA LETTER YA11824 71716 DOGRA LETTER RA11825 71717 DOGRA LETTER LA11826 71718 DOGRA LETTER VA11827 71719 DOGRA LETTER SHA11828 71720 DOGRA LETTER SSA11829 71721 DOGRA LETTER SA1182A 71722 DOGRA LETTER HA1182B 71723 DOGRA LETTER RRA

Dependent vowel signs1182C $71724 DOGRA VOWEL SIGN AA1182D $ i DOGRA VOWEL SIGN I1182E $71726 DOGRA VOWEL SIGN II1182F $ u DOGRA VOWEL SIGN U11830 $ 71728 DOGRA VOWEL SIGN UU11831 $ r DOGRA VOWEL SIGN VOCALIC R11832 $ e DOGRA VOWEL SIGN E11833 $ 71731 DOGRA VOWEL SIGN AI11834 $ o DOGRA VOWEL SIGN O11835 $ 71733 DOGRA VOWEL SIGN AU

Various signs11836 $ 71734 DOGRA SIGN ANUSVARA11837 $71735 DOGRA SIGN VISARGA11838 $ 71736 DOGRA SIGN VIRAMA11839 $ 71737 DOGRA SIGN NUKTA

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 1 Comparison of Old Dogra with Takri and other scripts (from Leitner 1882 Set 2)

18

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 2 Comparison of Old Dogra with Takri and other scripts (from Leitner 1882 Set 2)

19

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 3 Specimen of the Dogri language written in Old Dogra (from Grierson 1916 760)

20

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 4 Another specimen of the Dogri language written in Old Dogra (fromGrierson 1916 772)

21

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 5 Chart of the Old Dogra script (from Grierson 1916 641)

22

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 6 Comparison of Gurmukhi Kangra and Old Dogra scripts (from Grierson 1916 642)

23

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 7 Comparison of New Dogra (lsquoDogrirsquo) and Devanagari vowel and consonant letters (fromStaal 1984 33)

Figure 8 Comparison of New Dogra (lsquoDogrirsquo) and Devanagari vowel signs (from Staal 1984 34)

24

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 9 The first page of a Dogri translation of the Līlavatī by Bhāskarācārya a Sanskrit treastiseon mathematics printed in the New Dogra script in 1872 (p 1) Figure 10 contains the second pageof this book

25

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 10 Another page from the Līlavatī in New Dogra (p 2)

26

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 11 Page from the Līlavatī showing the use of printed New Dogra digits (p 3)

27

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 12 Usage of fractions for denoting currency in New Dogra (from Staal 1984 75) The textappears to be typeset using metal fonts The fraction signs are currency marks may be representedusing characters from the lsquoCommon Indic Number Formsrsquo block (U+A830) in Unicode

28

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 13 Two specimens of a trilingual stamp for expedited mail from Jammu State in Old DograPersian and Devanagari scripts The Dogra text reads 716907169071717 7169771716717277171671726 kakal jarurī The Persian readsضروری خط ḵẖat zarūrī the Devanagari reads āvaśyaka patraआव यक प

Figure 14 Four bilingual regular postage stamps from Jammu State with text in the Old Dogra andPersian scripts Stamps and are half lsquoannarsquo stamps and is a one lsquoannarsquo stamp In all stampsthe Dogra text at the top reads 716977171471684 jamu lsquoJammursquo and 7169071696717147172671716 kachamīr (= kaśmīr) lsquoKashmirrsquoIn stamps and the first line of the center is Persian انه نم ۱۹۲۳ 1923 [samvat] nim ānah lsquohalfānnārsquo the second line of the center is Dogra71680717077168071709 ada ana lsquohalf ānnārsquo In the first line of thecenter text is Persian انه یك yak ānah lsquoone ānnārsquo the second line of the center is Dogra71686716907168071709 ekana lsquoone ānnārsquo In all four stamps the third line of the center is the date 1923 [samvat] written inDevanagari-like digits १९२३ The Persian text at the bottom of the stamp reads جمون سرکار قلمروکشمیر و qalamrau sarkār jammūn o kaśmīr lsquodominions of the ruler of Jammu and Kashmirrsquo

29

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Detail of top left (Devanagari)

Detail of top right (New Dogra)

Detail of center (New Dogra Devanagari and Urdu)

Figure 15 The mast of a two rupee non-judicial stamp paper from Kashmir State

30

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 16 The mast of a five rupee non-judicial stamp paper from Kashmir state containing text inNew Dogra Devanagari and Arabic (Persian) scripts

Figure 17 A one anna stamp from Jammu and Kashmir state with text in New Dogra Devanagariand Arabic (Urdu)

31

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 18 Telegraph stamps from Jammu and Kashmir state with text in Dogra Devanagari Urduand English The Dogra denominations show usage of Common Indic fractions and the rupee mark

32

Proposal to encode the Dogra script in Unicode Anshuman Pandey

postakarda kalamrau jammu kasmir va tibbatahaldquoPostcard of the realm of Jammu Kashmir and Tibetrdquo

isa tarapha siva pate ke kucha na likholdquoDo no write anything on this side except for the addressrdquo

jamu sinagarldquoJammurdquo ldquoSrinagarrdquo

bhaisakh (4 ) 4 vaisakhldquoVaisakh 4rdquo ldquo4 Vaisakhrdquo

Figure 19 A state postcard from Jammu and Kashmir with post mark of 19 April 1891 Theinscription at top is in New Dogra and the content is in Persian The postmarks are in English andDogra

One rupee banknote

Twenty rupee banknote

Figure 20: Bank notes from Jammu and Kashmir State containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.

Chamba     Dogra

U+11680    U+11800
U+11681    U+11801
U+11682    U+11802
U+11683    -
U+11684    U+11804
U+11685    -
U+11686    U+11806
U+11687    U+11807
U+11688    U+11808
U+11689    -

Chamba     Dogra

no dependent form
U+116AD    U+1182C
U+116AE    U+1182D
U+116AF    U+1182E
U+116B0    U+1182F
U+116B1    U+11830
U+116B2    U+11832
U+116B3    U+11833
U+116B4    U+11834
U+116B5    U+11835

Table 2: Comparison of Chamba and Dogra vowel letters and signs

Chamba     Dogra

U+1168A    U+1180A
U+1168B    U+1180B
U+1168C    U+1180C
U+1168D    U+1180D
U+1168E    U+1180E
U+1168F    U+1180F
U+11690    U+11810
U+11691    U+11811
U+11692    U+11812
U+11693    U+11813
U+11694    U+11814
U+11695    U+11815
U+11696    U+11816
U+11697    U+11817
U+11698    U+11818
U+11699    U+11819
U+1169A    U+1181A
U+1169B    U+1181B
U+1169C    U+1181C
U+1169D    U+1181D
U+1169E    U+1181E
U+1169F    U+1181F
U+116A0    U+11820
U+116A1    U+11821
U+116A2    U+11822
U+116A3    U+11823
U+116A4    U+11824
U+116A5    U+11825
U+116A6    U+11826
U+116A7    U+11827
-          U+11828
U+116A8    U+11829
U+116A9    U+1182A
U+116AA    U+1182B

Table 3: Comparison of Chamba and Dogra consonant letters

ISO/IEC JTC 1/SC 2/WG 2
PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS
FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646

Please fill all the sections A, B and C below. Please read Principles and Procedures Document (P & P) from http://std.dkuug.dk/JTC1/SC2/WG2/docs/principles.html for guidelines and details before filling this form. Please ensure you are using the latest Form from http://std.dkuug.dk/JTC1/SC2/WG2/docs/summaryform.html. See also http://std.dkuug.dk/JTC1/SC2/WG2/docs/roadmaps.html for latest Roadmaps.

A. Administrative

1. Title: Proposal to encode the Dogra script in Unicode
2. Requester's name: Anshuman Pandey, Script Encoding Initiative
3. Requester type (Member body/Liaison/Individual contribution): Liaison contribution
4. Submission date: 4 November 2015
5. Requester's reference (if applicable):
6. Choose one of the following:
   This is a complete proposal: Yes
   (or) More information will be provided later:

B. Technical – General

1. Choose one of the following:
   a. This proposal is for a new script (set of characters): Yes
      Proposed name of script: Dogra
   b. The proposal is for addition of character(s) to an existing block:
      Name of the existing block:
2. Number of characters in proposal: 59
3. Proposed category (select one from below - see section 2.2 of P&P document):
   A-Contemporary
   B.1-Specialized (small collection)
   B.2-Specialized (large collection)
   C-Major extinct
   D-Attested extinct
   E-Minor extinct X
   F-Archaic Hieroglyphic or Ideographic
   G-Obscure or questionable usage symbols
4. Is a repertoire including character names provided? Yes
   a. If YES, are the names in accordance with the “character naming guidelines” in Annex L of P&P document?
   b. Are the character shapes attached in a legible form suitable for review?
5. Fonts related:
   a. Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the standard? Anshuman Pandey
   b. Identify the party granting a license for use of the font by the editors (include address, e-mail, ftp-site, etc.): Anshuman Pandey
6. References:
   a. Are references (to other character sets, dictionaries, descriptive texts etc.) provided? Yes
   b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? Yes
7. Special encoding issues: Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration etc. (if yes please enclose information)? Yes
8. Additional Information: Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org for such information on other scripts. Also see Unicode Character Database (http://www.unicode.org/reports/tr44/) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard.

Form number: N4502-F (Original 1994-10-14; Revised 1995-01, 1995-04, 1996-04, 1996-08, 1999-03, 2001-05, 2001-09, 2003-11, 2005-01, 2005-09, 2005-10, 2007-03, 2008-05, 2009-11, 2011-03, 2012-01)

C. Technical - Justification

1. Has this proposal for addition of character(s) been submitted before? No
   If YES, explain:
2. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? Yes
   If YES, with whom? Christopher Shackle (SOAS, London)
   If YES, available relevant documents:
3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? No
   Reference: Historical script, currently not used
4. The context of use for the proposed characters (type of use; common or rare): Common
   Reference:
5. Are the proposed characters in current use by the user community? No
   If YES, where? Reference:
6. After giving due considerations to the principles in the P&P document must the proposed characters be entirely in the BMP? N/A
   If YES, is a rationale provided?
   If YES, reference:
7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? Yes
8. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
9. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
10. Can any of the proposed character(s) be considered to be similar (in appearance or function) to, or could be confused with, an existing character? No
    If YES, is a rationale for its inclusion provided?
    If YES, reference:
11. Does the proposal include use of combining characters and/or use of composite sequences? Yes
    If YES, is a rationale for such use provided? Yes
    If YES, reference: See text of proposal
    Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided?
    If YES, reference:
12. Does the proposal contain characters with any special properties such as control function or similar semantics? Yes
    If YES, describe in detail (include attachment if necessary): Virama; see text of proposal
13. Does the proposal contain any Ideographic compatibility characters? No
    If YES, are the equivalent corresponding unified ideographic characters identified?
    If YES, reference:


# Indic_Syllabic_Category=Visarga
11837 ; Visarga # Mc DOGRA SIGN VISARGA

# Indic_Syllabic_Category=Nukta
11839 ; Nukta # Mn DOGRA SIGN NUKTA

# Indic_Syllabic_Category=Virama
11838 ; Virama # Mn DOGRA SIGN VIRAMA

# Indic_Syllabic_Category=Vowel_Independent
11800..11809 ; Vowel_Independent # Lo [10] DOGRA LETTER A..DOGRA LETTER AU

# Indic_Syllabic_Category=Vowel_Dependent
1182C..1182E ; Vowel_Dependent # Mc [3] DOGRA VOWEL SIGN AA..DOGRA VOWEL SIGN II
1182F..11835 ; Vowel_Dependent # Mn [7] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN AU

# Indic_Syllabic_Category=Consonant
1180A..1182B ; Consonant # Lo [34] DOGRA LETTER KA..DOGRA LETTER RRA
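As a rough illustration of how data in this layout could be consumed, the sketch below parses lines of the form `start..end ; Value # comment` into a list of ranges and looks up a code point. The sample rows use the code points from the names list in this proposal; the helper names `parse_property_lines` and `lookup` are hypothetical and not part of any Unicode tooling.

```python
from typing import List, Optional, Tuple

# Sample rows in UCD range format, using the code points proposed in this document.
SAMPLE = """\
11837 ; Visarga # Mc DOGRA SIGN VISARGA
11839 ; Nukta # Mn DOGRA SIGN NUKTA
11838 ; Virama # Mn DOGRA SIGN VIRAMA
11800..11809 ; Vowel_Independent # Lo [10] DOGRA LETTER A..DOGRA LETTER AU
1182C..1182E ; Vowel_Dependent # Mc [3] DOGRA VOWEL SIGN AA..DOGRA VOWEL SIGN II
1182F..11835 ; Vowel_Dependent # Mn [7] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN AU
1180A..1182B ; Consonant # Lo [34] DOGRA LETTER KA..DOGRA LETTER RRA
"""

Range = Tuple[int, int, str]  # (first code point, last code point, property value)


def parse_property_lines(text: str) -> List[Range]:
    """Parse 'XXXX[..YYYY] ; Value # comment' lines into inclusive ranges."""
    ranges: List[Range] = []
    for line in text.splitlines():
        data = line.split("#", 1)[0].strip()  # drop the trailing comment
        if not data:
            continue  # blank line or comment-only line
        cps, value = (field.strip() for field in data.split(";"))
        start, _, end = cps.partition("..")
        ranges.append((int(start, 16), int(end or start, 16), value))
    return ranges


def lookup(ranges: List[Range], cp: int) -> Optional[str]:
    """Return the property value covering cp, or None if no range matches."""
    for start, end, value in ranges:
        if start <= cp <= end:
            return value
    return None


RANGES = parse_property_lines(SAMPLE)
```

A linear scan is enough for a handful of ranges; a tool working with the full database would more likely sort the ranges and use binary search.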

4.4 Positional categories

Positional data for characters in the format of IndicPositionalCategory.txt:

# Indic_Positional_Category=Right
1182C ; Right # Mc DOGRA VOWEL SIGN AA
1182E ; Right # Mc DOGRA VOWEL SIGN II
11837 ; Right # Mc DOGRA SIGN VISARGA

# Indic_Positional_Category=Left
1182D ; Left # Mc DOGRA VOWEL SIGN I

# Indic_Positional_Category=Top
11836 ; Top # Mn DOGRA SIGN ANUSVARA
11832..11835 ; Top # Mn [4] DOGRA VOWEL SIGN E..DOGRA VOWEL SIGN AU

# Indic_Positional_Category=Bottom
1182F..11831 ; Bottom # Mn [3] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN VOCALIC R
11838 ; Bottom # Mn DOGRA SIGN VIRAMA
11839 ; Bottom # Mn DOGRA SIGN NUKTA
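To illustrate what the Left category implies for display, the following sketch moves a left-positioned vowel sign in front of the consonant it follows in logical order. It is a deliberately simplified model that assumes a single consonant (optionally followed by a nukta) per syllable and ignores conjuncts; the function name `visual_order` is hypothetical, and real shaping engines do considerably more.

```python
# Code points as proposed in this document.
LEFT_SIGNS = {0x1182D}                # DOGRA VOWEL SIGN I (Indic_Positional_Category=Left)
CONSONANTS = range(0x1180A, 0x1182C)  # DOGRA LETTER KA..DOGRA LETTER RRA
NUKTA = 0x11839                       # DOGRA SIGN NUKTA


def visual_order(cps):
    """Return code points in display order, moving left-side signs before their base."""
    out = []
    for cp in cps:
        if cp in LEFT_SIGNS and out:
            i = len(out) - 1
            # Step back over a nukta so the sign lands before the base consonant.
            if out[i] == NUKTA and i > 0:
                i -= 1
            if out[i] in CONSONANTS:
                out.insert(i, cp)
                continue
        # Everything else keeps its logical position.
        out.append(cp)
    return out
```

For example, the logical sequence KA + VOWEL SIGN I (`[0x1180A, 0x1182D]`) comes back with the vowel sign first, while KA + VOWEL SIGN AA is left unchanged because AA is positioned on the right.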

4.5 Script extensions

The following characters should be extended for usage with the Dogra script in ScriptExtensions.txt:

0964 # Po DEVANAGARI DANDA
0965 # Po DEVANAGARI DOUBLE DANDA
0966..096F # Nd [10] DEVANAGARI DIGIT ZERO..DEVANAGARI DIGIT NINE
A830..A835 # No [6] NORTH INDIC FRACTION ONE QUARTER..NORTH INDIC FRACTION THREE SIXTEENTHS
A836..A837 # So [2] NORTH INDIC QUARTER MARK..NORTH INDIC PLACEHOLDER MARK
A838 # Sc NORTH INDIC RUPEE MARK
A839 # So NORTH INDIC QUANTITY MARK
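One practical consequence of these extensions is that a check for "Dogra text" has to accept the shared dandas, digits, and North Indic number forms as well as the Dogra characters themselves. The sketch below shows one way such a check might look, using the ranges proposed in this document; the names `usable_in_dogra` and `foreign_chars` are hypothetical.

```python
# Characters proposed for Dogra in this document: 11800..1183A.
DOGRA_RANGE = (0x11800, 0x1183A)

# Shared characters listed for extension to Dogra.
EXTENSION_RANGES = [
    (0x0964, 0x0965),  # DEVANAGARI DANDA, DEVANAGARI DOUBLE DANDA
    (0x0966, 0x096F),  # DEVANAGARI DIGIT ZERO..DEVANAGARI DIGIT NINE
    (0xA830, 0xA839),  # NORTH INDIC fractions, marks, rupee and quantity marks
]


def usable_in_dogra(cp):
    """True if cp is a proposed Dogra character or one of the shared characters."""
    ranges = [DOGRA_RANGE] + EXTENSION_RANGES
    return any(start <= cp <= end for start, end in ranges)


def foreign_chars(text):
    """Return the non-space characters of text that fall outside those ranges."""
    return [ch for ch in text if not ch.isspace() and not usable_in_dogra(ord(ch))]
```

A Devanagari digit such as U+0967 passes the check, whereas a Devanagari letter such as U+0915 does not.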


4.6 ‘Confusable’ characters

Some Dogra characters that resemble characters encoded in other script blocks are:

DOGRA LETTER U             11187 SHARADA LETTER U
DOGRA LETTER UU            11188 SHARADA LETTER UU
DOGRA LETTER O             11684 TAKRI LETTER U
DOGRA LETTER KA            0A15 GURMUKHI LETTER KA
DOGRA LETTER CA            11196 SHARADA LETTER CA
DOGRA LETTER TTHA          1119C SHARADA LETTER TTHA
DOGRA LETTER TA            11699 TAKRI LETTER TA
DOGRA LETTER DHA           11680 TAKRI LETTER A
DOGRA LETTER PHA           1169F TAKRI LETTER PHA
DOGRA LETTER BHA           116A1 TAKRI LETTER BHA
DOGRA LETTER YA            116A3 TAKRI LETTER YA
DOGRA LETTER SHA           0936 DEVANAGARI LETTER SHA
DOGRA LETTER SSA           111B0 SHARADA LETTER SSA
DOGRA VOWEL SIGN U         0941 DEVANAGARI VOWEL SIGN U
DOGRA VOWEL SIGN UU        0942 DEVANAGARI VOWEL SIGN UU
DOGRA VOWEL SIGN E         116B2 TAKRI VOWEL SIGN E
DOGRA VOWEL SIGN AI        116B3 TAKRI VOWEL SIGN AI
DOGRA VOWEL SIGN O         116B4 TAKRI VOWEL SIGN O
DOGRA VOWEL SIGN AU        116B5 TAKRI VOWEL SIGN AU
DOGRA SIGN ANUSVARA        0902 DEVANAGARI SIGN ANUSVARA
DOGRA SIGN VIRAMA          094D DEVANAGARI SIGN VIRAMA
DOGRA SIGN NUKTA           093C DEVANAGARI SIGN NUKTA
DOGRA ABBREVIATION SIGN    0970 DEVANAGARI ABBREVIATION SIGN
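A list of this kind could be used to flag characters from other blocks that may have been typed in place of a Dogra character. The sketch below is a minimal illustration for text that is expected to be Dogra; it includes only a few of the pairs listed above, uses the Dogra code points from the names list in this proposal, and the names `LOOKALIKES` and `flag_lookalikes` are hypothetical.

```python
# Dogra code point (as proposed in this document) -> resembling character elsewhere.
# Only a subset of the listed pairs is included here.
LOOKALIKES = {
    0x11804: 0x11187,  # DOGRA LETTER U    ~ SHARADA LETTER U
    0x11808: 0x11684,  # DOGRA LETTER O    ~ TAKRI LETTER U
    0x1180A: 0x0A15,   # DOGRA LETTER KA   ~ GURMUKHI LETTER KA
    0x1181C: 0x11680,  # DOGRA LETTER DHA  ~ TAKRI LETTER A
    0x11827: 0x0936,   # DOGRA LETTER SHA  ~ DEVANAGARI LETTER SHA
    0x11838: 0x094D,   # DOGRA SIGN VIRAMA ~ DEVANAGARI SIGN VIRAMA
}

# Reverse view: foreign code point -> the Dogra character it resembles.
FOREIGN_TO_DOGRA = {foreign: dogra for dogra, foreign in LOOKALIKES.items()}


def flag_lookalikes(text):
    """Return (index, found code point, resembling Dogra code point) triples."""
    return [
        (i, ord(ch), FOREIGN_TO_DOGRA[ord(ch)])
        for i, ch in enumerate(text)
        if ord(ch) in FOREIGN_TO_DOGRA
    ]
```

The function only reports candidates; whether a flagged character is actually a mistake depends on context, since the same characters are entirely legitimate in Sharada, Takri, Gurmukhi, or Devanagari text.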

5 References

Bhāskarācārya. 1872. Līlavatī. Translation into Dogri. Jammu: Vidya Vilas Press.

Grierson, George A. 1916. The Linguistic Survey of India. Vol. IX. Indo-Aryan Family. Central Group. Part I. Specimens of Western Hindī and Pañjābī. Calcutta: Office of the Superintendent of Government Printing, India.

Leitner, Gottlieb William. 1882. “A Collection of Specimens of Commercial and Other Alphabets and Handwritings as also of Multiplication Tables Current in Various Parts of the Panjab, Sind and the North West Provinces”. Published as Appendix V of History of Indigenous Education in the Punjab. Lahore: Anjuman-i-Punjab Press.

Pandey, Anshuman. 2007. “Proposal to Encode North Indic Number Forms in ISO/IEC 10646” (L2/07-354). http://www.unicode.org/L2/L2007/07354-north-indic.pdf

Pandey, Anshuman. 2009. “Proposal to Encode the Takri Script in ISO/IEC 10646” (L2/09-424). http://www.unicode.org/L2/L2009/09424-takri.pdf

Pandey, Anshuman. 2015. “Preliminary Proposal to Encode the Dogra Script in Unicode” (L2/15-213). http://www.unicode.org/L2/L2015/15213-dogra.pdf

Staal, Fritz. 1984. Stamps of Jammu & Kashmir. New York: The Collectors Club.

von der Lin, Carol A. “Collection Kashmir”. “Philatelic Glossary Dōgrī-Takrī Scripts”. http://www.kashmirstamps.com/DogriGloss.html


6 Acknowledgments

I would like to thank Christopher Shackle (School of Oriental and African Studies, London) for reviewing this proposal.

This project was made possible in part through a Google Research Award, granted to Deborah Anderson for the Script Encoding Initiative, and a grant from the United States National Endowment for the Humanities (PR-50205-15), which funds the Universal Scripts Project (part of the Script Encoding Initiative at the University of California, Berkeley). Any views, findings, conclusions or recommendations expressed in this publication do not necessarily reflect those of Google or the National Endowment for the Humanities.

Dogra code chart: range 11800..1184F (columns 1180 to 1184, rows 0 to F), with proposed characters at 11800..1183A.

Independent vowels
11800 DOGRA LETTER A
11801 DOGRA LETTER AA
11802 DOGRA LETTER I
11803 DOGRA LETTER II
11804 DOGRA LETTER U
11805 DOGRA LETTER UU
11806 DOGRA LETTER E
11807 DOGRA LETTER AI
11808 DOGRA LETTER O
11809 DOGRA LETTER AU

Consonants
1180A DOGRA LETTER KA
1180B DOGRA LETTER KHA
1180C DOGRA LETTER GA
1180D DOGRA LETTER GHA
1180E DOGRA LETTER NGA
1180F DOGRA LETTER CA
11810 DOGRA LETTER CHA
11811 DOGRA LETTER JA
11812 DOGRA LETTER JHA
11813 DOGRA LETTER NYA
11814 DOGRA LETTER TTA
11815 DOGRA LETTER TTHA
11816 DOGRA LETTER DDA
11817 DOGRA LETTER DDHA
11818 DOGRA LETTER NNA
11819 DOGRA LETTER TA
1181A DOGRA LETTER THA
1181B DOGRA LETTER DA
1181C DOGRA LETTER DHA
1181D DOGRA LETTER NA
1181E DOGRA LETTER PA
1181F DOGRA LETTER PHA
11820 DOGRA LETTER BA
11821 DOGRA LETTER BHA
11822 DOGRA LETTER MA
11823 DOGRA LETTER YA
11824 DOGRA LETTER RA
11825 DOGRA LETTER LA
11826 DOGRA LETTER VA
11827 DOGRA LETTER SHA
11828 DOGRA LETTER SSA
11829 DOGRA LETTER SA
1182A DOGRA LETTER HA
1182B DOGRA LETTER RRA

Dependent vowel signs
1182C DOGRA VOWEL SIGN AA
1182D DOGRA VOWEL SIGN I
1182E DOGRA VOWEL SIGN II
1182F DOGRA VOWEL SIGN U
11830 DOGRA VOWEL SIGN UU
11831 DOGRA VOWEL SIGN VOCALIC R
11832 DOGRA VOWEL SIGN E
11833 DOGRA VOWEL SIGN AI
11834 DOGRA VOWEL SIGN O
11835 DOGRA VOWEL SIGN AU

Various signs
11836 DOGRA SIGN ANUSVARA
11837 DOGRA SIGN VISARGA
11838 DOGRA SIGN VIRAMA
11839 DOGRA SIGN NUKTA

Punctuation
1183A DOGRA ABBREVIATION SIGN
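As a quick consistency check, the groups in the names list can be tallied against the repertoire size of 59 stated in the proposal summary form. The sketch below does this arithmetic from the first and last code point of each group; the names `GROUPS` and `group_sizes` are hypothetical.

```python
# First and last code point of each group in the names list (inclusive).
GROUPS = {
    "Independent vowels": (0x11800, 0x11809),
    "Consonants": (0x1180A, 0x1182B),
    "Dependent vowel signs": (0x1182C, 0x11835),
    "Various signs": (0x11836, 0x11839),
    "Punctuation": (0x1183A, 0x1183A),
}


def group_sizes(groups):
    """Return the number of code points in each inclusive range."""
    return {name: end - start + 1 for name, (start, end) in groups.items()}


SIZES = group_sizes(GROUPS)
TOTAL = sum(SIZES.values())  # 10 + 34 + 10 + 4 + 1
```

The total comes to 59, matching the figure given under item B.2 of the summary form.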

Figure 1: Comparison of Old Dogra with Takri and other scripts (from Leitner 1882: Set 2).

Figure 2: Comparison of Old Dogra with Takri and other scripts (from Leitner 1882: Set 2).

Figure 3: Specimen of the Dogri language written in Old Dogra (from Grierson 1916: 760).

Figure 4: Another specimen of the Dogri language written in Old Dogra (from Grierson 1916: 772).

Figure 5: Chart of the Old Dogra script (from Grierson 1916: 641).

Figure 6: Comparison of Gurmukhi, Kangra, and Old Dogra scripts (from Grierson 1916: 642).

Figure 7: Comparison of New Dogra (‘Dogri’) and Devanagari vowel and consonant letters (from Staal 1984: 33).

Figure 8: Comparison of New Dogra (‘Dogri’) and Devanagari vowel signs (from Staal 1984: 34).

Figure 9: The first page of a Dogri translation of the Līlavatī by Bhāskarācārya, a Sanskrit treatise on mathematics, printed in the New Dogra script in 1872 (p. 1). Figure 10 contains the second page of this book.

Figure 10: Another page from the Līlavatī in New Dogra (p. 2).

Figure 11: Page from the Līlavatī showing the use of printed New Dogra digits (p. 3).

Figure 12: Usage of fractions for denoting currency in New Dogra (from Staal 1984: 75). The text appears to be typeset using metal fonts. The fraction signs and currency marks may be represented using characters from the ‘Common Indic Number Forms’ block (U+A830) in Unicode.

Figure 13: Two specimens of a trilingual stamp for expedited mail from Jammu State in Old Dogra, Persian, and Devanagari scripts. The Dogra text reads <U+1180A U+1180A U+11825> <U+11811 U+11824 U+1182F U+11824 U+1182E> kakal jarurī. The Persian reads خط ضروری ḵẖat zarūrī; the Devanagari reads आवश्यक पत्र āvaśyaka patra.

Figure 14 Four bilingual regular postage stamps from Jammu State with text in the Old Dogra andPersian scripts Stamps and are half lsquoannarsquo stamps and is a one lsquoannarsquo stamp In all stampsthe Dogra text at the top reads 716977171471684 jamu lsquoJammursquo and 7169071696717147172671716 kachamīr (= kaśmīr) lsquoKashmirrsquoIn stamps and the first line of the center is Persian انه نم ۱۹۲۳ 1923 [samvat] nim ānah lsquohalfānnārsquo the second line of the center is Dogra71680717077168071709 ada ana lsquohalf ānnārsquo In the first line of thecenter text is Persian انه یك yak ānah lsquoone ānnārsquo the second line of the center is Dogra71686716907168071709 ekana lsquoone ānnārsquo In all four stamps the third line of the center is the date 1923 [samvat] written inDevanagari-like digits १९२३ The Persian text at the bottom of the stamp reads جمون سرکار قلمروکشمیر و qalamrau sarkār jammūn o kaśmīr lsquodominions of the ruler of Jammu and Kashmirrsquo

29

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Detail of top left (Devanagari)

Detail of top right (New Dogra)

Detail of center (New Dogra Devanagari and Urdu)

Figure 15 The mast of a two rupee non-judicial stamp paper from Kashmir State

30

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 16 The mast of a five rupee non-judicial stamp paper from Kashmir state containing text inNew Dogra Devanagari and Arabic (Persian) scripts

Figure 17 A one anna stamp from Jammu and Kashmir state with text in New Dogra Devanagariand Arabic (Urdu)

31

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 18 Telegraph stamps from Jammu and Kashmir state with text in Dogra Devanagari Urduand English The Dogra denominations show usage of Common Indic fractions and the rupee mark

32

Proposal to encode the Dogra script in Unicode Anshuman Pandey

postakarda kalamrau jammu kasmir va tibbatahaldquoPostcard of the realm of Jammu Kashmir and Tibetrdquo

isa tarapha siva pate ke kucha na likholdquoDo no write anything on this side except for the addressrdquo

jamu sinagarldquoJammurdquo ldquoSrinagarrdquo

bhaisakh (4 ) 4 vaisakhldquoVaisakh 4rdquo ldquo4 Vaisakhrdquo

Figure 19 A state postcard from Jammu and Kashmir with post mark of 19 April 1891 Theinscription at top is in New Dogra and the content is in Persian The postmarks are in English andDogra

33

Proposal to encode the Dogra script in Unicode Anshuman Pandey

One rupee banknote

Twenty rupee banknote

Figure 20 Bank notes from Jammu and Kashmir State containing text in New Dogra Devanagariand Arabic (Persian) scripts

34

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Chamba Dogra

71296 71680

71297 71681

71298 71682

71299 mdash

71300 71684

71301 mdash

71302 71686

71303 71687

71304 71688

71305 mdash

Chamba Dogra

no dependent form

71341 71724

71342 71725

71343 71726

71344 71727

71345 71728

71346 71730

71347 71731

71348 71732

71349 71733

Table 2 Comparison of Chamba and Dogra vowel letters and signs

35

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Chamba Dogra

71306 71690

71307 71691

71308 71692

71309 71693

71310 71694

71311 71695

71312 71696

71313 71697

71314 71698

71315 71699

71316 71700

71317 71701

71318 71702

71319 71703

71320 71704

71321 71705

71322 71706

Chamba Dogra

71323 7170771324 7170871325 7170971326 7171071327 7171171328 7171271329 7171371330 7171471331 7171571332 7171671333 7171771334 7171871335 71719mdash 7172071336 7172171337 7172271338 71723

Table 3 Comparison of Chamba and Dogra consonant letters

36

ISOIEC JTC 1SC 2WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS

FOR ADDITIONS TO THE REPERTOIRE OF ISOIEC 10646TP

1PT

Please fill all the sections A B and C below Please read Principles and Procedures Document (P amp P) from HTUhttpstddkuugdkJTC1SC2WG2docsprincipleshtml UTH for

guidelines and details before filling this form Please ensure you are using the latest Form from HTUhttpstddkuugdkJTC1SC2WG2docssummaryformhtml UTH

See also HTUhttpstddkuugdkJTC1SC2WG2docsroadmapshtml UTH for latest Roadmaps

A Administrative

1 Title Proposal to encode the Dogra script in Unicode 2 Requesters name Anshuman Pandey Script Encoding Initiative 3 Requester type (Member bodyLiaisonIndividual contribution) Liaison contribution 4 Submission date 4 November 2015 5 Requesters reference (if applicable) 6 Choose one of the following This is a complete proposal Yes (or) More information will be provided later

B Technical ndash General 1 Choose one of the following a This proposal is for a new script (set of characters) Yes Proposed name of script Dogra b The proposal is for addition of character(s) to an existing block Name of the existing block

2 Number of characters in proposal 59

3 Proposed category (select one from below - see section 22 of PampP document) A-Contemporary B1-Specialized (small collection) B2-Specialized (large collection) C-Major extinct D-Attested extinct E-Minor extinct X F-Archaic Hieroglyphic or Ideographic G-Obscure or questionable usage symbols

4 Is a repertoire including character names provided Yes a If YES are the names in accordance with the ldquocharacter naming guidelinesrdquo in Annex L of PampP document b Are the character shapes attached in a legible form suitable for review

5 Fonts related a Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the

standard

Anshuman Pandey b Identify the party granting a license for use of the font by the editors (include address e-mail ftp-site etc) Anshuman Pandey

6 References a Are references (to other character sets dictionaries descriptive texts etc) provided Yes b Are published examples of use (such as samples from newspapers magazines or other sources) of proposed characters attached Yes

7 Special encoding issues Does the proposal address other aspects of character data processing (if applicable) such as input presentation sorting searching indexing transliteration etc (if yes please enclose information) Yes

8 Additional Information Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script Examples of such properties are Casing information Numeric information Currency information Display behaviour information such as line breaks widths etc Combining behaviour Spacing behaviour Directional behaviour Default Collation behaviour relevance in Mark Up contexts Compatibility equivalence and other Unicode normalization related information See the Unicode standard at HTUhttpwwwunicodeorgUTH for such information on other scripts Also see Unicode Character Database ( Hhttpwwwunicodeorgreportstr44 ) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard

TP

1PT Form number N4502-F (Original 1994-10-14 Revised 1995-01 1995-04 1996-04 1996-08 1999-03 2001-05 2001-09 2003-11

2005-01 2005-09 2005-10 2007-03 2008-05 2009-11 2011-03 2012-01)

C Technical - Justification

1 Has this proposal for addition of character(s) been submitted before No If YES explain

2 Has contact been made to members of the user community (for example National Body user groups of the script or characters other experts etc) Yes If YES with whom Christopher Shackle (SOAS London) If YES available relevant documents

3 Information on the user community for the proposed characters (for example size demographics information technology use or publishing use) is included No Reference Historical script currently not used

4 The context of use for the proposed characters (type of use common or rare) Common Reference

5 Are the proposed characters in current use by the user community No If YES where Reference

6 After giving due considerations to the principles in the PampP document must the proposed characters be entirely in the BMP NA If YES is a rationale provided If YES reference

7 Should the proposed characters be kept together in a contiguous range (rather than being scattered) Yes 8 Can any of the proposed characters be considered a presentation form of an existing character or character sequence No If YES is a rationale for its inclusion provided If YES reference

9 Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters No If YES is a rationale for its inclusion provided If YES reference

10 Can any of the proposed character(s) be considered to be similar (in appearance or function) to or could be confused with an existing character No

If YES is a rationale for its inclusion provided If YES reference

11 Does the proposal include use of combining characters andor use of composite sequences Yes If YES is a rationale for such use provided Yes If YES reference See text of proposal Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided If YES reference

12 Does the proposal contain characters with any special properties such as control function or similar semantics Yes If YES describe in detail (include attachment if necessary) Virama

see text of proposal 13 Does the proposal contain any Ideographic compatibility characters No If YES are the equivalent corresponding unified ideographic characters identified If YES reference

Proposal to encode the Dogra script in Unicode Anshuman Pandey

46 lsquoConfusablersquo characters

Some Dogra characters that resemble characters encoded in other script blocks are

DOGRA LETTER U 11187 SHARADA LETTER UDOGRA LETTER UU 11188 SHARADA LETTER UUDOGRA LETTER O 11684 TAKRI LETTER UDOGRA LETTER KA 0A15 GURMUKHI LETTER KADOGRA LETTER CA 11196 SHARADA LETTER CADOGRA LETTER TTHA 1119C SHARADA LETTER TTHADOGRA LETTER TA 11699 TAKRI LETTER TADOGRA LETTER DHA 11680 TAKRI LETTER ADOGRA LETTER PHA 1169F TAKRI LETTER PHADOGRA LETTER BHA 116A1 TAKRI LETTER BHADOGRA LETTER YA 116A3 TAKRI LETTER YADOGRA LETTER SHA 0936 DEVANAGARI LETTER SHADOGRA LETTER SSA 111B0 SHARADA LETTER SSADOGRA VOWEL SIGN U 0941 DEVANAGARI VOWEL SIGN UDOGRA VOWEL SIGN UU 0942 DEVANAGARI VOWEL SIGN UUDOGRA VOWEL SIGN E 116B2 TAKRI VOWEL SIGN EDOGRA VOWEL SIGN AI 116B3 TAKRI VOWEL SIGN AIDOGRA VOWEL SIGN O 116B4 TAKRI VOWEL SIGN ODOGRA VOWEL SIGN AU 116B5 TAKRI VOWEL SIGN AUDOGRA SIGN ANUSVARA 0902 DEVANAGARI SIGN ANUSVARADOGRA SIGN VIRAMA 094D DEVANAGARI SIGN VIRAMADOGRA SIGN NUKTA 093C DEVANAGARI SIGN NUKTADOGRA ABBREVIATION SIGN 0970 DEVANAGARI ABBREVIATION SIGN

5 References

Bhāskarācārya 1872 Līlavatī Translation into Dogri Jammu Vidya Vilas Press

Grierson George A 1916 The Linguistic Survey of India Vol IX Indo-Aryan Family Central GroupPart I Specimens of Western Hindī and Pantildejābī Calcutta Office of the Superintendent of GovernmentPrinting India

Leitner Gottlieb William 1882 ldquoA Collection of Specimens of Commercial and Other Alphabets andHandwritings as also of Multiplication Tables Current in Various Parts of the Panjab Sind and the NorthWest Provincesrdquo Published as Appendix V of History of Indigenous Education in the Punjab LahoreAnjuman-i-Punjab Press

Pandey Anshuman 2007 ldquoProposal to Encode North Indic Number Forms in ISOIEC 10646rdquo (L207-354)httpwwwunicodeorgL2L200707354-north-indicpdf

mdashmdashmdash 2009 ldquoProposal to Encode the Takri Script in ISOIEC 10646rdquo (L209-424)httpwwwunicodeorgL2L200909424-takripdf

mdashmdashmdash 2015 ldquoPreliminary Proposal to Encode the Dogra Script in Unicoderdquo (L215-213)httpwwwunicodeorgL2L201515213-dograpdf

Staal Fritz 1984 Stamps of Jammu amp Kashmir New York The Collectors Club

von der Lin Carol A ldquoCollection Kashmirrdquo ldquoPhilatelic Glossary Dōgrī-Takrī ScriptsrdquohttpwwwkashmirstampscomDogriGlosshtml

14

Proposal to encode the Dogra script in Unicode Anshuman Pandey

6 Acknowledgments

I would like to thank Christopher Shackle (School of Oriental and African Studies London) for reviewingthis proposal

This project was made possible in part through a Google Research Award granted to Deborah Anderson forthe Script Encoding Initiative and a grant from the United States National Endowment for the Humanities(PR-50205-15) which funds the Universal Scripts Project (part of the Script Encoding Initiative at the Uni-versity of California Berkeley) Any views findings conclusions or recommendations expressed in thispublication do not necessarily reflect those of Google or the National Endowment for the Humanities

15

Printed using UniBooktrade(httpwwwunicodeorgunibook)

Printed 04-Nov-2015 1

1184FDogra11800

1180 1181 1182 1183 1184

a

71681

i

71683

u

71685

e

71687

o

71689

71690

71691

71692

71693

71694

71695

71696

71697

71698

71699

71700

71701

71702

71703

71704

71705

71706

71707

71708

71709

71710

71711

71712

71713

71714

71715

71716

71717

71718

71719

71720

71721

71722

71723

$71724

$ i

$71726

$ u

$ 71728

$ r

$ e

$ 71731

$ o

$ 71733

$ 71734

$71735

$ 71736

$ 71737

71738

11800

11801

11802

11803

11804

11805

11806

11807

11808

11809

1180A

1180B

1180C

1180D

1180E

1180F

11810

11811

11812

11813

11814

11815

11816

11817

11818

11819

1181A

1181B

1181C

1181D

1181E

1181F

11820

11821

11822

11823

11824

11825

11826

11827

11828

11829

1182A

1182B

1182C

1182D

1182E

1182F

11830

11831

11832

11833

11834

11835

11836

11837

11838

11839

1183A

0

1

2

3

4

5

6

7

8

9

A

B

C

D

E

F

Printed using UniBooktrade(httpwwwunicodeorgunibook)

Printed 04-Nov-2015 2

Dogra (11800-1183A)

Punctuation
1183A DOGRA ABBREVIATION SIGN

Independent vowels
11800 DOGRA LETTER A
11801 DOGRA LETTER AA
11802 DOGRA LETTER I
11803 DOGRA LETTER II
11804 DOGRA LETTER U
11805 DOGRA LETTER UU
11806 DOGRA LETTER E
11807 DOGRA LETTER AI
11808 DOGRA LETTER O
11809 DOGRA LETTER AU

Consonants
1180A DOGRA LETTER KA
1180B DOGRA LETTER KHA
1180C DOGRA LETTER GA
1180D DOGRA LETTER GHA
1180E DOGRA LETTER NGA
1180F DOGRA LETTER CA
11810 DOGRA LETTER CHA
11811 DOGRA LETTER JA
11812 DOGRA LETTER JHA
11813 DOGRA LETTER NYA
11814 DOGRA LETTER TTA
11815 DOGRA LETTER TTHA
11816 DOGRA LETTER DDA
11817 DOGRA LETTER DDHA
11818 DOGRA LETTER NNA
11819 DOGRA LETTER TA
1181A DOGRA LETTER THA
1181B DOGRA LETTER DA
1181C DOGRA LETTER DHA
1181D DOGRA LETTER NA
1181E DOGRA LETTER PA
1181F DOGRA LETTER PHA
11820 DOGRA LETTER BA
11821 DOGRA LETTER BHA
11822 DOGRA LETTER MA
11823 DOGRA LETTER YA
11824 DOGRA LETTER RA
11825 DOGRA LETTER LA
11826 DOGRA LETTER VA
11827 DOGRA LETTER SHA
11828 DOGRA LETTER SSA
11829 DOGRA LETTER SA
1182A DOGRA LETTER HA
1182B DOGRA LETTER RRA

Dependent vowel signs
1182C DOGRA VOWEL SIGN AA
1182D DOGRA VOWEL SIGN I
1182E DOGRA VOWEL SIGN II
1182F DOGRA VOWEL SIGN U
11830 DOGRA VOWEL SIGN UU
11831 DOGRA VOWEL SIGN VOCALIC R
11832 DOGRA VOWEL SIGN E
11833 DOGRA VOWEL SIGN AI
11834 DOGRA VOWEL SIGN O
11835 DOGRA VOWEL SIGN AU

Various signs
11836 DOGRA SIGN ANUSVARA
11837 DOGRA SIGN VISARGA
11838 DOGRA SIGN VIRAMA
11839 DOGRA SIGN NUKTA
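The groupings in the names list above can be expressed as code point ranges. The following is a minimal sketch, assuming the code points exactly as proposed in this document (a published version of the standard may assign them differently); the names `DOGRA_RANGES`, `dogra_category`, and `proposed_character_count` are illustrative and not part of the proposal.

```python
# Ranges transcribed from the names list of this proposal (inclusive bounds).
# The labels and helper names are hypothetical, chosen for illustration only.
DOGRA_RANGES = [
    (0x11800, 0x11809, "independent vowel"),
    (0x1180A, 0x1182B, "consonant"),
    (0x1182C, 0x11835, "dependent vowel sign"),
    (0x11836, 0x11839, "various sign"),
    (0x1183A, 0x1183A, "punctuation"),
]


def dogra_category(code_point):
    """Return the names-list group for a proposed Dogra code point, or None."""
    for first, last, label in DOGRA_RANGES:
        if first <= code_point <= last:
            return label
    return None


def proposed_character_count():
    """Count the code points covered by the ranges above."""
    return sum(last - first + 1 for first, last, _ in DOGRA_RANGES)


if __name__ == "__main__":
    print(dogra_category(0x1180A))      # consonant
    print(dogra_category(0x11838))      # various sign
    print(proposed_character_count())   # 59
```

The total of 59 agrees with the number of characters stated in the proposal summary form.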

Figure 1: Comparison of Old Dogra with Takri and other scripts (from Leitner 1882: Set 2).

Figure 2: Comparison of Old Dogra with Takri and other scripts (from Leitner 1882: Set 2).

Figure 3: Specimen of the Dogri language written in Old Dogra (from Grierson 1916: 760).

Figure 4: Another specimen of the Dogri language written in Old Dogra (from Grierson 1916: 772).

Figure 5: Chart of the Old Dogra script (from Grierson 1916: 641).

Figure 6: Comparison of Gurmukhi, Kangra, and Old Dogra scripts (from Grierson 1916: 642).

Figure 7: Comparison of New Dogra (‘Dogri’) and Devanagari vowel and consonant letters (from Staal 1984: 33).

Figure 8: Comparison of New Dogra (‘Dogri’) and Devanagari vowel signs (from Staal 1984: 34).

Figure 9: The first page of a Dogri translation of the Līlavatī by Bhāskarācārya, a Sanskrit treatise on mathematics, printed in the New Dogra script in 1872 (p. 1). Figure 10 contains the second page of this book.

Figure 10: Another page from the Līlavatī in New Dogra (p. 2).

Figure 11: Page from the Līlavatī showing the use of printed New Dogra digits (p. 3).

Figure 12: Usage of fractions for denoting currency in New Dogra (from Staal 1984: 75). The text appears to be typeset using metal fonts. The fraction signs and currency marks may be represented using characters from the ‘Common Indic Number Forms’ block (U+A830) in Unicode.
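The characters of the ‘Common Indic Number Forms’ block mentioned in the caption of Figure 12 can be inspected with Python's standard `unicodedata` module. This is only an illustrative sketch: the helper name `common_indic_number_forms` is hypothetical, and which of these characters corresponds to which sign in the figure is not determined here.

```python
import unicodedata


def common_indic_number_forms():
    """List (code point, name, numeric value) for U+A830..U+A839.

    The numeric value is None for the marks, which carry no numeric property.
    """
    rows = []
    for cp in range(0xA830, 0xA83A):
        ch = chr(cp)
        rows.append((cp, unicodedata.name(ch), unicodedata.numeric(ch, None)))
    return rows


if __name__ == "__main__":
    for cp, name, value in common_indic_number_forms():
        print(f"U+{cp:04X}  {name}  {value}")
```

The block contains six fraction signs (quarters and sixteenths) followed by four marks, among them the rupee mark.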

Figure 13: Two specimens of a trilingual stamp for expedited mail from Jammu State in Old Dogra, Persian, and Devanagari scripts. The Dogra text reads U+1180A U+1180A U+11825 (kakal) U+11811 U+11824 U+1182F U+11824 U+1182E (jarurī). The Persian reads خط ضروری ḵẖat zarūrī; the Devanagari reads आवश्यक पत्र āvaśyaka patra.

Figure 14: Four bilingual regular postage stamps from Jammu State with text in the Old Dogra and Persian scripts. Some are half ‘anna’ stamps and one is a one ‘anna’ stamp. In all stamps the Dogra text at the top reads U+11811 U+11822 U+11804 jamu ‘Jammu’ and U+1180A U+11810 U+11822 U+1182E U+11824 kachamīr (= kaśmīr) ‘Kashmir’. In the half ‘anna’ stamps the first line of the center is Persian ۱۹۲۳ نم انه 1923 [samvat] nim ānah ‘half ānnā’; the second line of the center is Dogra U+11800 U+1181B U+11800 U+1181D ada ana ‘half ānnā’. In the one ‘anna’ stamp the first line of the center text is Persian یك انه yak ānah ‘one ānnā’; the second line of the center is Dogra U+11806 U+1180A U+11800 U+1181D ekana ‘one ānnā’. In all four stamps the third line of the center is the date 1923 [samvat] written in Devanagari-like digits १९२३. The Persian text at the bottom of the stamp reads قلمرو سرکار جمون و کشمیر qalamrau sarkār jammūn o kaśmīr ‘dominions of the ruler of Jammu and Kashmir’.
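The Dogra strings quoted in Figures 13 and 14 can be cross-checked against the names list. The sketch below assumes that each string is a sequence of the proposed code points, written here as decimal integers (for example, 71690 is hexadecimal 1180A). The table `PROPOSED_NAMES` is a hand-copied subset of the names list and `describe` is a hypothetical helper; neither is part of the proposal.

```python
# Subset of the proposed names list, copied by hand for the stamp texts only.
PROPOSED_NAMES = {
    0x11800: "DOGRA LETTER A",
    0x11804: "DOGRA LETTER U",
    0x11806: "DOGRA LETTER E",
    0x1180A: "DOGRA LETTER KA",
    0x11810: "DOGRA LETTER CHA",
    0x11811: "DOGRA LETTER JA",
    0x1181B: "DOGRA LETTER DA",
    0x1181D: "DOGRA LETTER NA",
    0x11822: "DOGRA LETTER MA",
    0x11824: "DOGRA LETTER RA",
    0x11825: "DOGRA LETTER LA",
    0x1182E: "DOGRA VOWEL SIGN II",
    0x1182F: "DOGRA VOWEL SIGN U",
}


def describe(code_points):
    """Render each code point as '<hex> <proposed name>'."""
    return [f"{cp:X} {PROPOSED_NAMES[cp]}" for cp in code_points]


# Decimal values of the two words on the expedited-mail stamp (Figure 13).
KAKAL = [71690, 71690, 71717]
JARURI = [71697, 71716, 71727, 71716, 71726]

if __name__ == "__main__":
    for line in describe(KAKAL) + describe(JARURI):
        print(line)
```

Read in order, the names (KA, KA, LA and JA, RA, U, RA, II) agree with the transliterations kakal and jarurī given in the caption.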

Detail of top left (Devanagari)

Detail of top right (New Dogra)

Detail of center (New Dogra, Devanagari, and Urdu)

Figure 15: The mast of a two rupee non-judicial stamp paper from Kashmir State.

Figure 16: The mast of a five rupee non-judicial stamp paper from Kashmir state containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.

Figure 17: A one anna stamp from Jammu and Kashmir state with text in New Dogra, Devanagari, and Arabic (Urdu).

Figure 18: Telegraph stamps from Jammu and Kashmir state with text in Dogra, Devanagari, Urdu, and English. The Dogra denominations show usage of Common Indic fractions and the rupee mark.

postakarda kalamrau jammu kasmir va tibbataha
“Postcard of the realm of Jammu, Kashmir, and Tibet”

isa tarapha siva pate ke kucha na likho
“Do not write anything on this side except for the address”

jamu sinagar
“Jammu” “Srinagar”

bhaisakh (4 ) 4 vaisakh
“Vaisakh 4” “4 Vaisakh”

Figure 19: A state postcard from Jammu and Kashmir with postmark of 19 April 1891. The inscription at top is in New Dogra and the content is in Persian. The postmarks are in English and Dogra.

One rupee banknote

Twenty rupee banknote

Figure 20: Bank notes from Jammu and Kashmir State containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.

Vowel letters

Chamba      Dogra
U+11680     U+11800
U+11681     U+11801
U+11682     U+11802
U+11683     (none)
U+11684     U+11804
U+11685     (none)
U+11686     U+11806
U+11687     U+11807
U+11688     U+11808
U+11689     (none)

Vowel signs

Chamba      Dogra
(no dependent form)
U+116AD     U+1182C
U+116AE     U+1182D
U+116AF     U+1182E
U+116B0     U+1182F
U+116B1     U+11830
U+116B2     U+11832
U+116B3     U+11833
U+116B4     U+11834
U+116B5     U+11835

Table 2: Comparison of Chamba and Dogra vowel letters and signs.

Chamba      Dogra
U+1168A     U+1180A
U+1168B     U+1180B
U+1168C     U+1180C
U+1168D     U+1180D
U+1168E     U+1180E
U+1168F     U+1180F
U+11690     U+11810
U+11691     U+11811
U+11692     U+11812
U+11693     U+11813
U+11694     U+11814
U+11695     U+11815
U+11696     U+11816
U+11697     U+11817
U+11698     U+11818
U+11699     U+11819
U+1169A     U+1181A
U+1169B     U+1181B
U+1169C     U+1181C
U+1169D     U+1181D
U+1169E     U+1181E
U+1169F     U+1181F
U+116A0     U+11820
U+116A1     U+11821
U+116A2     U+11822
U+116A3     U+11823
U+116A4     U+11824
U+116A5     U+11825
U+116A6     U+11826
U+116A7     U+11827
(none)      U+11828
U+116A8     U+11829
U+116A9     U+1182A
U+116AA     U+1182B

Table 3: Comparison of Chamba and Dogra consonant letters.

ISO/IEC JTC 1/SC 2/WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646 [1]

Please fill all the sections A, B and C below. Please read Principles and Procedures Document (P & P) from http://std.dkuug.dk/JTC1/SC2/WG2/docs/principles.html for guidelines and details before filling this form. Please ensure you are using the latest Form from http://std.dkuug.dk/JTC1/SC2/WG2/docs/summaryform.html. See also http://std.dkuug.dk/JTC1/SC2/WG2/docs/roadmaps.html for latest Roadmaps.

A. Administrative

1. Title: Proposal to encode the Dogra script in Unicode
2. Requester's name: Anshuman Pandey, Script Encoding Initiative
3. Requester type (Member body/Liaison/Individual contribution): Liaison contribution
4. Submission date: 4 November 2015
5. Requester's reference (if applicable):
6. Choose one of the following:
   This is a complete proposal: Yes
   (or) More information will be provided later:

B. Technical - General

1. Choose one of the following:
   a. This proposal is for a new script (set of characters): Yes
      Proposed name of script: Dogra
   b. The proposal is for addition of character(s) to an existing block:
      Name of the existing block:
2. Number of characters in proposal: 59
3. Proposed category (select one from below - see section 2.2 of P&P document):
   A-Contemporary
   B.1-Specialized (small collection)
   B.2-Specialized (large collection)
   C-Major extinct
   D-Attested extinct
   E-Minor extinct: X
   F-Archaic Hieroglyphic or Ideographic
   G-Obscure or questionable usage symbols
4. Is a repertoire including character names provided? Yes
   a. If YES, are the names in accordance with the “character naming guidelines” in Annex L of P&P document?
   b. Are the character shapes attached in a legible form suitable for review?
5. Fonts related:
   a. Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the standard? Anshuman Pandey
   b. Identify the party granting a license for use of the font by the editors (include address, e-mail, ftp-site, etc.): Anshuman Pandey
6. References:
   a. Are references (to other character sets, dictionaries, descriptive texts etc.) provided? Yes
   b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? Yes
7. Special encoding issues: Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration etc. (if yes please enclose information)? Yes
8. Additional Information: Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org for such information on other scripts. Also see Unicode Character Database (http://www.unicode.org/reports/tr44/) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard.

[1] Form number: N4502-F (Original 1994-10-14; Revised 1995-01, 1995-04, 1996-04, 1996-08, 1999-03, 2001-05, 2001-09, 2003-11, 2005-01, 2005-09, 2005-10, 2007-03, 2008-05, 2009-11, 2011-03, 2012-01)

C. Technical - Justification

1. Has this proposal for addition of character(s) been submitted before? No
   If YES explain:
2. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? Yes
   If YES, with whom? Christopher Shackle (SOAS, London)
   If YES, available relevant documents:
3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? No
   Reference: Historical script, currently not used
4. The context of use for the proposed characters (type of use; common or rare): Common
   Reference:
5. Are the proposed characters in current use by the user community? No
   If YES, where? Reference:
6. After giving due considerations to the principles in the P&P document must the proposed characters be entirely in the BMP? N/A
   If YES, is a rationale provided?
   If YES, reference:
7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? Yes
8. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
9. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
10. Can any of the proposed character(s) be considered to be similar (in appearance or function) to, or could be confused with, an existing character? No
    If YES, is a rationale for its inclusion provided?
    If YES, reference:
11. Does the proposal include use of combining characters and/or use of composite sequences? Yes
    If YES, is a rationale for such use provided? Yes
    If YES, reference: See text of proposal
    Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided?
    If YES, reference:
12. Does the proposal contain characters with any special properties such as control function or similar semantics? Yes
    If YES, describe in detail (include attachment if necessary): Virama; see text of proposal
13. Does the proposal contain any Ideographic compatibility characters? No
    If YES, are the equivalent corresponding unified ideographic characters identified?
    If YES, reference:

6 Acknowledgments

I would like to thank Christopher Shackle (School of Oriental and African Studies, London) for reviewing this proposal.

This project was made possible in part through a Google Research Award, granted to Deborah Anderson for the Script Encoding Initiative, and a grant from the United States National Endowment for the Humanities (PR-50205-15), which funds the Universal Scripts Project (part of the Script Encoding Initiative at the University of California, Berkeley). Any views, findings, conclusions, or recommendations expressed in this publication do not necessarily reflect those of Google or the National Endowment for the Humanities.


Printed using UniBooktrade(httpwwwunicodeorgunibook)

Printed 04-Nov-2015 1

1184FDogra11800

1180 1181 1182 1183 1184

a

71681

i

71683

u

71685

e

71687

o

71689

71690

71691

71692

71693

71694

71695

71696

71697

71698

71699

71700

71701

71702

71703

71704

71705

71706

71707

71708

71709

71710

71711

71712

71713

71714

71715

71716

71717

71718

71719

71720

71721

71722

71723

$71724

$ i

$71726

$ u

$ 71728

$ r

$ e

$ 71731

$ o

$ 71733

$ 71734

$71735

$ 71736

$ 71737

71738

11800

11801

11802

11803

11804

11805

11806

11807

11808

11809

1180A

1180B

1180C

1180D

1180E

1180F

11810

11811

11812

11813

11814

11815

11816

11817

11818

11819

1181A

1181B

1181C

1181D

1181E

1181F

11820

11821

11822

11823

11824

11825

11826

11827

11828

11829

1182A

1182B

1182C

1182D

1182E

1182F

11830

11831

11832

11833

11834

11835

11836

11837

11838

11839

1183A

0

1

2

3

4

5

6

7

8

9

A

B

C

D

E

F

Printed using UniBooktrade(httpwwwunicodeorgunibook)

Printed 04-Nov-2015 2

1183ADogra11800

Punctuation1183A 71738 DOGRA ABBREVIATION SIGN

Independent vowels11800 a DOGRA LETTER A11801 71681 DOGRA LETTER AA11802 i DOGRA LETTER I11803 71683 DOGRA LETTER II11804 u DOGRA LETTER U11805 71685 DOGRA LETTER UU11806 e DOGRA LETTER E11807 71687 DOGRA LETTER AI11808 o DOGRA LETTER O11809 71689 DOGRA LETTER AU

Consonants1180A 71690 DOGRA LETTER KA1180B 71691 DOGRA LETTER KHA1180C 71692 DOGRA LETTER GA1180D 71693 DOGRA LETTER GHA1180E 71694 DOGRA LETTER NGA1180F 71695 DOGRA LETTER CA11810 71696 DOGRA LETTER CHA11811 71697 DOGRA LETTER JA11812 71698 DOGRA LETTER JHA11813 71699 DOGRA LETTER NYA11814 71700 DOGRA LETTER TTA11815 71701 DOGRA LETTER TTHA11816 71702 DOGRA LETTER DDA11817 71703 DOGRA LETTER DDHA11818 71704 DOGRA LETTER NNA11819 71705 DOGRA LETTER TA1181A 71706 DOGRA LETTER THA1181B 71707 DOGRA LETTER DA1181C 71708 DOGRA LETTER DHA1181D 71709 DOGRA LETTER NA1181E 71710 DOGRA LETTER PA1181F 71711 DOGRA LETTER PHA11820 71712 DOGRA LETTER BA11821 71713 DOGRA LETTER BHA11822 71714 DOGRA LETTER MA11823 71715 DOGRA LETTER YA11824 71716 DOGRA LETTER RA11825 71717 DOGRA LETTER LA11826 71718 DOGRA LETTER VA11827 71719 DOGRA LETTER SHA11828 71720 DOGRA LETTER SSA11829 71721 DOGRA LETTER SA1182A 71722 DOGRA LETTER HA1182B 71723 DOGRA LETTER RRA

Dependent vowel signs1182C $71724 DOGRA VOWEL SIGN AA1182D $ i DOGRA VOWEL SIGN I1182E $71726 DOGRA VOWEL SIGN II1182F $ u DOGRA VOWEL SIGN U11830 $ 71728 DOGRA VOWEL SIGN UU11831 $ r DOGRA VOWEL SIGN VOCALIC R11832 $ e DOGRA VOWEL SIGN E11833 $ 71731 DOGRA VOWEL SIGN AI11834 $ o DOGRA VOWEL SIGN O11835 $ 71733 DOGRA VOWEL SIGN AU

Various signs11836 $ 71734 DOGRA SIGN ANUSVARA11837 $71735 DOGRA SIGN VISARGA11838 $ 71736 DOGRA SIGN VIRAMA11839 $ 71737 DOGRA SIGN NUKTA

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 1 Comparison of Old Dogra with Takri and other scripts (from Leitner 1882 Set 2)

18

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 2 Comparison of Old Dogra with Takri and other scripts (from Leitner 1882 Set 2)

19

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 3 Specimen of the Dogri language written in Old Dogra (from Grierson 1916 760)

20

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 4 Another specimen of the Dogri language written in Old Dogra (fromGrierson 1916 772)

21

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 5 Chart of the Old Dogra script (from Grierson 1916 641)

22

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 6 Comparison of Gurmukhi Kangra and Old Dogra scripts (from Grierson 1916 642)

23

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 7 Comparison of New Dogra (lsquoDogrirsquo) and Devanagari vowel and consonant letters (fromStaal 1984 33)

Figure 8 Comparison of New Dogra (lsquoDogrirsquo) and Devanagari vowel signs (from Staal 1984 34)

24

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 9: The first page of a Dogri translation of the Līlavatī by Bhāskarācārya, a Sanskrit treatise on mathematics, printed in the New Dogra script in 1872 (p. 1). Figure 10 contains the second page of this book.

Figure 10: Another page from the Līlavatī in New Dogra (p. 2).

Figure 11: Page from the Līlavatī showing the use of printed New Dogra digits (p. 3).

Figure 12: Usage of fractions for denoting currency in New Dogra (from Staal 1984: 75). The text appears to be typeset using metal fonts. The fraction signs and currency marks may be represented using characters from the ‘Common Indic Number Forms’ block (U+A830) in Unicode.
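
The ‘Common Indic Number Forms’ block mentioned in the caption of Figure 12 can be inspected with Python's standard `unicodedata` module. The sketch below simply lists the assigned characters in the range U+A830 to U+A83F. It does not claim which of them correspond to the signs in the figure; that identification would have to be made against the image. The function name `common_indic_number_forms` is a hypothetical helper.

```python
import unicodedata


def common_indic_number_forms():
    """List (code point label, name) pairs for assigned characters in
    the Common Indic Number Forms block, U+A830..U+A83F."""
    forms = []
    for cp in range(0xA830, 0xA840):
        # name() returns the default (None) for unassigned code points.
        name = unicodedata.name(chr(cp), None)
        if name is not None:
            forms.append((f"U+{cp:04X}", name))
    return forms


if __name__ == "__main__":
    for label, name in common_indic_number_forms():
        print(label, name)
```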

Figure 13: Two specimens of a trilingual stamp for expedited mail from Jammu State in Old Dogra, Persian, and Devanagari scripts. The Dogra text reads kakal jarurī (U+1180A U+1180A U+11825; U+11811 U+11824 U+1182F U+11824 U+1182E). The Persian reads خط ضروری ḵẖat zarūrī; the Devanagari reads आवश्यक पत्र āvaśyaka patra.

Figure 14: Four bilingual regular postage stamps from Jammu State with text in the Old Dogra and Persian scripts. The set comprises half-‘anna’ stamps and a one-‘anna’ stamp. In all stamps the Dogra text at the top reads jamu ‘Jammu’ (U+11811 U+11822 U+11804) and kachamīr (= kaśmīr) ‘Kashmir’ (U+1180A U+11810 U+11822 U+1182E U+11824). On the half-anna stamps the first line of the center is Persian نم انه ۱۹۲۳ 1923 [samvat] nim ānah ‘half ānnā’; the second line of the center is Dogra ada ana ‘half ānnā’ (U+11800 U+1181B U+11800 U+1181D). On the one-anna stamp the first line of the center text is Persian یك انه yak ānah ‘one ānnā’; the second line of the center is Dogra ekana ‘one ānnā’ (U+11806 U+1180A U+11800 U+1181D). In all four stamps the third line of the center is the date 1923 [samvat] written in Devanagari-like digits १९२३. The Persian text at the bottom of the stamp reads قلمرو سرکار جمون و کشمیر qalamrau sarkār jammūn o kaśmīr ‘dominions of the ruler of Jammu and Kashmir’.
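
The Dogra readings quoted for Figures 13 and 14 can be spelled out character by character against the proposed names list. The following Python sketch does this for the five readings. The code point values are those of the proposal (they may differ from the published standard), and `PROPOSED`, `READINGS`, and `spell` are hypothetical names used only for this illustration.

```python
# Subset of the proposed names list needed for the stamp inscriptions.
PROPOSED = {
    0x11800: "LETTER A",
    0x11804: "LETTER U",
    0x11806: "LETTER E",
    0x1180A: "LETTER KA",
    0x11810: "LETTER CHA",
    0x11811: "LETTER JA",
    0x1181B: "LETTER DA",
    0x1181D: "LETTER NA",
    0x11822: "LETTER MA",
    0x11824: "LETTER RA",
    0x11825: "LETTER LA",
    0x1182E: "VOWEL SIGN II",
    0x1182F: "VOWEL SIGN U",
}

# Readings from the captions, keyed by the transliteration given there.
READINGS = {
    "kakal": [0x1180A, 0x1180A, 0x11825],
    "jarurī": [0x11811, 0x11824, 0x1182F, 0x11824, 0x1182E],
    "jamu": [0x11811, 0x11822, 0x11804],
    "kachamīr": [0x1180A, 0x11810, 0x11822, 0x1182E, 0x11824],
    "ada ana": [0x11800, 0x1181B, 0x11800, 0x1181D],
    "ekana": [0x11806, 0x1180A, 0x11800, 0x1181D],
}


def spell(reading):
    """Return the proposed character names for one transliterated reading."""
    return [PROPOSED[cp] for cp in READINGS[reading]]


if __name__ == "__main__":
    for reading in READINGS:
        print(f"{reading}: {', '.join(spell(reading))}")
```

Spelling the readings this way shows, for example, that the inherent vowel is left unmarked: kakal is written with three bare consonant letters and no vowel signs.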

Detail of top left (Devanagari)

Detail of top right (New Dogra)

Detail of center (New Dogra, Devanagari, and Urdu)

Figure 15: The mast of a two rupee non-judicial stamp paper from Kashmir State.

Figure 16: The mast of a five rupee non-judicial stamp paper from Kashmir state containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.

Figure 17: A one anna stamp from Jammu and Kashmir state with text in New Dogra, Devanagari, and Arabic (Urdu).

Figure 18: Telegraph stamps from Jammu and Kashmir state with text in Dogra, Devanagari, Urdu, and English. The Dogra denominations show usage of Common Indic fractions and the rupee mark.

postakarda kalamrau jammu kasmir va tibbataha: “Postcard of the realm of Jammu, Kashmir, and Tibet”

isa tarapha siva pate ke kucha na likho: “Do not write anything on this side except for the address”

jamu: “Jammu”; sinagar: “Srinagar”

bhaisakh (4): “Vaisakh 4”; 4 vaisakh: “4 Vaisakh”

Figure 19: A state postcard from Jammu and Kashmir with postmark of 19 April 1891. The inscription at top is in New Dogra and the content is in Persian. The postmarks are in English and Dogra.

One rupee banknote

Twenty rupee banknote

Figure 20: Bank notes from Jammu and Kashmir State containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.

Vowel   Chamba letter   Dogra letter   Chamba sign          Dogra sign
        (Takri block)   (proposed)     (Takri block)        (proposed)
A       U+11680         U+11800        no dependent form    no dependent form
AA      U+11681         U+11801        U+116AD              U+1182C
I       U+11682         U+11802        U+116AE              U+1182D
II      U+11683         (none)         U+116AF              U+1182E
U       U+11684         U+11804        U+116B0              U+1182F
UU      U+11685         (none)         U+116B1              U+11830
E       U+11686         U+11806        U+116B2              U+11832
AI      U+11687         U+11807        U+116B3              U+11833
O       U+11688         U+11808        U+116B4              U+11834
AU      U+11689         (none)         U+116B5              U+11835

Table 2: Comparison of Chamba and Dogra vowel letters and signs.

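Letter   Chamba (Takri block)   Dogra (proposed)
KA       U+1168A                U+1180A
KHA      U+1168B                U+1180B
GA       U+1168C                U+1180C
GHA      U+1168D                U+1180D
NGA      U+1168E                U+1180E
CA       U+1168F                U+1180F
CHA      U+11690                U+11810
JA       U+11691                U+11811
JHA      U+11692                U+11812
NYA      U+11693                U+11813
TTA      U+11694                U+11814
TTHA     U+11695                U+11815
DDA      U+11696                U+11816
DDHA     U+11697                U+11817
NNA      U+11698                U+11818
TA       U+11699                U+11819
THA      U+1169A                U+1181A
DA       U+1169B                U+1181B
DHA      U+1169C                U+1181C
NA       U+1169D                U+1181D
PA       U+1169E                U+1181E
PHA      U+1169F                U+1181F
BA       U+116A0                U+11820
BHA      U+116A1                U+11821
MA       U+116A2                U+11822
YA       U+116A3                U+11823
RA       U+116A4                U+11824
LA       U+116A5                U+11825
VA       U+116A6                U+11826
SHA      U+116A7                U+11827
SSA      (none)                 U+11828
SA       U+116A8                U+11829
HA       U+116A9                U+1182A
RRA      U+116AA                U+1182B

Table 3: Comparison of Chamba and Dogra consonant letters.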
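
The correspondences in Tables 2 and 3 are regular enough to be expressed as a small lookup from Chamba forms (encoded in the existing Takri block) to the proposed Dogra code points. The Python sketch below builds that lookup from the rows of the two tables. It is an illustration under stated assumptions, not a conversion scheme defined by the proposal: the tables compare letterforms, the Dogra values are the proposed ones and may differ from the published standard, and `build_chamba_to_dogra` is a hypothetical helper name.

```python
TAKRI_BASE = 0x11680   # first code point of the Takri block (decimal 71296)
DOGRA_BASE = 0x11800   # first proposed Dogra code point (decimal 71680)


def build_chamba_to_dogra():
    """Map Takri-block code points to proposed Dogra code points,
    following the rows of Tables 2 and 3."""
    mapping = {}

    # Independent vowels A..AU; Table 2 shows no Dogra form for II, UU, AU.
    for offset in range(10):
        if offset in (3, 5, 9):
            continue
        mapping[TAKRI_BASE + offset] = DOGRA_BASE + offset

    # Consonants KA..SHA occupy the same offsets in both ranges.
    for offset in range(0x0A, 0x28):
        mapping[TAKRI_BASE + offset] = DOGRA_BASE + offset

    # SA, HA, RRA are shifted by one: Dogra SSA has no Chamba counterpart.
    for i in range(3):
        mapping[0x116A8 + i] = 0x11829 + i

    # Vowel signs AA..UU.
    for i in range(5):
        mapping[0x116AD + i] = 0x1182C + i

    # Vowel signs E..AU skip over the proposed Dogra VOCALIC R at 0x11831.
    for i in range(4):
        mapping[0x116B2 + i] = 0x11832 + i

    return mapping


CHAMBA_TO_DOGRA = build_chamba_to_dogra()

if __name__ == "__main__":
    for src in (0x1168A, 0x116A8, 0x116B2):
        print(f"U+{src:05X} -> U+{CHAMBA_TO_DOGRA[src]:05X}")
```

Only the three breaks in the otherwise constant offset need special handling: the missing Dogra vowel letters, the extra Dogra SSA, and the extra Dogra VOCALIC R sign.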

ISO/IEC JTC 1/SC 2/WG 2
PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646¹

Please fill all the sections A, B and C below. Please read Principles and Procedures Document (P & P) from http://std.dkuug.dk/JTC1/SC2/WG2/docs/principles.html for guidelines and details before filling this form. Please ensure you are using the latest Form from http://std.dkuug.dk/JTC1/SC2/WG2/docs/summaryform.html. See also http://std.dkuug.dk/JTC1/SC2/WG2/docs/roadmaps.html for latest Roadmaps.

A. Administrative

1. Title: Proposal to encode the Dogra script in Unicode
2. Requester's name: Anshuman Pandey, Script Encoding Initiative
3. Requester type (Member body/Liaison/Individual contribution): Liaison contribution
4. Submission date: 4 November 2015
5. Requester's reference (if applicable):
6. Choose one of the following:
   This is a complete proposal: Yes
   (or) More information will be provided later:

B. Technical – General

1. Choose one of the following:
   a. This proposal is for a new script (set of characters): Yes
      Proposed name of script: Dogra
   b. The proposal is for addition of character(s) to an existing block:
      Name of the existing block:
2. Number of characters in proposal: 59
3. Proposed category (select one from below - see section 2.2 of P&P document):
   A-Contemporary
   B.1-Specialized (small collection)
   B.2-Specialized (large collection)
   C-Major extinct
   D-Attested extinct
   E-Minor extinct: X
   F-Archaic Hieroglyphic or Ideographic
   G-Obscure or questionable usage symbols
4. Is a repertoire including character names provided? Yes
   a. If YES, are the names in accordance with the “character naming guidelines” in Annex L of P&P document?
   b. Are the character shapes attached in a legible form suitable for review?
5. Fonts related:
   a. Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the standard? Anshuman Pandey
   b. Identify the party granting a license for use of the font by the editors (include address, e-mail, ftp-site, etc.): Anshuman Pandey
6. References:
   a. Are references (to other character sets, dictionaries, descriptive texts etc.) provided? Yes
   b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? Yes
7. Special encoding issues: Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration etc. (if yes please enclose information)? Yes
8. Additional Information: Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org for such information on other scripts. Also see Unicode Character Database (http://www.unicode.org/reports/tr44/) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard.

¹ Form number: N4502-F (Original 1994-10-14; Revised 1995-01, 1995-04, 1996-04, 1996-08, 1999-03, 2001-05, 2001-09, 2003-11, 2005-01, 2005-09, 2005-10, 2007-03, 2008-05, 2009-11, 2011-03, 2012-01)

C. Technical - Justification

1. Has this proposal for addition of character(s) been submitted before? No
   If YES explain:
2. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? Yes
   If YES, with whom? Christopher Shackle (SOAS, London)
   If YES, available relevant documents:
3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? No
   Reference: Historical script, currently not used
4. The context of use for the proposed characters (type of use; common or rare): Common
   Reference:
5. Are the proposed characters in current use by the user community? No
   If YES, where? Reference:
6. After giving due considerations to the principles in the P&P document must the proposed characters be entirely in the BMP? N/A
   If YES, is a rationale provided?
   If YES, reference:
7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? Yes
8. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
9. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
10. Can any of the proposed character(s) be considered to be similar (in appearance or function) to, or could be confused with, an existing character? No
    If YES, is a rationale for its inclusion provided?
    If YES, reference:
11. Does the proposal include use of combining characters and/or use of composite sequences? Yes
    If YES, is a rationale for such use provided? Yes
    If YES, reference: See text of proposal
    Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided?
    If YES, reference:
12. Does the proposal contain characters with any special properties such as control function or similar semantics? Yes
    If YES, describe in detail (include attachment if necessary): Virama; see text of proposal
13. Does the proposal contain any Ideographic compatibility characters? No
    If YES, are the equivalent corresponding unified ideographic characters identified?
    If YES, reference:

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 1 Comparison of Old Dogra with Takri and other scripts (from Leitner 1882 Set 2)

18

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 2 Comparison of Old Dogra with Takri and other scripts (from Leitner 1882 Set 2)

19

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 3 Specimen of the Dogri language written in Old Dogra (from Grierson 1916 760)

20

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 4 Another specimen of the Dogri language written in Old Dogra (fromGrierson 1916 772)

21

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 5 Chart of the Old Dogra script (from Grierson 1916 641)

22

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 6 Comparison of Gurmukhi Kangra and Old Dogra scripts (from Grierson 1916 642)

23

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 7 Comparison of New Dogra (lsquoDogrirsquo) and Devanagari vowel and consonant letters (fromStaal 1984 33)

Figure 8 Comparison of New Dogra (lsquoDogrirsquo) and Devanagari vowel signs (from Staal 1984 34)

24

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 9 The first page of a Dogri translation of the Līlavatī by Bhāskarācārya a Sanskrit treastiseon mathematics printed in the New Dogra script in 1872 (p 1) Figure 10 contains the second pageof this book

25

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 10 Another page from the Līlavatī in New Dogra (p 2)

26

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 11 Page from the Līlavatī showing the use of printed New Dogra digits (p 3)

27

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 12 Usage of fractions for denoting currency in New Dogra (from Staal 1984 75) The textappears to be typeset using metal fonts The fraction signs are currency marks may be representedusing characters from the lsquoCommon Indic Number Formsrsquo block (U+A830) in Unicode

28

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 13 Two specimens of a trilingual stamp for expedited mail from Jammu State in Old DograPersian and Devanagari scripts The Dogra text reads 716907169071717 7169771716717277171671726 kakal jarurī The Persian readsضروری خط ḵẖat zarūrī the Devanagari reads āvaśyaka patraआव यक प

Figure 14 Four bilingual regular postage stamps from Jammu State with text in the Old Dogra andPersian scripts Stamps and are half lsquoannarsquo stamps and is a one lsquoannarsquo stamp In all stampsthe Dogra text at the top reads 716977171471684 jamu lsquoJammursquo and 7169071696717147172671716 kachamīr (= kaśmīr) lsquoKashmirrsquoIn stamps and the first line of the center is Persian انه نم ۱۹۲۳ 1923 [samvat] nim ānah lsquohalfānnārsquo the second line of the center is Dogra71680717077168071709 ada ana lsquohalf ānnārsquo In the first line of thecenter text is Persian انه یك yak ānah lsquoone ānnārsquo the second line of the center is Dogra71686716907168071709 ekana lsquoone ānnārsquo In all four stamps the third line of the center is the date 1923 [samvat] written inDevanagari-like digits १९२३ The Persian text at the bottom of the stamp reads جمون سرکار قلمروکشمیر و qalamrau sarkār jammūn o kaśmīr lsquodominions of the ruler of Jammu and Kashmirrsquo

29

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Detail of top left (Devanagari)

Detail of top right (New Dogra)

Detail of center (New Dogra Devanagari and Urdu)

Figure 15 The mast of a two rupee non-judicial stamp paper from Kashmir State

30

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 16 The mast of a five rupee non-judicial stamp paper from Kashmir state containing text inNew Dogra Devanagari and Arabic (Persian) scripts

Figure 17 A one anna stamp from Jammu and Kashmir state with text in New Dogra Devanagariand Arabic (Urdu)

31

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 18 Telegraph stamps from Jammu and Kashmir state with text in Dogra Devanagari Urduand English The Dogra denominations show usage of Common Indic fractions and the rupee mark

32

Proposal to encode the Dogra script in Unicode Anshuman Pandey

postakarda kalamrau jammu kasmir va tibbatahaldquoPostcard of the realm of Jammu Kashmir and Tibetrdquo

isa tarapha siva pate ke kucha na likholdquoDo no write anything on this side except for the addressrdquo

jamu sinagarldquoJammurdquo ldquoSrinagarrdquo

bhaisakh (4 ) 4 vaisakhldquoVaisakh 4rdquo ldquo4 Vaisakhrdquo

Figure 19 A state postcard from Jammu and Kashmir with post mark of 19 April 1891 Theinscription at top is in New Dogra and the content is in Persian The postmarks are in English andDogra

33

Proposal to encode the Dogra script in Unicode Anshuman Pandey

One rupee banknote

Twenty rupee banknote

Figure 20 Bank notes from Jammu and Kashmir State containing text in New Dogra Devanagariand Arabic (Persian) scripts

34

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Chamba Dogra

71296 71680

71297 71681

71298 71682

71299 mdash

71300 71684

71301 mdash

71302 71686

71303 71687

71304 71688

71305 mdash

Chamba Dogra

no dependent form

71341 71724

71342 71725

71343 71726

71344 71727

71345 71728

71346 71730

71347 71731

71348 71732

71349 71733

Table 2 Comparison of Chamba and Dogra vowel letters and signs

35

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Chamba Dogra

71306 71690

71307 71691

71308 71692

71309 71693

71310 71694

71311 71695

71312 71696

71313 71697

71314 71698

71315 71699

71316 71700

71317 71701

71318 71702

71319 71703

71320 71704

71321 71705

71322 71706

Chamba Dogra

71323 7170771324 7170871325 7170971326 7171071327 7171171328 7171271329 7171371330 7171471331 7171571332 7171671333 7171771334 7171871335 71719mdash 7172071336 7172171337 7172271338 71723

Table 3 Comparison of Chamba and Dogra consonant letters

36

ISO/IEC JTC 1/SC 2/WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646 [1]

Please fill all the sections A, B and C below. Please read Principles and Procedures Document (P & P) from http://std.dkuug.dk/JTC1/SC2/WG2/docs/principles.html for guidelines and details before filling this form. Please ensure you are using the latest Form from http://std.dkuug.dk/JTC1/SC2/WG2/docs/summaryform.html. See also http://std.dkuug.dk/JTC1/SC2/WG2/docs/roadmaps.html for latest Roadmaps.

A. Administrative

1. Title: Proposal to encode the Dogra script in Unicode
2. Requester's name: Anshuman Pandey (Script Encoding Initiative)
3. Requester type (Member body/Liaison/Individual contribution): Liaison contribution
4. Submission date: 4 November 2015
5. Requester's reference (if applicable):
6. Choose one of the following:
   This is a complete proposal: Yes
   (or) More information will be provided later:

B. Technical – General

1. Choose one of the following:
   a. This proposal is for a new script (set of characters): Yes
      Proposed name of script: Dogra
   b. The proposal is for addition of character(s) to an existing block:
      Name of the existing block:
2. Number of characters in proposal: 59
3. Proposed category (select one from below; see section 2.2 of P&P document):
   A-Contemporary
   B.1-Specialized (small collection)
   B.2-Specialized (large collection)
   C-Major extinct
   D-Attested extinct
   E-Minor extinct: X
   F-Archaic Hieroglyphic or Ideographic
   G-Obscure or questionable usage symbols
4. Is a repertoire including character names provided? Yes
   a. If YES, are the names in accordance with the “character naming guidelines” in Annex L of P&P document?
   b. Are the character shapes attached in a legible form suitable for review?
5. Fonts related:
   a. Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the standard? Anshuman Pandey
   b. Identify the party granting a license for use of the font by the editors (include address, e-mail, ftp-site, etc.): Anshuman Pandey
6. References:
   a. Are references (to other character sets, dictionaries, descriptive texts etc.) provided? Yes
   b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? Yes
7. Special encoding issues: Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration etc. (if yes please enclose information)? Yes
8. Additional Information: Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org for such information on other scripts. Also see Unicode Character Database (http://www.unicode.org/reports/tr44/) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard.

C. Technical - Justification

1. Has this proposal for addition of character(s) been submitted before? No
   If YES, explain:
2. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? Yes
   If YES, with whom? Christopher Shackle (SOAS, London)
   If YES, available relevant documents:
3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? No
   Reference: Historical script, currently not used
4. The context of use for the proposed characters (type of use; common or rare): Common
   Reference:
5. Are the proposed characters in current use by the user community? No
   If YES, where? Reference:
6. After giving due considerations to the principles in the P&P document, must the proposed characters be entirely in the BMP? N/A
   If YES, is a rationale provided?
   If YES, reference:
7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? Yes
8. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
9. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
10. Can any of the proposed character(s) be considered to be similar (in appearance or function) to, or could be confused with, an existing character? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
11. Does the proposal include use of combining characters and/or use of composite sequences? Yes
   If YES, is a rationale for such use provided? Yes
   If YES, reference: See text of proposal
   Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided?
   If YES, reference:
12. Does the proposal contain characters with any special properties such as control function or similar semantics? Yes
   If YES, describe in detail (include attachment if necessary): Virama; see text of proposal
13. Does the proposal contain any Ideographic compatibility characters? No
   If YES, are the equivalent corresponding unified ideographic characters identified?
   If YES, reference:

[1] Form number: N4502-F (Original 1994-10-14; Revised 1995-01, 1995-04, 1996-04, 1996-08, 1999-03, 2001-05, 2001-09, 2003-11, 2005-01, 2005-09, 2005-10, 2007-03, 2008-05, 2009-11, 2011-03, 2012-01)

Figure 2: Comparison of Old Dogra with Takri and other scripts (from Leitner 1882: Set 2).

Figure 3: Specimen of the Dogri language written in Old Dogra (from Grierson 1916: 760).

Figure 4: Another specimen of the Dogri language written in Old Dogra (from Grierson 1916: 772).

Figure 5: Chart of the Old Dogra script (from Grierson 1916: 641).

Figure 6: Comparison of Gurmukhi, Kangra, and Old Dogra scripts (from Grierson 1916: 642).

Figure 7: Comparison of New Dogra (‘Dogri’) and Devanagari vowel and consonant letters (from Staal 1984: 33).

Figure 8: Comparison of New Dogra (‘Dogri’) and Devanagari vowel signs (from Staal 1984: 34).

Figure 9: The first page of a Dogri translation of the Līlavatī by Bhāskarācārya, a Sanskrit treatise on mathematics, printed in the New Dogra script in 1872 (p. 1). Figure 10 contains the second page of this book.

Figure 10: Another page from the Līlavatī in New Dogra (p. 2).

Figure 11: Page from the Līlavatī showing the use of printed New Dogra digits (p. 3).

Figure 12: Usage of fractions for denoting currency in New Dogra (from Staal 1984: 75). The text appears to be typeset using metal fonts. The fraction signs and currency marks may be represented using characters from the ‘Common Indic Number Forms’ block (U+A830) in Unicode.
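As a rough illustration of the block named in the caption of Figure 12, the following Python sketch lists the characters from U+A830 to U+A839 together with the names and numeric values recorded in Python's standard `unicodedata` module. It is not part of the proposal. The end of the range (U+A839) and the function name `common_indic_number_forms` are assumptions made for this example, and the output depends on the Unicode version bundled with the Python interpreter.

```python
import unicodedata


def common_indic_number_forms(first=0xA830, last=0xA839):
    """List (code point, name, numeric value) for a range of code points.

    The default range is an assumption for this example: it starts at the
    U+A830 mentioned in the caption and stops at U+A839.
    """
    rows = []
    for cp in range(first, last + 1):
        ch = chr(cp)
        name = unicodedata.name(ch, "<no name in this Unicode version>")
        value = unicodedata.numeric(ch, None)  # None for marks without a numeric value
        rows.append((f"U+{cp:04X}", name, value))
    return rows


if __name__ == "__main__":
    for code, name, value in common_indic_number_forms():
        print(code, name, value)
```

The fraction characters carry numeric values (for example one quarter), while the marks, such as the rupee mark, have none. This is why the sketch passes `None` as the default to `unicodedata.numeric`.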

Figure 13: Two specimens of a trilingual stamp for expedited mail from Jammu State in Old Dogra, Persian, and Devanagari scripts. The Dogra text reads U+1180A U+1180A U+11825, U+11811 U+11824 U+1182F U+11824 U+1182E kakal jarurī. The Persian reads خط ضروری ḵẖat zarūrī; the Devanagari reads आवश्यक पत्र āvaśyaka patra.

Figure 14: Four bilingual regular postage stamps from Jammu State with text in the Old Dogra and Persian scripts. Stamps […] and […] are half ‘anna’ stamps and […] is a one ‘anna’ stamp. In all stamps the Dogra text at the top reads U+11811 U+11822 U+11804 jamu ‘Jammu’ and U+1180A U+11810 U+11822 U+1182E U+11824 kachamīr (= kaśmīr) ‘Kashmir’. In stamps […] and […] the first line of the center is Persian ۱۹۲۳ نم انه 1923 [samvat] nim ānah ‘half ānnā’; the second line of the center is Dogra U+11800 U+1181B U+11800 U+1181D ada ana ‘half ānnā’. In […] the first line of the center text is Persian یك انه yak ānah ‘one ānnā’; the second line of the center is Dogra U+11806 U+1180A U+11800 U+1181D ekana ‘one ānnā’. In all four stamps the third line of the center is the date 1923 [samvat] written in Devanagari-like digits १९२३. The Persian text at the bottom of the stamp reads قلمرو سرکار جمون و کشمیر qalamrau sarkār jammūn o kaśmīr ‘dominions of the ruler of Jammu and Kashmir’.
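The date in the caption of Figure 14 is written with Devanagari digits. As a small illustration, and not as part of the proposal, the following Python sketch reads such a digit string as an integer using the decimal values recorded in the standard `unicodedata` module. The function name `digits_to_int` and the constant `DEVANAGARI_1923` are hypothetical.

```python
import unicodedata

# The four Devanagari digits of the date in the caption, written as escapes:
# U+0967 (one), U+096F (nine), U+0968 (two), U+0969 (three).
DEVANAGARI_1923 = "\u0967\u096F\u0968\u0969"


def digits_to_int(text):
    """Convert a string of Unicode decimal digits (any script) to an int.

    unicodedata.decimal raises ValueError for characters that are not
    decimal digits, so malformed input is rejected rather than guessed at.
    """
    value = 0
    for ch in text:
        value = value * 10 + unicodedata.decimal(ch)
    return value


if __name__ == "__main__":
    print(digits_to_int(DEVANAGARI_1923))
```

Python's built-in `int()` accepts the same digit strings directly; the explicit loop only makes the per-character decimal values visible.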

Detail of top left (Devanagari)

Detail of top right (New Dogra)

Detail of center (New Dogra, Devanagari, and Urdu)

Figure 15: The mast of a two rupee non-judicial stamp paper from Kashmir State.

Figure 16: The mast of a five rupee non-judicial stamp paper from Kashmir State containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.

Figure 17: A one anna stamp from Jammu and Kashmir State with text in New Dogra, Devanagari, and Arabic (Urdu).

Figure 18: Telegraph stamps from Jammu and Kashmir State with text in Dogra, Devanagari, Urdu, and English. The Dogra denominations show usage of Common Indic fractions and the rupee mark.

postakarda kalamrau jammu kasmir va tibbataha: “Postcard of the realm of Jammu, Kashmir, and Tibet”

isa tarapha siva pate ke kucha na likho: “Do not write anything on this side except for the address”

jamu: “Jammu”; sinagar: “Srinagar”

bhaisakh (4): “Vaisakh 4”; 4 vaisakh: “4 Vaisakh”

Figure 19: A state postcard from Jammu and Kashmir with postmark of 19 April 1891. The inscription at top is in New Dogra and the content is in Persian. The postmarks are in English and Dogra.

One rupee banknote

Twenty rupee banknote

Figure 20: Bank notes from Jammu and Kashmir State containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.

Vowel letters

Chamba     Dogra

U+11680    U+11800
U+11681    U+11801
U+11682    U+11802
U+11683    –
U+11684    U+11804
U+11685    –
U+11686    U+11806
U+11687    U+11807
U+11688    U+11808
U+11689    –

Vowel signs

Chamba     Dogra

(no dependent form)
U+116AD    U+1182C
U+116AE    U+1182D
U+116AF    U+1182E
U+116B0    U+1182F
U+116B1    U+11830
U+116B2    U+11832
U+116B3    U+11833
U+116B4    U+11834
U+116B5    U+11835

Table 2: Comparison of Chamba and Dogra vowel letters and signs.




Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 4 Another specimen of the Dogri language written in Old Dogra (fromGrierson 1916 772)

21

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 5 Chart of the Old Dogra script (from Grierson 1916 641)

22

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 6 Comparison of Gurmukhi Kangra and Old Dogra scripts (from Grierson 1916 642)

23

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 7 Comparison of New Dogra (lsquoDogrirsquo) and Devanagari vowel and consonant letters (fromStaal 1984 33)

Figure 8 Comparison of New Dogra (lsquoDogrirsquo) and Devanagari vowel signs (from Staal 1984 34)

24

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 9 The first page of a Dogri translation of the Līlavatī by Bhāskarācārya a Sanskrit treastiseon mathematics printed in the New Dogra script in 1872 (p 1) Figure 10 contains the second pageof this book

25

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 10 Another page from the Līlavatī in New Dogra (p 2)

26

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 11 Page from the Līlavatī showing the use of printed New Dogra digits (p 3)

27

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 12 Usage of fractions for denoting currency in New Dogra (from Staal 1984 75) The textappears to be typeset using metal fonts The fraction signs are currency marks may be representedusing characters from the lsquoCommon Indic Number Formsrsquo block (U+A830) in Unicode

28

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 13 Two specimens of a trilingual stamp for expedited mail from Jammu State in Old DograPersian and Devanagari scripts The Dogra text reads 716907169071717 7169771716717277171671726 kakal jarurī The Persian readsضروری خط ḵẖat zarūrī the Devanagari reads āvaśyaka patraआव यक प

Figure 14 Four bilingual regular postage stamps from Jammu State with text in the Old Dogra andPersian scripts Stamps and are half lsquoannarsquo stamps and is a one lsquoannarsquo stamp In all stampsthe Dogra text at the top reads 716977171471684 jamu lsquoJammursquo and 7169071696717147172671716 kachamīr (= kaśmīr) lsquoKashmirrsquoIn stamps and the first line of the center is Persian انه نم ۱۹۲۳ 1923 [samvat] nim ānah lsquohalfānnārsquo the second line of the center is Dogra71680717077168071709 ada ana lsquohalf ānnārsquo In the first line of thecenter text is Persian انه یك yak ānah lsquoone ānnārsquo the second line of the center is Dogra71686716907168071709 ekana lsquoone ānnārsquo In all four stamps the third line of the center is the date 1923 [samvat] written inDevanagari-like digits १९२३ The Persian text at the bottom of the stamp reads جمون سرکار قلمروکشمیر و qalamrau sarkār jammūn o kaśmīr lsquodominions of the ruler of Jammu and Kashmirrsquo

29

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Detail of top left (Devanagari)

Detail of top right (New Dogra)

Detail of center (New Dogra Devanagari and Urdu)

Figure 15 The mast of a two rupee non-judicial stamp paper from Kashmir State

30

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 16 The mast of a five rupee non-judicial stamp paper from Kashmir state containing text inNew Dogra Devanagari and Arabic (Persian) scripts

Figure 17 A one anna stamp from Jammu and Kashmir state with text in New Dogra Devanagariand Arabic (Urdu)

31

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 18 Telegraph stamps from Jammu and Kashmir state with text in Dogra Devanagari Urduand English The Dogra denominations show usage of Common Indic fractions and the rupee mark

32

Proposal to encode the Dogra script in Unicode Anshuman Pandey

postakarda kalamrau jammu kasmir va tibbataha “Postcard of the realm of Jammu, Kashmir, and Tibet”

isa tarapha siva pate ke kucha na likho “Do not write anything on this side except for the address”

jamu sinagar “Jammu” “Srinagar”

bhaisakh (4 ) 4 vaisakh “Vaisakh 4” “4 Vaisakh”

Figure 19: A state postcard from Jammu and Kashmir with postmark of 19 April 1891. The inscription at top is in New Dogra and the content is in Persian. The postmarks are in English and Dogra.

One rupee banknote

Twenty rupee banknote

Figure 20: Bank notes from Jammu and Kashmir State containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.

Vowel letters

Chamba     Dogra
U+11680    U+11800
U+11681    U+11801
U+11682    U+11802
U+11683    —
U+11684    U+11804
U+11685    —
U+11686    U+11806
U+11687    U+11807
U+11688    U+11808
U+11689    —

Vowel signs

Chamba     Dogra
(no dependent form)
U+116AD    U+1182C
U+116AE    U+1182D
U+116AF    U+1182E
U+116B0    U+1182F
U+116B1    U+11830
U+116B2    U+11832
U+116B3    U+11833
U+116B4    U+11834
U+116B5    U+11835

Table 2: Comparison of Chamba and Dogra vowel letters and signs.

Chamba     Dogra
U+1168A    U+1180A
U+1168B    U+1180B
U+1168C    U+1180C
U+1168D    U+1180D
U+1168E    U+1180E
U+1168F    U+1180F
U+11690    U+11810
U+11691    U+11811
U+11692    U+11812
U+11693    U+11813
U+11694    U+11814
U+11695    U+11815
U+11696    U+11816
U+11697    U+11817
U+11698    U+11818
U+11699    U+11819
U+1169A    U+1181A

Chamba     Dogra
U+1169B    U+1181B
U+1169C    U+1181C
U+1169D    U+1181D
U+1169E    U+1181E
U+1169F    U+1181F
U+116A0    U+11820
U+116A1    U+11821
U+116A2    U+11822
U+116A3    U+11823
U+116A4    U+11824
U+116A5    U+11825
U+116A6    U+11826
U+116A7    U+11827
—          U+11828
U+116A8    U+11829
U+116A9    U+1182A
U+116AA    U+1182B

Table 3: Comparison of Chamba and Dogra consonant letters.
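
The row alignment of Table 3 can be sketched programmatically. The Python below is illustrative only and is not part of the proposal. It assumes that the Chamba forms are the characters of the Takri block starting at U+11680 and that the Dogra forms sit at the proposed code points starting at U+11800; the names `TAKRI_BASE`, `DOGRA_BASE`, `consonant_pairs`, and `fmt` are hypothetical.

```python
# Illustrative sketch, not from the proposal: rebuild the row pairing of
# Table 3 (Chamba vs. Dogra consonant letters) from block offsets.
# Assumption: Chamba forms use the Takri block (base U+11680) and Dogra
# forms use the proposed code points (base U+11800).
TAKRI_BASE = 0x11680
DOGRA_BASE = 0x11800


def consonant_pairs():
    """Return (chamba, dogra) code point pairs in table order."""
    pairs = []
    # First 30 rows: identical offsets 0x0A..0x27 in both blocks.
    for offset in range(0x0A, 0x28):
        pairs.append((TAKRI_BASE + offset, DOGRA_BASE + offset))
    # One Dogra letter (U+11828) has no Chamba counterpart in the table.
    pairs.append((None, DOGRA_BASE + 0x28))
    # Last three rows: the Dogra side is shifted by one position.
    for offset in range(0x28, 0x2B):
        pairs.append((TAKRI_BASE + offset, DOGRA_BASE + offset + 1))
    return pairs


def fmt(cp):
    """Format a code point as U+XXXXX, or '(none)' for a missing entry."""
    return "(none)" if cp is None else f"U+{cp:04X}"


if __name__ == "__main__":
    for chamba, dogra in consonant_pairs():
        print(f"{fmt(chamba):<10}{fmt(dogra)}")
```

Because the offsets agree for most rows, a converter between the two encodings would need a lookup table only for the tail of the consonant range, where the extra Dogra letter shifts the alignment by one.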

ISO/IEC JTC 1/SC 2/WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646 [1]

Please fill all the sections A, B and C below. Please read Principles and Procedures Document (P & P) from http://std.dkuug.dk/JTC1/SC2/WG2/docs/principles.html for guidelines and details before filling this form. Please ensure you are using the latest Form from http://std.dkuug.dk/JTC1/SC2/WG2/docs/summaryform.html. See also http://std.dkuug.dk/JTC1/SC2/WG2/docs/roadmaps.html for latest Roadmaps.

A. Administrative

1. Title: Proposal to encode the Dogra script in Unicode
2. Requester's name: Anshuman Pandey, Script Encoding Initiative
3. Requester type (Member body/Liaison/Individual contribution): Liaison contribution
4. Submission date: 4 November 2015
5. Requester's reference (if applicable):
6. Choose one of the following:
   This is a complete proposal: Yes
   (or) More information will be provided later:

B. Technical – General

1. Choose one of the following:
   a. This proposal is for a new script (set of characters): Yes
      Proposed name of script: Dogra
   b. The proposal is for addition of character(s) to an existing block:
      Name of the existing block:
2. Number of characters in proposal: 59
3. Proposed category (select one from below - see section 2.2 of P&P document):
   A-Contemporary
   B.1-Specialized (small collection)
   B.2-Specialized (large collection)
   C-Major extinct
   D-Attested extinct
   E-Minor extinct: X
   F-Archaic Hieroglyphic or Ideographic
   G-Obscure or questionable usage symbols
4. Is a repertoire including character names provided? Yes
   a. If YES, are the names in accordance with the “character naming guidelines” in Annex L of P&P document?
   b. Are the character shapes attached in a legible form suitable for review?
5. Fonts related:
   a. Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the standard? Anshuman Pandey
   b. Identify the party granting a license for use of the font by the editors (include address, e-mail, ftp-site, etc.): Anshuman Pandey
6. References:
   a. Are references (to other character sets, dictionaries, descriptive texts etc.) provided? Yes
   b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? Yes
7. Special encoding issues: Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration etc. (if yes please enclose information)? Yes
8. Additional Information: Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org for such information on other scripts. Also see Unicode Character Database (http://www.unicode.org/reports/tr44/) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard.

C. Technical - Justification

1. Has this proposal for addition of character(s) been submitted before? No
   If YES, explain:
2. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? Yes
   If YES, with whom? Christopher Shackle (SOAS, London)
   If YES, available relevant documents:
3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? No
   Reference: Historical script, currently not used
4. The context of use for the proposed characters (type of use; common or rare): Common
   Reference:
5. Are the proposed characters in current use by the user community? No
   If YES, where? Reference:
6. After giving due considerations to the principles in the P&P document must the proposed characters be entirely in the BMP? N/A
   If YES, is a rationale provided?
   If YES, reference:
7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? Yes
8. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
9. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
10. Can any of the proposed character(s) be considered to be similar (in appearance or function) to, or could be confused with, an existing character? No
    If YES, is a rationale for its inclusion provided?
    If YES, reference:
11. Does the proposal include use of combining characters and/or use of composite sequences? Yes
    If YES, is a rationale for such use provided? Yes
    If YES, reference: See text of proposal
    Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided?
    If YES, reference:
12. Does the proposal contain characters with any special properties such as control function or similar semantics? Yes
    If YES, describe in detail (include attachment if necessary): Virama; see text of proposal
13. Does the proposal contain any Ideographic compatibility characters? No
    If YES, are the equivalent corresponding unified ideographic characters identified?
    If YES, reference:

[1] Form number: N4502-F (Original 1994-10-14; Revised 1995-01, 1995-04, 1996-04, 1996-08, 1999-03, 2001-05, 2001-09, 2003-11, 2005-01, 2005-09, 2005-10, 2007-03, 2008-05, 2009-11, 2011-03, 2012-01)

Figure 5: Chart of the Old Dogra script (from Grierson 1916: 641).

Figure 6: Comparison of Gurmukhi, Kangra, and Old Dogra scripts (from Grierson 1916: 642).

Figure 7: Comparison of New Dogra (‘Dogri’) and Devanagari vowel and consonant letters (from Staal 1984: 33).

Figure 8: Comparison of New Dogra (‘Dogri’) and Devanagari vowel signs (from Staal 1984: 34).

Figure 9: The first page of a Dogri translation of the Līlavatī by Bhāskarācārya, a Sanskrit treatise on mathematics, printed in the New Dogra script in 1872 (p. 1). Figure 10 contains the second page of this book.

Figure 10: Another page from the Līlavatī in New Dogra (p. 2).

Figure 11: Page from the Līlavatī showing the use of printed New Dogra digits (p. 3).

Figure 12: Usage of fractions for denoting currency in New Dogra (from Staal 1984: 75). The text appears to be typeset using metal fonts. The fraction signs and currency marks may be represented using characters from the ‘Common Indic Number Forms’ block (U+A830) in Unicode.
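
As a rough illustration of the characters this caption points to, the sketch below lists the characters in the range U+A830..U+A839 using Python's standard `unicodedata` module. It assumes that the Unicode database bundled with the running Python includes the Common Indic Number Forms block, and the function name `indic_number_forms` is hypothetical. Which of these characters correspond to the marks visible in the figure is not determined here.

```python
import unicodedata


# Illustrative sketch: enumerate characters of the Common Indic Number
# Forms block. The default range U+A830..U+A839 is an assumption of this
# sketch; code points without a name in the local database are skipped.
def indic_number_forms(start=0xA830, end=0xA839):
    rows = []
    for cp in range(start, end + 1):
        ch = chr(cp)
        name = unicodedata.name(ch, None)
        if name is None:
            continue  # not assigned in this Unicode database
        # numeric() falls back to None for marks without a numeric value
        value = unicodedata.numeric(ch, None)
        rows.append((f"U+{cp:04X}", name, value))
    return rows


if __name__ == "__main__":
    for code, name, value in indic_number_forms():
        print(code, name, value)
```

The fraction characters carry numeric values in the character database, while the currency and quantity marks are plain symbols, so the last column is `None` for them.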

Figure 13: Two specimens of a trilingual stamp for expedited mail from Jammu State in Old Dogra, Persian, and Devanagari scripts. The Dogra text reads <U+1180A U+1180A U+11825> <U+11811 U+11824 U+1182F U+11824 U+1182E> kakal jarurī. The Persian reads خط ضروری ḵẖat zarūrī; the Devanagari reads आवश्यक पत्र āvaśyaka patra.

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 7 Comparison of New Dogra (lsquoDogrirsquo) and Devanagari vowel and consonant letters (fromStaal 1984 33)

Figure 8 Comparison of New Dogra (lsquoDogrirsquo) and Devanagari vowel signs (from Staal 1984 34)

24

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 9 The first page of a Dogri translation of the Līlavatī by Bhāskarācārya a Sanskrit treastiseon mathematics printed in the New Dogra script in 1872 (p 1) Figure 10 contains the second pageof this book

25

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 10 Another page from the Līlavatī in New Dogra (p 2)

26

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 11 Page from the Līlavatī showing the use of printed New Dogra digits (p 3)

27

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 12 Usage of fractions for denoting currency in New Dogra (from Staal 1984 75) The textappears to be typeset using metal fonts The fraction signs are currency marks may be representedusing characters from the lsquoCommon Indic Number Formsrsquo block (U+A830) in Unicode

28

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 13 Two specimens of a trilingual stamp for expedited mail from Jammu State in Old DograPersian and Devanagari scripts The Dogra text reads 716907169071717 7169771716717277171671726 kakal jarurī The Persian readsضروری خط ḵẖat zarūrī the Devanagari reads āvaśyaka patraआव यक प

Figure 14 Four bilingual regular postage stamps from Jammu State with text in the Old Dogra andPersian scripts Stamps and are half lsquoannarsquo stamps and is a one lsquoannarsquo stamp In all stampsthe Dogra text at the top reads 716977171471684 jamu lsquoJammursquo and 7169071696717147172671716 kachamīr (= kaśmīr) lsquoKashmirrsquoIn stamps and the first line of the center is Persian انه نم ۱۹۲۳ 1923 [samvat] nim ānah lsquohalfānnārsquo the second line of the center is Dogra71680717077168071709 ada ana lsquohalf ānnārsquo In the first line of thecenter text is Persian انه یك yak ānah lsquoone ānnārsquo the second line of the center is Dogra71686716907168071709 ekana lsquoone ānnārsquo In all four stamps the third line of the center is the date 1923 [samvat] written inDevanagari-like digits १९२३ The Persian text at the bottom of the stamp reads جمون سرکار قلمروکشمیر و qalamrau sarkār jammūn o kaśmīr lsquodominions of the ruler of Jammu and Kashmirrsquo

Detail of top left (Devanagari)

Detail of top right (New Dogra)

Detail of center (New Dogra, Devanagari, and Urdu)

Figure 15: The mast of a two rupee non-judicial stamp paper from Kashmir State.

Figure 16: The mast of a five rupee non-judicial stamp paper from Kashmir state, containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.

Figure 17: A one anna stamp from Jammu and Kashmir state, with text in New Dogra, Devanagari, and Arabic (Urdu).

Figure 18: Telegraph stamps from Jammu and Kashmir state, with text in Dogra, Devanagari, Urdu, and English. The Dogra denominations show usage of Common Indic fractions and the rupee mark.

postakarda kalamrau jammu kasmir va tibbataha
“Postcard of the realm of Jammu, Kashmir, and Tibet”

isa tarapha siva pate ke kucha na likho
“Do not write anything on this side except for the address”

jamu “Jammu”; sinagar “Srinagar”

bhaisakh (4) “Vaisakh 4”; 4 vaisakh “4 Vaisakh”

Figure 19: A state postcard from Jammu and Kashmir with postmark of 19 April 1891. The inscription at top is in New Dogra and the content is in Persian. The postmarks are in English and Dogra.

One rupee banknote

Twenty rupee banknote

Figure 20: Banknotes from Jammu and Kashmir State containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.

Chamba     Dogra
U+11680    U+11800
U+11681    U+11801
U+11682    U+11802
U+11683    (none)
U+11684    U+11804
U+11685    (none)
U+11686    U+11806
U+11687    U+11807
U+11688    U+11808
U+11689    (none)

Chamba     Dogra
no dependent form
U+116AD    U+1182C
U+116AE    U+1182D
U+116AF    U+1182E
U+116B0    U+1182F
U+116B1    U+11830
U+116B2    U+11832
U+116B3    U+11833
U+116B4    U+11834
U+116B5    U+11835

Table 2: Comparison of Chamba and Dogra vowel letters and signs.

Chamba     Dogra
U+1168A    U+1180A
U+1168B    U+1180B
U+1168C    U+1180C
U+1168D    U+1180D
U+1168E    U+1180E
U+1168F    U+1180F
U+11690    U+11810
U+11691    U+11811
U+11692    U+11812
U+11693    U+11813
U+11694    U+11814
U+11695    U+11815
U+11696    U+11816
U+11697    U+11817
U+11698    U+11818
U+11699    U+11819
U+1169A    U+1181A

Chamba     Dogra
U+1169B    U+1181B
U+1169C    U+1181C
U+1169D    U+1181D
U+1169E    U+1181E
U+1169F    U+1181F
U+116A0    U+11820
U+116A1    U+11821
U+116A2    U+11822
U+116A3    U+11823
U+116A4    U+11824
U+116A5    U+11825
U+116A6    U+11826
U+116A7    U+11827
(none)     U+11828
U+116A8    U+11829
U+116A9    U+1182A
U+116AA    U+1182B

Table 3: Comparison of Chamba and Dogra consonant letters.

ISO/IEC JTC 1/SC 2/WG 2
PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS
FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646

Please fill all the sections A, B and C below. Please read Principles and Procedures Document (P & P) from http://std.dkuug.dk/JTC1/SC2/WG2/docs/principles.html for guidelines and details before filling this form. Please ensure you are using the latest Form from http://std.dkuug.dk/JTC1/SC2/WG2/docs/summaryform.html. See also http://std.dkuug.dk/JTC1/SC2/WG2/docs/roadmaps.html for latest Roadmaps.

A. Administrative

1. Title: Proposal to encode the Dogra script in Unicode
2. Requester's name: Anshuman Pandey, Script Encoding Initiative
3. Requester type (Member body/Liaison/Individual contribution): Liaison contribution
4. Submission date: 4 November 2015
5. Requester's reference (if applicable):
6. Choose one of the following:
   This is a complete proposal: Yes
   (or) More information will be provided later:

B. Technical – General

1. Choose one of the following:
   a. This proposal is for a new script (set of characters): Yes
      Proposed name of script: Dogra
   b. The proposal is for addition of character(s) to an existing block:
      Name of the existing block:
2. Number of characters in proposal: 59
3. Proposed category (select one from below - see section 2.2 of P&P document):
   A-Contemporary
   B.1-Specialized (small collection)
   B.2-Specialized (large collection)
   C-Major extinct
   D-Attested extinct
   E-Minor extinct: X
   F-Archaic Hieroglyphic or Ideographic
   G-Obscure or questionable usage symbols
4. Is a repertoire including character names provided? Yes
   a. If YES, are the names in accordance with the “character naming guidelines” in Annex L of P&P document?
   b. Are the character shapes attached in a legible form suitable for review?
5. Fonts related:
   a. Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the standard? Anshuman Pandey
   b. Identify the party granting a license for use of the font by the editors (include address, e-mail, ftp-site, etc.): Anshuman Pandey
6. References:
   a. Are references (to other character sets, dictionaries, descriptive texts etc.) provided? Yes
   b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? Yes
7. Special encoding issues: Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration etc. (if yes please enclose information)? Yes
8. Additional Information: Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org for such information on other scripts. Also see Unicode Character Database (http://www.unicode.org/reports/tr44/) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard.

Form number: N4502-F (Original 1994-10-14; Revised 1995-01, 1995-04, 1996-04, 1996-08, 1999-03, 2001-05, 2001-09, 2003-11, 2005-01, 2005-09, 2005-10, 2007-03, 2008-05, 2009-11, 2011-03, 2012-01)

C. Technical - Justification

1. Has this proposal for addition of character(s) been submitted before? No
   If YES, explain:
2. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? Yes
   If YES, with whom? Christopher Shackle (SOAS, London)
   If YES, available relevant documents:
3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? No
   Reference: Historical script; currently not used
4. The context of use for the proposed characters (type of use; common or rare): Common
   Reference:
5. Are the proposed characters in current use by the user community? No
   If YES, where? Reference:
6. After giving due considerations to the principles in the P&P document must the proposed characters be entirely in the BMP? N/A
   If YES, is a rationale provided?
   If YES, reference:
7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? Yes
8. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
9. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
10. Can any of the proposed character(s) be considered to be similar (in appearance or function) to, or could be confused with, an existing character? No
    If YES, is a rationale for its inclusion provided?
    If YES, reference:
11. Does the proposal include use of combining characters and/or use of composite sequences? Yes
    If YES, is a rationale for such use provided? Yes
    If YES, reference: See text of proposal
    Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided?
    If YES, reference:
12. Does the proposal contain characters with any special properties such as control function or similar semantics? Yes
    If YES, describe in detail (include attachment if necessary): Virama; see text of proposal
13. Does the proposal contain any Ideographic compatibility characters? No
    If YES, are the equivalent corresponding unified ideographic characters identified?
    If YES, reference:



Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 11 Page from the Līlavatī showing the use of printed New Dogra digits (p 3)

27

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 12 Usage of fractions for denoting currency in New Dogra (from Staal 1984 75) The textappears to be typeset using metal fonts The fraction signs are currency marks may be representedusing characters from the lsquoCommon Indic Number Formsrsquo block (U+A830) in Unicode

28

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 13 Two specimens of a trilingual stamp for expedited mail from Jammu State in Old DograPersian and Devanagari scripts The Dogra text reads 716907169071717 7169771716717277171671726 kakal jarurī The Persian readsضروری خط ḵẖat zarūrī the Devanagari reads āvaśyaka patraआव यक प

Figure 14 Four bilingual regular postage stamps from Jammu State with text in the Old Dogra andPersian scripts Stamps and are half lsquoannarsquo stamps and is a one lsquoannarsquo stamp In all stampsthe Dogra text at the top reads 716977171471684 jamu lsquoJammursquo and 7169071696717147172671716 kachamīr (= kaśmīr) lsquoKashmirrsquoIn stamps and the first line of the center is Persian انه نم ۱۹۲۳ 1923 [samvat] nim ānah lsquohalfānnārsquo the second line of the center is Dogra71680717077168071709 ada ana lsquohalf ānnārsquo In the first line of thecenter text is Persian انه یك yak ānah lsquoone ānnārsquo the second line of the center is Dogra71686716907168071709 ekana lsquoone ānnārsquo In all four stamps the third line of the center is the date 1923 [samvat] written inDevanagari-like digits १९२३ The Persian text at the bottom of the stamp reads جمون سرکار قلمروکشمیر و qalamrau sarkār jammūn o kaśmīr lsquodominions of the ruler of Jammu and Kashmirrsquo

29

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Detail of top left (Devanagari)

Detail of top right (New Dogra)

Detail of center (New Dogra Devanagari and Urdu)

Figure 15 The mast of a two rupee non-judicial stamp paper from Kashmir State

30

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 16: The mast of a five rupee non-judicial stamp paper from Kashmir State containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.

Figure 17: A one anna stamp from Jammu and Kashmir State with text in New Dogra, Devanagari, and Arabic (Urdu).

Figure 18: Telegraph stamps from Jammu and Kashmir State with text in Dogra, Devanagari, Urdu, and English. The Dogra denominations show usage of Common Indic fractions and the rupee mark.

postakarda kalamrau jammu kasmir va tibbataha: “Postcard of the realm of Jammu, Kashmir, and Tibet”

isa tarapha siva pate ke kucha na likho: “Do not write anything on this side except for the address”

jamu, sinagar: “Jammu”, “Srinagar”

bhaisakh (4 ), 4 vaisakh: “Vaisakh 4”, “4 Vaisakh”

Figure 19: A state postcard from Jammu and Kashmir with postmark of 19 April 1891. The inscription at top is in New Dogra and the content is in Persian. The postmarks are in English and Dogra.

One rupee banknote

Twenty rupee banknote

Figure 20: Banknotes from Jammu and Kashmir State containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.

Chamba     Dogra
U+11680    U+11800
U+11681    U+11801
U+11682    U+11802
U+11683    —
U+11684    U+11804
U+11685    —
U+11686    U+11806
U+11687    U+11807
U+11688    U+11808
U+11689    —

Chamba     Dogra
(no dependent form)
U+116AD    U+1182C
U+116AE    U+1182D
U+116AF    U+1182E
U+116B0    U+1182F
U+116B1    U+11830
U+116B2    U+11832
U+116B3    U+11833
U+116B4    U+11834
U+116B5    U+11835

Table 2: Comparison of Chamba and Dogra vowel letters and signs.

Chamba     Dogra
U+1168A    U+1180A
U+1168B    U+1180B
U+1168C    U+1180C
U+1168D    U+1180D
U+1168E    U+1180E
U+1168F    U+1180F
U+11690    U+11810
U+11691    U+11811
U+11692    U+11812
U+11693    U+11813
U+11694    U+11814
U+11695    U+11815
U+11696    U+11816
U+11697    U+11817
U+11698    U+11818
U+11699    U+11819
U+1169A    U+1181A

Chamba     Dogra
U+1169B    U+1181B
U+1169C    U+1181C
U+1169D    U+1181D
U+1169E    U+1181E
U+1169F    U+1181F
U+116A0    U+11820
U+116A1    U+11821
U+116A2    U+11822
U+116A3    U+11823
U+116A4    U+11824
U+116A5    U+11825
U+116A6    U+11826
U+116A7    U+11827
—          U+11828
U+116A8    U+11829
U+116A9    U+1182A
U+116AA    U+1182B

Table 3: Comparison of Chamba and Dogra consonant letters.
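The pairings in Table 3 are regular when the entries are read as code points, which is an assumption of this sketch: the first Chamba consonant is 71306 (U+1168A) and its Dogra counterpart is 71690 (U+1180A), a fixed offset that shifts by one after the single Dogra entry (71720, U+11828) that has no Chamba counterpart. The function name `chamba_to_dogra` is hypothetical, and the values are those shown in this proposal, which may differ from the assignments in the published standard.

```python
# Values transcribed from Table 3 and read as Unicode code points
# (an assumption of this sketch, not something the table states).
CHAMBA_FIRST, CHAMBA_LAST = 0x1168A, 0x116AA   # 71306 .. 71338
DOGRA_UNPAIRED = 0x11828                       # 71720, no Chamba counterpart
BASE_OFFSET = 0x1180A - 0x1168A                # 384, from the first row


def chamba_to_dogra(cp: int) -> int:
    """Map a Chamba consonant code point to the Dogra code point paired
    with it in Table 3."""
    if not CHAMBA_FIRST <= cp <= CHAMBA_LAST:
        raise ValueError(f"U+{cp:04X} is outside the consonant range of Table 3")
    dogra = cp + BASE_OFFSET
    # Rows after the unpaired Dogra entry are shifted by one position.
    if dogra >= DOGRA_UNPAIRED:
        dogra += 1
    return dogra


for cp in (0x1168A, 0x116A7, 0x116A8, 0x116AA):
    print(f"U+{cp:04X} -> U+{chamba_to_dogra(cp):04X}")
```

A lookup table with one entry per row would work equally well; the offset form is used here only because it makes the single gap in the Chamba column explicit.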

ISO/IEC JTC 1/SC 2/WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646¹

Please fill all the sections A, B and C below. Please read Principles and Procedures Document (P & P) from http://std.dkuug.dk/JTC1/SC2/WG2/docs/principles.html for guidelines and details before filling this form. Please ensure you are using the latest Form from http://std.dkuug.dk/JTC1/SC2/WG2/docs/summaryform.html. See also http://std.dkuug.dk/JTC1/SC2/WG2/docs/roadmaps.html for latest Roadmaps.

A. Administrative

1. Title: Proposal to encode the Dogra script in Unicode
2. Requester's name: Anshuman Pandey (Script Encoding Initiative)
3. Requester type (Member body/Liaison/Individual contribution): Liaison contribution
4. Submission date: 4 November 2015
5. Requester's reference (if applicable):
6. Choose one of the following:
   This is a complete proposal: Yes
   (or) More information will be provided later:

B. Technical – General

1. Choose one of the following:
   a. This proposal is for a new script (set of characters): Yes
      Proposed name of script: Dogra
   b. The proposal is for addition of character(s) to an existing block:
      Name of the existing block:
2. Number of characters in proposal: 59
3. Proposed category (select one from below - see section 2.2 of P&P document):
   A-Contemporary
   B.1-Specialized (small collection)
   B.2-Specialized (large collection)
   C-Major extinct
   D-Attested extinct
   E-Minor extinct: X
   F-Archaic Hieroglyphic or Ideographic
   G-Obscure or questionable usage symbols
4. Is a repertoire including character names provided? Yes
   a. If YES, are the names in accordance with the “character naming guidelines” in Annex L of P&P document?
   b. Are the character shapes attached in a legible form suitable for review?
5. Fonts related:
   a. Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the standard? Anshuman Pandey
   b. Identify the party granting a license for use of the font by the editors (include address, e-mail, ftp-site, etc.): Anshuman Pandey
6. References:
   a. Are references (to other character sets, dictionaries, descriptive texts etc.) provided? Yes
   b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? Yes
7. Special encoding issues: Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration etc. (if yes please enclose information)? Yes
8. Additional Information: Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org for such information on other scripts. Also see Unicode Character Database (http://www.unicode.org/reports/tr44/) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard.

¹ Form number: N4502-F (Original 1994-10-14; Revised 1995-01, 1995-04, 1996-04, 1996-08, 1999-03, 2001-05, 2001-09, 2003-11, 2005-01, 2005-09, 2005-10, 2007-03, 2008-05, 2009-11, 2011-03, 2012-01)

C. Technical - Justification

1. Has this proposal for addition of character(s) been submitted before? No
   If YES, explain:
2. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? Yes
   If YES, with whom? Christopher Shackle (SOAS, London)
   If YES, available relevant documents:
3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? No
   Reference: Historical script, currently not used
4. The context of use for the proposed characters (type of use; common or rare): Common
   Reference:
5. Are the proposed characters in current use by the user community? No
   If YES, where? Reference:
6. After giving due considerations to the principles in the P&P document, must the proposed characters be entirely in the BMP? N/A
   If YES, is a rationale provided?
   If YES, reference:
7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? Yes
8. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
9. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
10. Can any of the proposed character(s) be considered to be similar (in appearance or function) to, or could be confused with, an existing character? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
11. Does the proposal include use of combining characters and/or use of composite sequences? Yes
   If YES, is a rationale for such use provided? Yes
   If YES, reference: See text of proposal
   Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided?
   If YES, reference:
12. Does the proposal contain characters with any special properties such as control function or similar semantics? Yes
   If YES, describe in detail (include attachment if necessary): Virama; see text of proposal
13. Does the proposal contain any Ideographic compatibility characters? No
   If YES, are the equivalent corresponding unified ideographic characters identified?
   If YES, reference:

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Detail of top left (Devanagari)

Detail of top right (New Dogra)

Detail of center (New Dogra Devanagari and Urdu)

Figure 15 The mast of a two rupee non-judicial stamp paper from Kashmir State

30

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 16 The mast of a five rupee non-judicial stamp paper from Kashmir state containing text inNew Dogra Devanagari and Arabic (Persian) scripts

Figure 17 A one anna stamp from Jammu and Kashmir state with text in New Dogra Devanagariand Arabic (Urdu)

31

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Figure 18 Telegraph stamps from Jammu and Kashmir state with text in Dogra Devanagari Urduand English The Dogra denominations show usage of Common Indic fractions and the rupee mark

32

Proposal to encode the Dogra script in Unicode Anshuman Pandey

postakarda kalamrau jammu kasmir va tibbatahaldquoPostcard of the realm of Jammu Kashmir and Tibetrdquo

isa tarapha siva pate ke kucha na likholdquoDo no write anything on this side except for the addressrdquo

jamu sinagarldquoJammurdquo ldquoSrinagarrdquo

bhaisakh (4 ) 4 vaisakhldquoVaisakh 4rdquo ldquo4 Vaisakhrdquo

Figure 19 A state postcard from Jammu and Kashmir with post mark of 19 April 1891 Theinscription at top is in New Dogra and the content is in Persian The postmarks are in English andDogra

33

Proposal to encode the Dogra script in Unicode Anshuman Pandey

One rupee banknote

Twenty rupee banknote

Figure 20 Bank notes from Jammu and Kashmir State containing text in New Dogra Devanagariand Arabic (Persian) scripts

34

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Chamba Dogra

71296 71680

71297 71681

71298 71682

71299 mdash

71300 71684

71301 mdash

71302 71686

71303 71687

71304 71688

71305 mdash

Chamba Dogra

no dependent form

71341 71724

71342 71725

71343 71726

71344 71727

71345 71728

71346 71730

71347 71731

71348 71732

71349 71733

Table 2 Comparison of Chamba and Dogra vowel letters and signs

35

Proposal to encode the Dogra script in Unicode Anshuman Pandey

Chamba Dogra

71306 71690

71307 71691

71308 71692

71309 71693

71310 71694

71311 71695

71312 71696

71313 71697

71314 71698

71315 71699

71316 71700

71317 71701

71318 71702

71319 71703

71320 71704

71321 71705

71322 71706

Chamba Dogra

71323 7170771324 7170871325 7170971326 7171071327 7171171328 7171271329 7171371330 7171471331 7171571332 7171671333 7171771334 7171871335 71719mdash 7172071336 7172171337 7172271338 71723

Table 3 Comparison of Chamba and Dogra consonant letters

36

ISO/IEC JTC 1/SC 2/WG 2
PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS
FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646¹

Please fill all the sections A, B and C below.
Please read Principles and Procedures Document (P & P) from http://std.dkuug.dk/JTC1/SC2/WG2/docs/principles.html for guidelines and details before filling this form.
Please ensure you are using the latest Form from http://std.dkuug.dk/JTC1/SC2/WG2/docs/summaryform.html.
See also http://std.dkuug.dk/JTC1/SC2/WG2/docs/roadmaps.html for latest Roadmaps.

A. Administrative

1. Title: Proposal to encode the Dogra script in Unicode
2. Requester's name: Anshuman Pandey, Script Encoding Initiative
3. Requester type (Member body/Liaison/Individual contribution): Liaison contribution
4. Submission date: 4 November 2015
5. Requester's reference (if applicable):
6. Choose one of the following:
   This is a complete proposal: Yes
   (or) More information will be provided later:

B. Technical - General

1. Choose one of the following:
   a. This proposal is for a new script (set of characters): Yes
      Proposed name of script: Dogra
   b. The proposal is for addition of character(s) to an existing block:
      Name of the existing block:
2. Number of characters in proposal: 59
3. Proposed category (select one from below - see section 2.2 of P&P document):
   A-Contemporary
   B.1-Specialized (small collection)
   B.2-Specialized (large collection)
   C-Major extinct
   D-Attested extinct
   E-Minor extinct: X
   F-Archaic Hieroglyphic or Ideographic
   G-Obscure or questionable usage symbols
4. Is a repertoire including character names provided? Yes
   a. If YES, are the names in accordance with the "character naming guidelines" in Annex L of P&P document?
   b. Are the character shapes attached in a legible form suitable for review?
5. Fonts related:
   a. Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the standard? Anshuman Pandey
   b. Identify the party granting a license for use of the font by the editors (include address, e-mail, ftp-site, etc.): Anshuman Pandey
6. References:
   a. Are references (to other character sets, dictionaries, descriptive texts etc.) provided? Yes
   b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? Yes
7. Special encoding issues: Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration etc. (if yes please enclose information)? Yes
8. Additional Information: Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org for such information on other scripts. Also see Unicode Character Database (http://www.unicode.org/reports/tr44/) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard.

¹ Form number: N4502-F (Original 1994-10-14; Revised 1995-01, 1995-04, 1996-04, 1996-08, 1999-03, 2001-05, 2001-09, 2003-11, 2005-01, 2005-09, 2005-10, 2007-03, 2008-05, 2009-11, 2011-03, 2012-01)

C. Technical - Justification

1. Has this proposal for addition of character(s) been submitted before? No
   If YES, explain:
2. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? Yes
   If YES, with whom? Christopher Shackle (SOAS, London)
   If YES, available relevant documents:
3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? No
   Reference: Historical script, currently not used.
4. The context of use for the proposed characters (type of use; common or rare): Common
   Reference:
5. Are the proposed characters in current use by the user community? No
   If YES, where? Reference:
6. After giving due considerations to the principles in the P&P document, must the proposed characters be entirely in the BMP? N/A
   If YES, is a rationale provided?
   If YES, reference:
7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? Yes
8. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
9. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
10. Can any of the proposed character(s) be considered to be similar (in appearance or function) to, or could be confused with, an existing character? No
    If YES, is a rationale for its inclusion provided?
    If YES, reference:
11. Does the proposal include use of combining characters and/or use of composite sequences? Yes
    If YES, is a rationale for such use provided? Yes
    If YES, reference: See text of proposal
    Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided?
    If YES, reference:
12. Does the proposal contain characters with any special properties such as control function or similar semantics? Yes
    If YES, describe in detail (include attachment if necessary): Virama; see text of proposal
13. Does the proposal contain any Ideographic compatibility characters? No
    If YES, are the equivalent corresponding unified ideographic characters identified?
    If YES, reference:


Figure 16: The mast of a five rupee non-judicial stamp paper from Kashmir state containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.

Figure 17: A one anna stamp from Jammu and Kashmir state with text in New Dogra, Devanagari, and Arabic (Urdu).


Figure 18: Telegraph stamps from Jammu and Kashmir state with text in Dogra, Devanagari, Urdu, and English. The Dogra denominations show usage of Common Indic fractions and the rupee mark.


postakarda kalamrau jammu kasmir va tibbataha
"Postcard of the realm of Jammu, Kashmir, and Tibet"

isa tarapha siva pate ke kucha na likho
"Do not write anything on this side except for the address"

jamu "Jammu"; sinagar "Srinagar"

bhaisakh (4) "Vaisakh 4"; 4 vaisakh "4 Vaisakh"

Figure 19: A state postcard from Jammu and Kashmir with postmark of 19 April 1891. The inscription at top is in New Dogra and the content is in Persian. The postmarks are in English and Dogra.


One rupee banknote

Twenty rupee banknote

Figure 20: Bank notes from Jammu and Kashmir State containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.


Vowel letters

Chamba  Dogra
71296   71680
71297   71681
71298   71682
71299   -
71300   71684
71301   -
71302   71686
71303   71687
71304   71688
71305   -

Vowel signs

Chamba  Dogra
no dependent form
71341   71724
71342   71725
71343   71726
71344   71727
71345   71728
71346   71730
71347   71731
71348   71732
71349   71733

Table 2: Comparison of Chamba and Dogra vowel letters and signs.
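Some Chamba rows in Table 2 carry a dash instead of a Dogra value. As a small illustration, the sketch below stores the vowel-letter rows as pairs and lists the Chamba entries that have no Dogra counterpart in the table as extracted. It assumes the numbers are decimal code points; the names `VOWEL_LETTERS` and `unpaired` are hypothetical.

```python
# Vowel-letter rows of Table 2 as extracted (decimal values).
# None stands for the dash that marks a missing Dogra counterpart.
VOWEL_LETTERS = [
    (71296, 71680), (71297, 71681), (71298, 71682), (71299, None),
    (71300, 71684), (71301, None), (71302, 71686), (71303, 71687),
    (71304, 71688), (71305, None),
]


def unpaired(rows):
    """Return, in U+ notation, the Chamba code points whose Dogra cell is empty."""
    return [f"U+{chamba:04X}" for chamba, dogra in rows if dogra is None]


print(unpaired(VOWEL_LETTERS))
```

Keeping the empty cells explicit, rather than dropping those rows, preserves the information that the comparison found no counterpart for them.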

ISOIEC JTC 1SC 2WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS

FOR ADDITIONS TO THE REPERTOIRE OF ISOIEC 10646TP

1PT

Please fill all the sections A B and C below Please read Principles and Procedures Document (P amp P) from HTUhttpstddkuugdkJTC1SC2WG2docsprincipleshtml UTH for

guidelines and details before filling this form Please ensure you are using the latest Form from HTUhttpstddkuugdkJTC1SC2WG2docssummaryformhtml UTH

See also HTUhttpstddkuugdkJTC1SC2WG2docsroadmapshtml UTH for latest Roadmaps

A Administrative

1 Title Proposal to encode the Dogra script in Unicode 2 Requesters name Anshuman Pandey Script Encoding Initiative 3 Requester type (Member bodyLiaisonIndividual contribution) Liaison contribution 4 Submission date 4 November 2015 5 Requesters reference (if applicable) 6 Choose one of the following This is a complete proposal Yes (or) More information will be provided later

B. Technical - General

1. Choose one of the following:
   a. This proposal is for a new script (set of characters): Yes
      Proposed name of script: Dogra
   b. The proposal is for addition of character(s) to an existing block:
      Name of the existing block:
2. Number of characters in proposal: 59
3. Proposed category (select one from below - see section 2.2 of P&P document):
   A-Contemporary
   B.1-Specialized (small collection)
   B.2-Specialized (large collection)
   C-Major extinct
   D-Attested extinct
   E-Minor extinct: X
   F-Archaic Hieroglyphic or Ideographic
   G-Obscure or questionable usage symbols
4. Is a repertoire including character names provided? Yes
   a. If YES, are the names in accordance with the "character naming guidelines" in Annex L of P&P document?
   b. Are the character shapes attached in a legible form suitable for review?
5. Fonts related:
   a. Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the standard? Anshuman Pandey
   b. Identify the party granting a license for use of the font by the editors (include address, e-mail, ftp-site, etc.): Anshuman Pandey
6. References:
   a. Are references (to other character sets, dictionaries, descriptive texts, etc.) provided? Yes
   b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? Yes
7. Special encoding issues: Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration, etc. (if yes, please enclose information)? Yes
8. Additional Information: Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths, etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org for such information on other scripts. Also see Unicode Character Database (http://www.unicode.org/reports/tr44/) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard.

1 Form number: N4502-F (Original 1994-10-14; Revised 1995-01, 1995-04, 1996-04, 1996-08, 1999-03, 2001-05, 2001-09, 2003-11, 2005-01, 2005-09, 2005-10, 2007-03, 2008-05, 2009-11, 2011-03, 2012-01)

C. Technical - Justification

1. Has this proposal for addition of character(s) been submitted before? No
   If YES, explain:
2. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? Yes
   If YES, with whom? Christopher Shackle (SOAS, London)
   If YES, available relevant documents:
3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? No
   Reference: Historical script currently not used
4. The context of use for the proposed characters (type of use; common or rare): Common
   Reference:
5. Are the proposed characters in current use by the user community? No
   If YES, where? Reference:
6. After giving due considerations to the principles in the P&P document, must the proposed characters be entirely in the BMP? N/A
   If YES, is a rationale provided?
   If YES, reference:
7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? Yes
8. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
9. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
10. Can any of the proposed character(s) be considered to be similar (in appearance or function) to, or could be confused with, an existing character? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
11. Does the proposal include use of combining characters and/or use of composite sequences? Yes
   If YES, is a rationale for such use provided? Yes
   If YES, reference: See text of proposal
   Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided?
   If YES, reference:
12. Does the proposal contain characters with any special properties such as control function or similar semantics? Yes
   If YES, describe in detail (include attachment if necessary): Virama; see text of proposal
13. Does the proposal contain any Ideographic compatibility characters? No
   If YES, are the equivalent corresponding unified ideographic characters identified?
   If YES, reference:


Vowel letters

Chamba    Dogra
71296     71680
71297     71681
71298     71682
71299     -
71300     71684
71301     -
71302     71686
71303     71687
71304     71688
71305     -

Vowel signs

Chamba    Dogra
(no dependent form)
71341     71724
71342     71725
71343     71726
71344     71727
71345     71728
71346     71730
71347     71731
71348     71732
71349     71733

Table 2: Comparison of Chamba and Dogra vowel letters and signs
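The numeric values in Table 2 look like decimal code points that stand in for the glyphs of the original chart. That reading is an assumption, based on the values falling in the Takri block (from U+11680) and the Dogra block (from U+11800). Under that assumption, a minimal Python sketch for converting the vowel-letter rows to U+ notation could look like this; the names `TABLE2_VOWEL_LETTERS`, `u_plus`, and `to_u_plus_rows` are illustrative and not part of the proposal.

```python
# Assumption: each table value is a decimal code point left in place of a glyph.
# None marks a cell that the table shows as empty.
TABLE2_VOWEL_LETTERS = [
    (71296, 71680), (71297, 71681), (71298, 71682), (71299, None),
    (71300, 71684), (71301, None), (71302, 71686), (71303, 71687),
    (71304, 71688), (71305, None),
]


def u_plus(value):
    """Format a decimal code point in U+XXXX notation; '-' for an empty cell."""
    return "-" if value is None else f"U+{value:04X}"


def to_u_plus_rows(pairs):
    """Convert (Chamba, Dogra) decimal pairs to pairs of U+ strings."""
    return [(u_plus(chamba), u_plus(dogra)) for chamba, dogra in pairs]


if __name__ == "__main__":
    print(f"{'Chamba':<10}Dogra")
    for chamba, dogra in to_u_plus_rows(TABLE2_VOWEL_LETTERS):
        print(f"{chamba:<10}{dogra}")
```

The same helper applies unchanged to the vowel-sign rows and to the consonant pairs of Table 3.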


Chamba    Dogra
71306     71690
71307     71691
71308     71692
71309     71693
71310     71694
71311     71695
71312     71696
71313     71697
71314     71698
71315     71699
71316     71700
71317     71701
71318     71702
71319     71703
71320     71704
71321     71705
71322     71706
71323     71707
71324     71708
71325     71709
71326     71710
71327     71711
71328     71712
71329     71713
71330     71714
71331     71715
71332     71716
71333     71717
71334     71718
71335     71719
-         71720
71336     71721
71337     71722
71338     71723

Table 3: Comparison of Chamba and Dogra consonant letters
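If the values in Table 3 are read as decimal code points (an assumption, as the chart originally showed glyphs), the difference between the paired values is not constant across the table: it changes after the one row that has no Chamba counterpart. The sketch below, with the hypothetical names `PAIRS`, `offsets`, and `CHAMBA_TO_DOGRA`, rebuilds the pairs from the table and checks this.

```python
# Table 3 pairs as (Chamba, Dogra); assumed to be decimal code points.
# None marks the row where the table shows no Chamba counterpart.
PAIRS = (
    list(zip(range(71306, 71336), range(71690, 71720)))
    + [(None, 71720)]
    + list(zip(range(71336, 71339), range(71721, 71724)))
)


def offsets(pairs):
    """Return the distinct Dogra-minus-Chamba differences, in row order."""
    seen = []
    for chamba, dogra in pairs:
        if chamba is None or dogra is None:
            continue
        diff = dogra - chamba
        if diff not in seen:
            seen.append(diff)
    return seen


# Explicit lookup, since a single fixed offset does not cover every row.
CHAMBA_TO_DOGRA = {c: d for c, d in PAIRS if c is not None}

if __name__ == "__main__":
    print(offsets(PAIRS))
    print(f"U+{CHAMBA_TO_DOGRA[71336]:04X}")
```

Because the offset is not uniform, any conversion between the two sets of values would have to use a lookup table such as `CHAMBA_TO_DOGRA` rather than fixed arithmetic.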





