L2/15-234R
2015-11-04
Proposal to encode the Dogra script in Unicode
Anshuman Pandey
Department of Linguistics
University of California, Berkeley
Berkeley, California, U.S.A.
anshuman.pandey@berkeley.edu
November 4, 2015
1 Introduction
This is a proposal to encode the Dogra script in Unicode. Dogra was briefly described in “Proposal to Encode the Takri Script in ISO/IEC 10646” (L2/09-424) as one of several scripts to be unified in the ‘Takri’ block. The block was intended for representing several related scripts of Himachal Pradesh, Jammu, Kashmir, and Panjab by means of a generic character repertoire until additional research on individual scripts could be conducted. As described in L2/09-424, the representative glyphs for characters in the ‘Takri’ block are based upon the standard printed form of the Chamba, or Chambeali, form of Takri. The Chamba form was selected because materials in the script were readily available. Although Dogra has its own standardized form, it was stated in L2/09-424 that there were insufficient materials available for developing a formal encoding. Dogra was therefore grouped with ‘Takri’ on the basis of historical typology and as a matter of practicality. My recent research on various ‘Takri’ scripts has yielded additional sources and information that support the encoding of Dogra as an independent script,¹ especially because of its official status in the former State of Jammu and Kashmir and on account of the differences between it and the standard Chamba script (see tables 2 and 3). A step towards encoding Dogra was taken through the submission of a preliminary proposal (L2/15-213). The present document supersedes L2/15-213 and provides additional information and a number of new sources that support the encoding of Dogra in Unicode.
2 Background
The Dogra script is historically associated with the Dogri language (ISO 639-3: doi), but it is not actively used at present. In 1916, George A. Grierson wrote in the Linguistic Survey of India that “Ḍōgrā has an alphabet of its own which is allied to the Ṭakrī alphabet current in the Punjab Himalayas” (1916: 638). Although it was used mostly for informal communication and commercial activities, Dogra was standardized in the 1860s. Grierson noted that “[s]ome thirty or forty years ago the then Mahārājā of Jammu and Kashmir caused to be invented a modified form of the current Ṭakrī so as to bring it more into line with Dēvanāgarī and Gurmukhī” and “[t]his improved Ḍōgrī is used for official documents but it has not generally displaced the old Ṭakrī form of script” (ibid.). This official form of Dogra is known as ‘Name Dogra Akkhar’ or the ‘New Dogra Script’. It was the official script of the State of Jammu and Kashmir during the reign of Maharaja Ranbir Singh (r. 1857–1885), who is the ruler referred to by Grierson. The New Dogra script was used for administrative purposes and upon currency, judicial and non-judicial stamp papers, postcards, and postage stamps. It was also used for literary activities. The first book published in the standard Dogra was a translation of a Sanskrit mathematical treatise by Bhāskarācārya titled Līlavatī. The work was commissioned by Ranbir Singh and printed in 1872 at Vidya Vilas Press, the first printing press in Jammu.

¹ The Jaunsari or Sirmauri script was also briefly described in L2/09-424 and unified with the Takri block. It is another potential candidate for separate encoding in Unicode.
The old and standard forms of Dogra differ primarily in the shapes of letterforms and the representation of consonant clusters. Specific differences are described throughout this proposal. As mentioned by Grierson, standard Dogra was influenced by Devanagari and Gurmukhi. The motivation behind such a move is not clear, but it could be related to the idea of providing the Dogra script with an air of ‘authority’ by giving it a form that resembled scripts which were already endowed with ‘official’ and ‘literary’ attributes. The standardization of Dogra in the 1860s established two distinctive forms of the script. Throughout this document the common written form is called ‘Old Dogra’ and the standard form is referred to as ‘New Dogra’.
The Dogra script is no longer used actively, and the Dogri language is now generally written in Devanagari. Nonetheless, the script continues to attract the interest of philatelists who collect ephemera from Jammu and Kashmir. Although there is very little scholarship on the script, it was described briefly by the prominent Sanskrit scholar Fritz Staal in his Stamps of Jammu & Kashmir (New York: The Collectors Club, 1984). Information on Dogra is also curated on a philately website called “Collecting Kashmir”, maintained by Carol von der Lin.
3 Proposed Encoding
3.1 Approach
The justification for encoding Dogra as a separate script in Unicode is based upon the following criteria: 1) graphical differences between Dogra, Chamba, and other forms of Takri; 2) usage of the script in an official capacity; 3) cultural and political recognition of it as a distinctive script; 4) differentiation into ‘old’ and ‘standard’ forms. Of these, the fourth point requires a means for handling the internal distinctions of the script. A separate block for Dogra would provide a means to maintain the script’s external identity from other related scripts and also to maintain the identity of styles internally.
3.2 Structure
The Dogra script is a Brahmi-based alphasyllabary that is written from left to right. Independent and initial vowels are written using letters, while dependent vowels are expressed using combining signs. Consonant letters possess the inherent vowel a, which is changed by the attachment of a vowel sign. Consonant clusters are written in various ways, such as with visible halanta, as an atomic ligature, and with half-forms; repha is attested, but rarely used. There are no formal conventions regarding representation of conjuncts.
3.3 Script name
The designation ‘Dogra’ has been assigned to the script block in Unicode. The script is referred to as both ‘Dogra’ and ‘Dogri’ in English secondary sources. The name ‘Dogra’ is preferred over ‘Dogri’ because it links the script with the Dogra dynasty of Jammu and Kashmir, under whose government it was established as the official script. Secondly, ‘Dogri’ suggests a connection with the language of the same name; however, the language is no longer written in the script. The script was officially referred to as dvigarta akṣara or “Dogra letters”, where ‘Dvigarta’ is presented as the Sanskritic root of the name ‘Dogra’. Another indigenous name for the script is name dogra akkhar or “New Dogra letters”.
3.4 Representative glyphs
The representative glyphs for Dogra characters are based primarily upon forms used in New Dogra. Characters from Old Dogra have been used when necessary, sourced primarily from figures 1 and 2. The font has been designed by the proposal author.
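3.5 Vowel letters
There are 10 vowel letters:
U+11800 DOGRA LETTER A
U+11801 DOGRA LETTER AA
U+11802 DOGRA LETTER I
U+11803 DOGRA LETTER II
U+11804 DOGRA LETTER U
U+11805 DOGRA LETTER UU
U+11806 DOGRA LETTER E
U+11807 DOGRA LETTER AI
U+11808 DOGRA LETTER O
U+11809 DOGRA LETTER AU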
In Old Dogra the values of the vowel letters U+11803, U+11802, and [glyph] are variable. As shown in the accompanying sources, the letter U+11803 was used for both i and ī; U+11802 for u and ū; and [glyph] for u, ū, o, and au. In New Dogra the values of these three letters became fixed, and new letters were introduced in order to provide distinctive representations for all basic vowels. It is apparent that the attempt to stabilize the vowel repertoire resulted in overlap and reassignment of several letters. The Old Dogra U+11803 (i, ī) was retained solely for ī, and U+11802 (u, ū) was reassigned for i. A modified form of [glyph] (u, o) came to represent u, while the form U+11805 was introduced for ū, which is certainly an adaptation of the letter ū from Devanagari or Sharada. Distinct letters for U+11808 o and U+11809 au were provided by placing the corresponding dependent vowel signs upon the modified u.

Old        New
U+11803    U+11802
-          U+11803
U+11802    U+11804
-          U+11805
[glyph]    U+11808
[glyph]    U+11809
The proposed vowel letters are based upon the repertoire of New Dogra. This approach completely accommodates vowels in the old and new forms of the script. Although the names of New Dogra vowel letters do not strictly correlate with values of Old Dogra vowels, the proposed letters can be used for representing the old script. The DOGRA LETTER I (U+11802) can be used for u, DOGRA LETTER II (U+11803) can be used for i, and DOGRA LETTER U (U+11804) can be used for o and au; the height and curvature of the left stroke may be considered a stylistic attribute.
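3.6 Vowel Signs
There are 10 dependent vowel signs:
U+1182C DOGRA VOWEL SIGN AA
U+1182D DOGRA VOWEL SIGN I
U+1182E DOGRA VOWEL SIGN II
U+1182F DOGRA VOWEL SIGN U
U+11830 DOGRA VOWEL SIGN UU
U+11831 DOGRA VOWEL SIGN VOCALIC R
U+11832 DOGRA VOWEL SIGN E
U+11833 DOGRA VOWEL SIGN AI
U+11834 DOGRA VOWEL SIGN O
U+11835 DOGRA VOWEL SIGN AU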
The DOGRA VOWEL SIGN VOCALIC R (U+11831) appears to have been introduced as part of New Dogra. It is used for representing words of Sanskrit origin that contain the vocalic sound r. Its shape is derived from either U+11190 or U+0943. There is no corresponding independent vowel letter.
There is an alphabetic aspect to the representation of consonant-vowel syllables in written Dogra. The dependent vowel may be represented using the independent vowel letter as well as the vowel sign, e.g. <U+1180A DOGRA LETTER KA, U+11806 DOGRA LETTER E> may be used in place of <U+1180A DOGRA LETTER KA, U+11832 DOGRA VOWEL SIGN E> for the syllable ke.
There are differences in the shape of certain vowel signs between the old and new script. The typical Takri forms of the vowel signs -u and -ū, which also resemble corresponding forms in Gurmukhi, were replaced with forms resembling the Devanagari signs, U+1182F and U+11830 respectively. The typical Takri style of the vowel sign o was applied for writing U+11835 au, while a simplified form of o, U+11834, was used for o. The duplication of o as [glyph] for writing au was eliminated. The old and new forms are compared for the following proposed characters:

U+1182D DOGRA VOWEL SIGN I
U+1182E DOGRA VOWEL SIGN II
U+1182F DOGRA VOWEL SIGN U
U+11830 DOGRA VOWEL SIGN UU
U+11834 DOGRA VOWEL SIGN O
U+11835 DOGRA VOWEL SIGN AU
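3.7 Consonants
There are 34 consonant letters:
U+1180A DOGRA LETTER KA
U+1180B DOGRA LETTER KHA
U+1180C DOGRA LETTER GA
U+1180D DOGRA LETTER GHA
U+1180E DOGRA LETTER NGA
U+1180F DOGRA LETTER CA
U+11810 DOGRA LETTER CHA
U+11811 DOGRA LETTER JA
U+11812 DOGRA LETTER JHA
U+11813 DOGRA LETTER NYA
U+11814 DOGRA LETTER TTA
U+11815 DOGRA LETTER TTHA
U+11816 DOGRA LETTER DDA
U+11817 DOGRA LETTER DDHA
U+11818 DOGRA LETTER NNA
U+11819 DOGRA LETTER TA
U+1181A DOGRA LETTER THA
U+1181B DOGRA LETTER DA
U+1181C DOGRA LETTER DHA
U+1181D DOGRA LETTER NA
U+1181E DOGRA LETTER PA
U+1181F DOGRA LETTER PHA
U+11820 DOGRA LETTER BA
U+11821 DOGRA LETTER BHA
U+11822 DOGRA LETTER MA
U+11823 DOGRA LETTER YA
U+11824 DOGRA LETTER RA
U+11825 DOGRA LETTER LA
U+11826 DOGRA LETTER VA
U+11827 DOGRA LETTER SHA
U+11828 DOGRA LETTER SSA
U+11829 DOGRA LETTER SA
U+1182A DOGRA LETTER HA
U+1182B DOGRA LETTER RRA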
The DOGRA LETTER SSA (U+11828) was introduced into New Dogra for representing the retroflex sibilant ṣa, which occurs in words of Sanskrit origin. It resembles SHARADA LETTER SSA (U+111B0).
There are glyphic variants of several consonant letters. Regular and variant forms are attested for:
U+1180C DOGRA LETTER GA
U+1180D DOGRA LETTER GHA
U+11810 DOGRA LETTER CHA
U+11812 DOGRA LETTER JHA
U+1181A DOGRA LETTER THA
U+1181C DOGRA LETTER DHA
U+11824 DOGRA LETTER RA
There is evidence of usage of different forms of the same letter in the available sources. For instance, both the regular form U+11810 and its variant [glyph] are used for DOGRA LETTER CHA.
3.8 Anusvara
The DOGRA SIGN ANUSVARA (U+11836) is used for marking nasalization.
3.9 Visarga
The DOGRA SIGN VISARGA (U+11837) is used for indicating post-vocalic aspiration in words of Sanskrit origin. It occurs in the word <U+1181D, U+11822, U+11837> namaḥ “homage to”.
3.10 Nukta
The DOGRA SIGN NUKTA (U+11839) is used for representing sounds that are not native to Dogri and related languages.
3.11 Virama
The DOGRA SIGN VIRAMA (U+11838) has two functions, similar to the corresponding character of Devanagari. It is used as a halanta for marking the absence of the inherent vowel of a consonant letter. It is also a control character that is used for producing conjuncts.
3.12 Consonant Conjuncts
Consonant clusters are rendered in three ways: 1) with visible halanta beneath the full form of each bare consonant; 2) with a half-form of the initial consonant followed by the full form of the following letter; and 3) as ligatures. There are no formal conventions for the written expression of conjuncts. The DOGRA SIGN VIRAMA (U+11838) is used for the encoded representation of conjuncts.
3.12.1 Types of conjuncts
Some representations of Dogra conjuncts, as found in the available sources, are described below.
• Visible halanta: In both Old and New Dogra, the most common method of representing consonant clusters is by placing a halanta beneath each bare consonant.
• Half-forms: Another method is to use half-forms of letters, if the graphical structure provides such an opportunity. An example is the conjunct śra in the word śrī (a Sanskrit honorific). The half form is produced by dropping the right vertical bar of DOGRA LETTER SHA (U+11827). The conjunct śra is used consistently in New Dogra for representing the cluster śra; the form <U+11827, U+11838, U+11824>, with visible halanta beneath a full form of DOGRA LETTER SHA, does not occur in the available sources.
• Ligatures: There are two types of ligature. In the first, the shapes of individual letters are visible. An example is the conjunct sṭa in the word sṭāṃp, a transcription of the English “stamp”. The second type is an atomic (akhaṇḍa) ligature, whose form may be considered a distinctive letter. An example is the atomic ligature kṣa, which occurs in the word akṣaroṃ “in letters” (oblique form of akṣara “letter”). The conjunct kṣa is used consistently in New Dogra for representing the cluster kṣa. Its shape may be influenced by Devanagari kṣa.
• Forms of RA: Special conjunct forms of DOGRA LETTER RA (U+11824), e.g. repha or vattu, do not generally occur. Cluster-initial RA is represented as a full form with halanta, as in rṇa in the word saṃpūrṇa, and non-initial RA is treated similarly, as in pra in the word prabhu.
However, there is one instance in the available sources in which cluster-initial RA is represented as a repha. The occurrence is curious because the word caturbhuja “square”, in which repha occurs, appears three times in proximity but is spelled differently in all three instances.
In the instance at the top, the RA in caturbhuja is written with halanta: <U+1180F, U+11819, U+1182F, U+11824, U+11838, U+11821, U+1182F, U+11811> caturbhuja. In the instance at bottom left, RA occurs with its inherent vowel intact, without halanta: <U+1180F, U+11819, U+1182F, U+11824, U+11821, U+1182F, U+11811> caturabhuja. At bottom right, ra is represented as repha in caturbhuja.
The usage of repha in Dogra is to be considered anomalous. There is no present need to support this form of RA in encoded representation.
3.12.2 Encoded representation of conjuncts
The Devanagari model for the representation of conjuncts is appropriate for Dogra. In encoded text, conjuncts are produced by placing the control character DOGRA SIGN VIRAMA (U+11838) after each non-final consonant in a cluster, i.e. <(C, VIRAMA) C>.
Representations of conjuncts rendered as atomic ligatures are:
kṣa <U+1180A KA, U+11838 VIRAMA, U+11828 SSA>
sṭa <U+11829 SA, U+11838 VIRAMA, U+11814 TTA>
Examples of conjuncts rendered using half-forms are:
śra <U+11827 SHA, U+11838 VIRAMA, U+11824 RA>
Conjuncts displayed using visible halanta are:
pra <U+1181E PA, U+11838 VIRAMA, U+11824 RA>
rṇa <U+11824 RA, U+11838 VIRAMA, U+11818 NNA>
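The encoding model above may be illustrated with a short Python sketch. It assumes the code points proposed in this document (U+11838 for the virama, and so on), which may not match the values in any later published standard; the function names `encode_conjunct` and `code_points` are hypothetical helpers introduced only for this example.

```python
from typing import List

# Code points as proposed in this document (assumed here for illustration).
VIRAMA = 0x11838            # DOGRA SIGN VIRAMA
KA, SSA = 0x1180A, 0x11828  # components of kṣa
SA, TTA = 0x11829, 0x11814  # components of sṭa
SHA, RA = 0x11827, 0x11824  # components of śra


def encode_conjunct(*consonants: int) -> str:
    """Return the sequence <(C, VIRAMA) C> for the given consonants."""
    if not consonants:
        raise ValueError("a conjunct needs at least one consonant")
    # Joining with the virama places it after every consonant except the last.
    return chr(VIRAMA).join(chr(cp) for cp in consonants)


def code_points(text: str) -> List[str]:
    """List the code points of a string in U+XXXX notation."""
    return ["U+%04X" % ord(ch) for ch in text]


print(code_points(encode_conjunct(KA, SSA)))  # kṣa
print(code_points(encode_conjunct(SA, TTA)))  # sṭa
print(code_points(encode_conjunct(SHA, RA)))  # śra
```

Whether a given sequence is displayed as a ligature, a half-form, or with visible halanta is left to the font, so the sketch only produces the encoded text.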
3.12.3 Modifying conjunct formation
As is the case in Devanagari and other Indic scripts, a ligature may be broken using the generic control character U+200C ZERO WIDTH NON-JOINER:
kṣa (ligature) <U+1180A KA, U+11838 VIRAMA, U+11828 SSA>
kṣa (visible virama) <U+1180A KA, U+11838 VIRAMA, U+200C ZERO WIDTH NON-JOINER, U+11828 SSA>
The rendering of a conjunct as a ligature or with visible virama is a matter of font design.
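A minimal sketch of this mechanism in Python follows. It assumes the proposed code points U+11838 (virama) and U+1180A to U+1182B (consonants); `break_ligatures` is a hypothetical helper name, not part of any library.

```python
# Code points as proposed in this document (assumed here for illustration).
VIRAMA = "\U00011838"  # DOGRA SIGN VIRAMA
ZWNJ = "\u200C"        # ZERO WIDTH NON-JOINER
FIRST_CONSONANT, LAST_CONSONANT = 0x1180A, 0x1182B


def is_consonant(ch: str) -> bool:
    """True if ch falls in the proposed Dogra consonant range."""
    return FIRST_CONSONANT <= ord(ch) <= LAST_CONSONANT


def break_ligatures(text: str) -> str:
    """Insert ZWNJ after each virama that is directly followed by a consonant."""
    out = []
    for i, ch in enumerate(text):
        out.append(ch)
        following = text[i + 1] if i + 1 < len(text) else ""
        if ch == VIRAMA and following and is_consonant(following):
            out.append(ZWNJ)
    return "".join(out)


ksa = "\U0001180A" + VIRAMA + "\U00011828"  # <KA, VIRAMA, SSA>
print(["U+%04X" % ord(ch) for ch in break_ligatures(ksa)])
```

Because a virama that is already followed by ZWNJ is left alone, applying the function twice gives the same result as applying it once.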
3.13 Punctuation
Abbreviation sign: The DOGRA ABBREVIATION SIGN (U+1183A) is used for indicating truncation of text. For example, the text <U+11825, U+1182E, U+1183A> <U+11826, U+1183A> <U+11821, U+1182C, U+1183A> (lī va bhā 23) is an abbreviation for līla vatī bhāga 23.
Danda-s: The daṇḍā and double daṇḍā are commonly used in Dogra documents for marking the ends of sentences and paragraphs. These daṇḍā-s are not proposed for encoding as separate characters in the ‘Dogra’ block. They resemble U+0964 DEVANAGARI DANDA and U+0965 DEVANAGARI DOUBLE DANDA and may be represented using these characters. The Devanagari daṇḍā-s are to be specified as extensions for Dogra.
Other signs: Other marks of punctuation, such as dashes, are also used. A hyphen is used in some cases for indicating breaks in a word at end of line (see section 3.16).
3.14 Digits
Digits are commonly used in Dogra documents (see figure 11). The style of digits varies across written and printed sources. Some digits in Old Dogra sources resemble those of Takri (see figure 4), while those in New Dogra sources are similar to Devanagari. The Līlavatī, written in New Dogra, shows the digits 1–9 and the number 10.
The majority of these digits resemble Devanagari forms; the exception is ‘four’. The form of Dogra ‘four’ does not resemble TAKRI DIGIT FOUR (U+116C4), which is nearly identical to the Devanagari digit four. Information about its origins is not known at this time, but it may be related to SHARADA DIGIT FOUR (U+111D4). The forms of ‘eight’ and ‘nine’ are well-known variant forms of the regular Devanagari digits.
Additional research into Dogra digits is required before they may be considered for encoding. The available sources suggest that there may have been a cultural understanding that Dogra possessed unique digits. For instance, on the bank notes shown in figure 20, the digits written in Devanagari are rather similar, but are nonetheless displayed twice in order to align the display of the currency value with each script used for the inscriptions.
At present, script-specific digits are not proposed for inclusion in the Dogra repertoire. Devanagari digits should be used normatively and should be extended for usage with Dogra. If future research yields distinctive digits for Dogra, there is sufficient space in the block to accommodate their encoding.
3.15 Number forms
Fraction signs and currency marks are commonly used in Dogra sources (see figure 12). These may be represented using the fraction and currency signs already encoded in the ‘Common Indic Number Forms’ block (U+A830). These characters are specified as extensions for Dogra. The ‘Common Indic Number Forms’ are described briefly in The Unicode Standard (chapter 22.3, ‘Numerals’, pp. 770–771) and in more detail in the original encoding proposal (L2/07-354).
3.16 Linebreaking
Linebreaking occurs after an orthographic syllable. Breaks are generally not indicated in informal sources, but are marked in literary materials using hyphens.
3.17 Collation
The primary collating order for Dogra is as follows:
U+11800 < U+11801 < U+11802 < U+11803 < U+11804 < U+11805 < U+11806 < U+11807 < U+11808 < U+11809 <
U+1180A < U+1180B < U+1180C < U+1180D < U+1180E < U+1180F < U+11810 < U+11811 <
U+11812 < U+11813 < U+11814 < U+11815 < U+11816 < U+11817 < U+11818 < U+11819 <
U+1181A < U+1181B < U+1181C < U+1181D < U+1181E < U+1181F < U+11820 < U+11821 <
U+11822 < U+11823 < U+11824 < U+11825 < U+11826 < U+11827 < U+11828 < U+11829 < U+1182A <
U+1182B < U+1182C < U+1182D < U+1182E <
U+1182F < U+11830 < U+11831 < U+11832 <
U+11833 < U+11834 < U+11835
The following characters have secondary weights: U+11836 DOGRA SIGN ANUSVARA, U+11837 DOGRA SIGN VISARGA, U+11838 DOGRA SIGN VIRAMA, U+11839 DOGRA SIGN NUKTA.
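The two-level ordering described above can be approximated with a simple sort key. The sketch below is a deliberate simplification and not an implementation of the Unicode Collation Algorithm: it assumes the proposed code points, treats the primary order as plain code point order from U+11800 to U+11835, compares the four signs with secondary weights only when the primary keys are equal, and ignores any other character. The name `dogra_sort_key` is hypothetical.

```python
# Ranges as proposed in this document (assumed here for illustration).
PRIMARY = range(0x11800, 0x11836)    # letters and vowel signs, in chart order
SECONDARY = range(0x11836, 0x1183A)  # anusvara, visarga, virama, nukta


def dogra_sort_key(word: str):
    """Return a (primary, secondary) key; characters outside both ranges are ignored."""
    primary = tuple(ord(ch) - PRIMARY.start for ch in word if ord(ch) in PRIMARY)
    secondary = tuple(ord(ch) - SECONDARY.start for ch in word if ord(ch) in SECONDARY)
    return (primary, secondary)


A, KA, AA_SIGN, ANUSVARA = "\U00011800", "\U0001180A", "\U0001182C", "\U00011836"
words = [KA + AA_SIGN, KA + ANUSVARA, KA, A]
ordered = sorted(words, key=dogra_sort_key)
print([["U+%04X" % ord(ch) for ch in w] for w in ordered])
```

With this key, a word that differs from another only by an anusvara sorts directly after it, before any word with a further primary-weight character.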
4 Character data
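4.1 Character properties
The properties for Dogra in the Unicode Character Database format are:
11800;DOGRA LETTER A;Lo;0;L;;;;;N;;;;;
11801;DOGRA LETTER AA;Lo;0;L;;;;;N;;;;;
11802;DOGRA LETTER I;Lo;0;L;;;;;N;;;;;
11803;DOGRA LETTER II;Lo;0;L;;;;;N;;;;;
11804;DOGRA LETTER U;Lo;0;L;;;;;N;;;;;
11805;DOGRA LETTER UU;Lo;0;L;;;;;N;;;;;
11806;DOGRA LETTER E;Lo;0;L;;;;;N;;;;;
11807;DOGRA LETTER AI;Lo;0;L;;;;;N;;;;;
11808;DOGRA LETTER O;Lo;0;L;;;;;N;;;;;
11809;DOGRA LETTER AU;Lo;0;L;;;;;N;;;;;
1180A;DOGRA LETTER KA;Lo;0;L;;;;;N;;;;;
1180B;DOGRA LETTER KHA;Lo;0;L;;;;;N;;;;;
1180C;DOGRA LETTER GA;Lo;0;L;;;;;N;;;;;
1180D;DOGRA LETTER GHA;Lo;0;L;;;;;N;;;;;
1180E;DOGRA LETTER NGA;Lo;0;L;;;;;N;;;;;
1180F;DOGRA LETTER CA;Lo;0;L;;;;;N;;;;;
11810;DOGRA LETTER CHA;Lo;0;L;;;;;N;;;;;
11811;DOGRA LETTER JA;Lo;0;L;;;;;N;;;;;
11812;DOGRA LETTER JHA;Lo;0;L;;;;;N;;;;;
11813;DOGRA LETTER NYA;Lo;0;L;;;;;N;;;;;
11814;DOGRA LETTER TTA;Lo;0;L;;;;;N;;;;;
11815;DOGRA LETTER TTHA;Lo;0;L;;;;;N;;;;;
11816;DOGRA LETTER DDA;Lo;0;L;;;;;N;;;;;
11817;DOGRA LETTER DDHA;Lo;0;L;;;;;N;;;;;
11818;DOGRA LETTER NNA;Lo;0;L;;;;;N;;;;;
11819;DOGRA LETTER TA;Lo;0;L;;;;;N;;;;;
1181A;DOGRA LETTER THA;Lo;0;L;;;;;N;;;;;
1181B;DOGRA LETTER DA;Lo;0;L;;;;;N;;;;;
1181C;DOGRA LETTER DHA;Lo;0;L;;;;;N;;;;;
1181D;DOGRA LETTER NA;Lo;0;L;;;;;N;;;;;
1181E;DOGRA LETTER PA;Lo;0;L;;;;;N;;;;;
1181F;DOGRA LETTER PHA;Lo;0;L;;;;;N;;;;;
11820;DOGRA LETTER BA;Lo;0;L;;;;;N;;;;;
11821;DOGRA LETTER BHA;Lo;0;L;;;;;N;;;;;
11822;DOGRA LETTER MA;Lo;0;L;;;;;N;;;;;
11823;DOGRA LETTER YA;Lo;0;L;;;;;N;;;;;
11824;DOGRA LETTER RA;Lo;0;L;;;;;N;;;;;
11825;DOGRA LETTER LA;Lo;0;L;;;;;N;;;;;
11826;DOGRA LETTER VA;Lo;0;L;;;;;N;;;;;
11827;DOGRA LETTER SHA;Lo;0;L;;;;;N;;;;;
11828;DOGRA LETTER SSA;Lo;0;L;;;;;N;;;;;
11829;DOGRA LETTER SA;Lo;0;L;;;;;N;;;;;
1182A;DOGRA LETTER HA;Lo;0;L;;;;;N;;;;;
1182B;DOGRA LETTER RRA;Lo;0;L;;;;;N;;;;;
1182C;DOGRA VOWEL SIGN AA;Mc;0;L;;;;;N;;;;;
1182D;DOGRA VOWEL SIGN I;Mc;0;L;;;;;N;;;;;
1182E;DOGRA VOWEL SIGN II;Mc;0;L;;;;;N;;;;;
1182F;DOGRA VOWEL SIGN U;Mn;0;NSM;;;;;N;;;;;
11830;DOGRA VOWEL SIGN UU;Mn;0;NSM;;;;;N;;;;;
11831;DOGRA VOWEL SIGN VOCALIC R;Mn;0;NSM;;;;;N;;;;;
11832;DOGRA VOWEL SIGN E;Mn;0;NSM;;;;;N;;;;;
11833;DOGRA VOWEL SIGN AI;Mn;0;NSM;;;;;N;;;;;
11834;DOGRA VOWEL SIGN O;Mn;0;NSM;;;;;N;;;;;
11835;DOGRA VOWEL SIGN AU;Mn;0;NSM;;;;;N;;;;;
11836;DOGRA SIGN ANUSVARA;Mn;0;NSM;;;;;N;;;;;
11837;DOGRA SIGN VISARGA;Mc;0;L;;;;;N;;;;;
11838;DOGRA SIGN VIRAMA;Mn;9;NSM;;;;;N;;;;;
11839;DOGRA SIGN NUKTA;Mn;7;NSM;;;;;N;;;;;
1183A;DOGRA ABBREVIATION SIGN;Po;0;L;;;;;N;;;;;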
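Records of this kind can be read mechanically. The Python sketch below assumes the standard 15-field, semicolon-delimited layout of UnicodeData.txt and extracts only the fields that carry values for Dogra; `CharRecord` and `parse_unicode_data_line` are hypothetical names used for illustration.

```python
from typing import NamedTuple


class CharRecord(NamedTuple):
    code_point: int
    name: str
    general_category: str
    combining_class: int
    bidi_class: str
    mirrored: bool


def parse_unicode_data_line(line: str) -> CharRecord:
    """Parse one semicolon-delimited record (15 fields are assumed)."""
    fields = line.strip().split(";")
    if len(fields) != 15:
        raise ValueError("expected 15 fields, got %d" % len(fields))
    return CharRecord(
        code_point=int(fields[0], 16),   # field 0: code point in hexadecimal
        name=fields[1],                  # field 1: character name
        general_category=fields[2],      # field 2: e.g. Lo, Mc, Mn, Po
        combining_class=int(fields[3]),  # field 3: canonical combining class
        bidi_class=fields[4],            # field 4: e.g. L, NSM
        mirrored=(fields[9] == "Y"),     # field 9: bidi mirrored flag
    )


virama = parse_unicode_data_line("11838;DOGRA SIGN VIRAMA;Mn;9;NSM;;;;;N;;;;;")
print(virama)
```

The sample record is the virama line as proposed in this document; its combining class of 9 is what distinguishes it from the other nonspacing marks in the block.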
4.2 Linebreaking
Linebreaking properties are given in the data format of LineBreak.txt:
11800..1182B;AL # Lo [44] DOGRA LETTER A..DOGRA LETTER RRA
1182C..1182E;CM # Mc [3] DOGRA VOWEL SIGN AA..DOGRA VOWEL SIGN II
1182F..11835;CM # Mn [7] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN AU
11836;CM # Mn DOGRA SIGN ANUSVARA
11837;CM # Mc DOGRA SIGN VISARGA
11838..11839;CM # Mn [2] DOGRA SIGN VIRAMA..DOGRA SIGN NUKTA
1183A;AL # Po DOGRA ABBREVIATION SIGN
4.3 Syllabic categories
Syllabic categories are given in the format of IndicSyllabicCategory.txt:
# Indic_Syllabic_Category=Bindu
11836 ; Bindu # Mn DOGRA SIGN ANUSVARA
# Indic_Syllabic_Category=Visarga
11837 ; Visarga # Mc DOGRA SIGN VISARGA
# Indic_Syllabic_Category=Nukta
11839 ; Nukta # Mn DOGRA SIGN NUKTA
# Indic_Syllabic_Category=Virama
11838 ; Virama # Mn DOGRA SIGN VIRAMA
# Indic_Syllabic_Category=Vowel_Independent
11800..11809 ; Vowel_Independent # Lo [10] DOGRA LETTER A..DOGRA LETTER AU
# Indic_Syllabic_Category=Vowel_Dependent
1182C..1182E ; Vowel_Dependent # Mc [3] DOGRA VOWEL SIGN AA..DOGRA VOWEL SIGN II
1182F..11835 ; Vowel_Dependent # Mn [7] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN AU
# Indic_Syllabic_Category=Consonant
1180A..1182B ; Consonant # Lo [34] DOGRA LETTER KA..DOGRA LETTER RRA
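As an illustration of how such data might be consumed, the following Python sketch looks up the syllabic category of a code point. The ranges follow the code chart in this proposal (ten independent vowels at U+11800 to U+11809) and are assumptions of the sketch; `syllabic_category` is a hypothetical helper, and "Other" is used here simply as a fallback label.

```python
# (first, last, category) triples; ranges follow the chart in this proposal.
SYLLABIC_CATEGORIES = [
    (0x11800, 0x11809, "Vowel_Independent"),
    (0x1180A, 0x1182B, "Consonant"),
    (0x1182C, 0x11835, "Vowel_Dependent"),
    (0x11836, 0x11836, "Bindu"),
    (0x11837, 0x11837, "Visarga"),
    (0x11838, 0x11838, "Virama"),
    (0x11839, 0x11839, "Nukta"),
]


def syllabic_category(code_point: int) -> str:
    """Return the category for a code point, or "Other" if none is listed."""
    for first, last, category in SYLLABIC_CATEGORIES:
        if first <= code_point <= last:
            return category
    return "Other"


# Categories for the sequence <KA, VIRAMA, SSA, VOWEL SIGN AA>
sequence = [0x1180A, 0x11838, 0x11828, 0x1182C]
print([syllabic_category(cp) for cp in sequence])
```

The abbreviation sign U+1183A is not listed in the syllabic data above, so the sketch reports it as "Other".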
4.4 Positional categories
Positional data for characters are given in the format of IndicPositionalCategory.txt:
# Indic_Positional_Category=Right
1182C ; Right # Mc DOGRA VOWEL SIGN AA
1182E ; Right # Mc DOGRA VOWEL SIGN II
11837 ; Right # Mc DOGRA SIGN VISARGA
# Indic_Positional_Category=Left
1182D ; Left # Mc DOGRA VOWEL SIGN I
# Indic_Positional_Category=Top
11836 ; Top # Mn DOGRA SIGN ANUSVARA
11832..11835 ; Top # Mn [4] DOGRA VOWEL SIGN E..DOGRA VOWEL SIGN AU
# Indic_Positional_Category=Bottom
1182F..11831 ; Bottom # Mn [3] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN VOCALIC R
11838 ; Bottom # Mn DOGRA SIGN VIRAMA
11839 ; Bottom # Mn DOGRA SIGN NUKTA
4.5 Script extensions
The following characters should be extended for usage with the Dogra script in ScriptExtensions.txt:
0964 # Po DEVANAGARI DANDA
0965 # Po DEVANAGARI DOUBLE DANDA
0966..096F # Nd [10] DEVANAGARI DIGIT ZERO..DEVANAGARI DIGIT NINE
A830..A835 # No [6] NORTH INDIC FRACTION ONE QUARTER..NORTH INDIC FRACTION THREE SIXTEENTHS
A836..A837 # So [2] NORTH INDIC QUARTER MARK..NORTH INDIC PLACEHOLDER MARK
A838 # Sc NORTH INDIC RUPEE MARK
A839 # So NORTH INDIC QUANTITY MARK
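A simple membership test shows how these extensions combine with the Dogra block itself. The Python sketch below assumes the block range 11800 to 1184F given in the code chart and the extension ranges listed above; `usable_in_dogra_text` is a hypothetical name, and the check is only a range test, not a lookup in the Unicode Character Database.

```python
DOGRA_BLOCK = range(0x11800, 0x11850)  # 11800..1184F, as in the code chart

# Characters proposed to be extended for use with Dogra.
EXTENDED_FOR_DOGRA = (
    range(0x0964, 0x0966),  # DEVANAGARI DANDA, DEVANAGARI DOUBLE DANDA
    range(0x0966, 0x0970),  # DEVANAGARI DIGIT ZERO..DEVANAGARI DIGIT NINE
    range(0xA830, 0xA83A),  # Common Indic Number Forms, A830..A839
)


def usable_in_dogra_text(ch: str) -> bool:
    """True if ch is in the Dogra block or in one of the extended ranges."""
    cp = ord(ch)
    return cp in DOGRA_BLOCK or any(cp in r for r in EXTENDED_FOR_DOGRA)


for ch in ("\U0001180A", "\u0964", "\u0967", "\uA838", "\u0970", "A"):
    print("U+%04X" % ord(ch), usable_in_dogra_text(ch))
```

U+0970 DEVANAGARI ABBREVIATION SIGN is deliberately outside the extended ranges, since the proposal encodes a separate DOGRA ABBREVIATION SIGN.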
4.6 ‘Confusable’ characters
Some Dogra characters that resemble characters encoded in other script blocks are:
DOGRA LETTER U             U+11187 SHARADA LETTER U
DOGRA LETTER UU            U+11188 SHARADA LETTER UU
DOGRA LETTER O             U+11684 TAKRI LETTER U
DOGRA LETTER KA            U+0A15 GURMUKHI LETTER KA
DOGRA LETTER CA            U+11196 SHARADA LETTER CA
DOGRA LETTER TTHA          U+1119C SHARADA LETTER TTHA
DOGRA LETTER TA            U+11699 TAKRI LETTER TA
DOGRA LETTER DHA           U+11680 TAKRI LETTER A
DOGRA LETTER PHA           U+1169F TAKRI LETTER PHA
DOGRA LETTER BHA           U+116A1 TAKRI LETTER BHA
DOGRA LETTER YA            U+116A3 TAKRI LETTER YA
DOGRA LETTER SHA           U+0936 DEVANAGARI LETTER SHA
DOGRA LETTER SSA           U+111B0 SHARADA LETTER SSA
DOGRA VOWEL SIGN U         U+0941 DEVANAGARI VOWEL SIGN U
DOGRA VOWEL SIGN UU        U+0942 DEVANAGARI VOWEL SIGN UU
DOGRA VOWEL SIGN E         U+116B2 TAKRI VOWEL SIGN E
DOGRA VOWEL SIGN AI        U+116B3 TAKRI VOWEL SIGN AI
DOGRA VOWEL SIGN O         U+116B4 TAKRI VOWEL SIGN O
DOGRA VOWEL SIGN AU        U+116B5 TAKRI VOWEL SIGN AU
DOGRA SIGN ANUSVARA        U+0902 DEVANAGARI SIGN ANUSVARA
DOGRA SIGN VIRAMA          U+094D DEVANAGARI SIGN VIRAMA
DOGRA SIGN NUKTA           U+093C DEVANAGARI SIGN NUKTA
DOGRA ABBREVIATION SIGN    U+0970 DEVANAGARI ABBREVIATION SIGN
5 References
Bhāskarācārya. 1872. Līlavatī. Translation into Dogri. Jammu: Vidya Vilas Press.
Grierson, George A. 1916. The Linguistic Survey of India. Vol. IX. Indo-Aryan Family. Central Group. Part I. Specimens of Western Hindī and Pañjābī. Calcutta: Office of the Superintendent of Government Printing, India.
Leitner, Gottlieb William. 1882. “A Collection of Specimens of Commercial and Other Alphabets and Handwritings as also of Multiplication Tables Current in Various Parts of the Panjab, Sind and the North West Provinces”. Published as Appendix V of History of Indigenous Education in the Punjab. Lahore: Anjuman-i-Punjab Press.
Pandey, Anshuman. 2007. “Proposal to Encode North Indic Number Forms in ISO/IEC 10646” (L2/07-354). http://www.unicode.org/L2/L2007/07354-north-indic.pdf
Pandey, Anshuman. 2009. “Proposal to Encode the Takri Script in ISO/IEC 10646” (L2/09-424). http://www.unicode.org/L2/L2009/09424-takri.pdf
Pandey, Anshuman. 2015. “Preliminary Proposal to Encode the Dogra Script in Unicode” (L2/15-213). http://www.unicode.org/L2/L2015/15213-dogra.pdf
Staal, Fritz. 1984. Stamps of Jammu & Kashmir. New York: The Collectors Club.
von der Lin, Carol A. “Collection Kashmir”. “Philatelic Glossary: Dōgrī-Takrī Scripts”. http://www.kashmirstamps.com/DogriGloss.html
6 Acknowledgments
I would like to thank Christopher Shackle (School of Oriental and African Studies, London) for reviewing this proposal.
This project was made possible in part through a Google Research Award, granted to Deborah Anderson for the Script Encoding Initiative, and a grant from the United States National Endowment for the Humanities (PR-50205-15), which funds the Universal Scripts Project (part of the Script Encoding Initiative at the University of California, Berkeley). Any views, findings, conclusions, or recommendations expressed in this publication do not necessarily reflect those of Google or the National Endowment for the Humanities.
Dogra code chart, range 11800–1184F (columns 1180 to 1184, rows 0 to F). Characters are assigned at 11800–1183A.
Printed using UniBook™ (http://www.unicode.org/unibook/), 04-Nov-2015.
Dogra names list, 11800–1183A

Punctuation
1183A DOGRA ABBREVIATION SIGN

Independent vowels
11800 DOGRA LETTER A
11801 DOGRA LETTER AA
11802 DOGRA LETTER I
11803 DOGRA LETTER II
11804 DOGRA LETTER U
11805 DOGRA LETTER UU
11806 DOGRA LETTER E
11807 DOGRA LETTER AI
11808 DOGRA LETTER O
11809 DOGRA LETTER AU

Consonants
1180A DOGRA LETTER KA
1180B DOGRA LETTER KHA
1180C DOGRA LETTER GA
1180D DOGRA LETTER GHA
1180E DOGRA LETTER NGA
1180F DOGRA LETTER CA
11810 DOGRA LETTER CHA
11811 DOGRA LETTER JA
11812 DOGRA LETTER JHA
11813 DOGRA LETTER NYA
11814 DOGRA LETTER TTA
11815 DOGRA LETTER TTHA
11816 DOGRA LETTER DDA
11817 DOGRA LETTER DDHA
11818 DOGRA LETTER NNA
11819 DOGRA LETTER TA
1181A DOGRA LETTER THA
1181B DOGRA LETTER DA
1181C DOGRA LETTER DHA
1181D DOGRA LETTER NA
1181E DOGRA LETTER PA
1181F DOGRA LETTER PHA
11820 DOGRA LETTER BA
11821 DOGRA LETTER BHA
11822 DOGRA LETTER MA
11823 DOGRA LETTER YA
11824 DOGRA LETTER RA
11825 DOGRA LETTER LA
11826 DOGRA LETTER VA
11827 DOGRA LETTER SHA
11828 DOGRA LETTER SSA
11829 DOGRA LETTER SA
1182A DOGRA LETTER HA
1182B DOGRA LETTER RRA

Dependent vowel signs
1182C DOGRA VOWEL SIGN AA
1182D DOGRA VOWEL SIGN I
1182E DOGRA VOWEL SIGN II
1182F DOGRA VOWEL SIGN U
11830 DOGRA VOWEL SIGN UU
11831 DOGRA VOWEL SIGN VOCALIC R
11832 DOGRA VOWEL SIGN E
11833 DOGRA VOWEL SIGN AI
11834 DOGRA VOWEL SIGN O
11835 DOGRA VOWEL SIGN AU

Various signs
11836 DOGRA SIGN ANUSVARA
11837 DOGRA SIGN VISARGA
11838 DOGRA SIGN VIRAMA
11839 DOGRA SIGN NUKTA
Proposal to encode the Dogra script in Unicode Anshuman Pandey
Figure 1 Comparison of Old Dogra with Takri and other scripts (from Leitner 1882 Set 2)
18
Proposal to encode the Dogra script in Unicode Anshuman Pandey
Figure 2 Comparison of Old Dogra with Takri and other scripts (from Leitner 1882 Set 2)
19
Proposal to encode the Dogra script in Unicode Anshuman Pandey
Figure 3 Specimen of the Dogri language written in Old Dogra (from Grierson 1916 760)
20
Proposal to encode the Dogra script in Unicode Anshuman Pandey
Figure 4 Another specimen of the Dogri language written in Old Dogra (fromGrierson 1916 772)
21
Proposal to encode the Dogra script in Unicode Anshuman Pandey
Figure 5 Chart of the Old Dogra script (from Grierson 1916 641)
22
Proposal to encode the Dogra script in Unicode Anshuman Pandey
Figure 6 Comparison of Gurmukhi Kangra and Old Dogra scripts (from Grierson 1916 642)
23
Proposal to encode the Dogra script in Unicode Anshuman Pandey
Figure 7 Comparison of New Dogra (lsquoDogrirsquo) and Devanagari vowel and consonant letters (fromStaal 1984 33)
Figure 8 Comparison of New Dogra (lsquoDogrirsquo) and Devanagari vowel signs (from Staal 1984 34)
24
Proposal to encode the Dogra script in Unicode Anshuman Pandey
Figure 9 The first page of a Dogri translation of the Līlavatī by Bhāskarācārya a Sanskrit treastiseon mathematics printed in the New Dogra script in 1872 (p 1) Figure 10 contains the second pageof this book
25
Proposal to encode the Dogra script in Unicode Anshuman Pandey
Figure 10 Another page from the Līlavatī in New Dogra (p 2)
26
Proposal to encode the Dogra script in Unicode Anshuman Pandey
Figure 11 Page from the Līlavatī showing the use of printed New Dogra digits (p 3)
27
Proposal to encode the Dogra script in Unicode Anshuman Pandey
Figure 12 Usage of fractions for denoting currency in New Dogra (from Staal 1984 75) The textappears to be typeset using metal fonts The fraction signs are currency marks may be representedusing characters from the lsquoCommon Indic Number Formsrsquo block (U+A830) in Unicode
28
Proposal to encode the Dogra script in Unicode Anshuman Pandey
Figure 13 Two specimens of a trilingual stamp for expedited mail from Jammu State in Old DograPersian and Devanagari scripts The Dogra text reads 716907169071717 7169771716717277171671726 kakal jarurī The Persian readsضروری خط ḵẖat zarūrī the Devanagari reads āvaśyaka patraआव यक प
Figure 14 Four bilingual regular postage stamps from Jammu State with text in the Old Dogra andPersian scripts Stamps and are half lsquoannarsquo stamps and is a one lsquoannarsquo stamp In all stampsthe Dogra text at the top reads 716977171471684 jamu lsquoJammursquo and 7169071696717147172671716 kachamīr (= kaśmīr) lsquoKashmirrsquoIn stamps and the first line of the center is Persian انه نم ۱۹۲۳ 1923 [samvat] nim ānah lsquohalfānnārsquo the second line of the center is Dogra71680717077168071709 ada ana lsquohalf ānnārsquo In the first line of thecenter text is Persian انه یك yak ānah lsquoone ānnārsquo the second line of the center is Dogra71686716907168071709 ekana lsquoone ānnārsquo In all four stamps the third line of the center is the date 1923 [samvat] written inDevanagari-like digits १९२३ The Persian text at the bottom of the stamp reads جمون سرکار قلمروکشمیر و qalamrau sarkār jammūn o kaśmīr lsquodominions of the ruler of Jammu and Kashmirrsquo
Detail of top left (Devanagari)
Detail of top right (New Dogra)
Detail of center (New Dogra Devanagari and Urdu)
Figure 15: The mast of a two rupee non-judicial stamp paper from Kashmir State.
Figure 16: The mast of a five rupee non-judicial stamp paper from Kashmir State containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.
Figure 17: A one anna stamp from Jammu and Kashmir State with text in New Dogra, Devanagari, and Arabic (Urdu).
Figure 18: Telegraph stamps from Jammu and Kashmir State with text in Dogra, Devanagari, Urdu, and English. The Dogra denominations show usage of Common Indic fractions and the rupee mark.
postakarda kalamrau jammu kasmir va tibbataha “Postcard of the realm of Jammu, Kashmir and Tibet”
isa tarapha siva pate ke kucha na likho “Do not write anything on this side except for the address”
jamu sinagar “Jammu” “Srinagar”
bhaisakh (4 ) 4 vaisakh “Vaisakh 4” “4 Vaisakh”
Figure 19: A state postcard from Jammu and Kashmir with postmark of 19 April 1891. The inscription at top is in New Dogra and the content is in Persian. The postmarks are in English and Dogra.
One rupee banknote
Twenty rupee banknote
Figure 20: Bank notes from Jammu and Kashmir State containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.
Chamba  Dogra
71296   71680
71297   71681
71298   71682
71299   -
71300   71684
71301   -
71302   71686
71303   71687
71304   71688
71305   -
Chamba Dogra
no dependent form
71341 71724
71342 71725
71343 71726
71344 71727
71345 71728
71346 71730
71347 71731
71348 71732
71349 71733
Table 2: Comparison of Chamba and Dogra vowel letters and signs.
Chamba Dogra
71306 71690
71307 71691
71308 71692
71309 71693
71310 71694
71311 71695
71312 71696
71313 71697
71314 71698
71315 71699
71316 71700
71317 71701
71318 71702
71319 71703
71320 71704
71321 71705
71322 71706
Chamba Dogra
71323 71707
71324 71708
71325 71709
71326 71710
71327 71711
71328 71712
71329 71713
71330 71714
71331 71715
71332 71716
71333 71717
71334 71718
71335 71719
- 71720
71336 71721
71337 71722
71338 71723
Table 3: Comparison of Chamba and Dogra consonant letters.
ISO/IEC JTC 1/SC 2/WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS
FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646
Please fill all the sections A, B and C below. Please read Principles and Procedures Document (P & P) from http://std.dkuug.dk/JTC1/SC2/WG2/docs/principles.html for guidelines and details before filling this form. Please ensure you are using the latest Form from http://std.dkuug.dk/JTC1/SC2/WG2/docs/summaryform.html.
See also http://std.dkuug.dk/JTC1/SC2/WG2/docs/roadmaps.html for latest Roadmaps.
A. Administrative
1. Title: Proposal to encode the Dogra script in Unicode
2. Requester's name: Anshuman Pandey, Script Encoding Initiative
3. Requester type (Member body/Liaison/Individual contribution): Liaison contribution
4. Submission date: 4 November 2015
5. Requester's reference (if applicable):
6. Choose one of the following: This is a complete proposal: Yes; (or) More information will be provided later:
B. Technical – General
1. Choose one of the following:
a. This proposal is for a new script (set of characters): Yes. Proposed name of script: Dogra
b. The proposal is for addition of character(s) to an existing block: Name of the existing block:
2. Number of characters in proposal: 59
3. Proposed category (select one from below; see section 2.2 of P&P document): A-Contemporary; B.1-Specialized (small collection); B.2-Specialized (large collection); C-Major extinct; D-Attested extinct; E-Minor extinct: X; F-Archaic Hieroglyphic or Ideographic; G-Obscure or questionable usage symbols
4. Is a repertoire including character names provided? Yes
a. If YES, are the names in accordance with the “character naming guidelines” in Annex L of P&P document?
b. Are the character shapes attached in a legible form suitable for review?
5. Fonts related:
a. Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the standard? Anshuman Pandey
b. Identify the party granting a license for use of the font by the editors (include address, e-mail, ftp-site, etc.): Anshuman Pandey
6. References:
a. Are references (to other character sets, dictionaries, descriptive texts etc.) provided? Yes
b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? Yes
7. Special encoding issues: Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration etc. (if yes please enclose information)? Yes
8. Additional Information: Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org for such information on other scripts. Also see Unicode Character Database (http://www.unicode.org/reports/tr44/) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard.
Form number: N4502-F (Original 1994-10-14; Revised 1995-01, 1995-04, 1996-04, 1996-08, 1999-03, 2001-05, 2001-09, 2003-11, 2005-01, 2005-09, 2005-10, 2007-03, 2008-05, 2009-11, 2011-03, 2012-01)
C. Technical - Justification
1. Has this proposal for addition of character(s) been submitted before? No. If YES, explain:
2. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? Yes. If YES, with whom? Christopher Shackle (SOAS, London). If YES, available relevant documents:
3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? No. Reference: Historical script, currently not used
4. The context of use for the proposed characters (type of use; common or rare): Common. Reference:
5. Are the proposed characters in current use by the user community? No. If YES, where? Reference:
6. After giving due considerations to the principles in the P&P document must the proposed characters be entirely in the BMP? N/A. If YES, is a rationale provided? If YES, reference:
7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? Yes
8. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? No. If YES, is a rationale for its inclusion provided? If YES, reference:
9. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? No. If YES, is a rationale for its inclusion provided? If YES, reference:
10. Can any of the proposed character(s) be considered to be similar (in appearance or function) to, or could be confused with, an existing character? No. If YES, is a rationale for its inclusion provided? If YES, reference:
11. Does the proposal include use of combining characters and/or use of composite sequences? Yes. If YES, is a rationale for such use provided? Yes. If YES, reference: See text of proposal. Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided? If YES, reference:
12. Does the proposal contain characters with any special properties such as control function or similar semantics? Yes. If YES, describe in detail (include attachment if necessary): Virama; see text of proposal
13. Does the proposal contain any Ideographic compatibility characters? No. If YES, are the equivalent corresponding unified ideographic characters identified? If YES, reference:
the ‘New Dogra Script’. It was the official script of the State of Jammu and Kashmir during the reign of Maharaja Ranbir Singh (r. 1857–1885), who is the ruler referred to by Grierson. The New Dogra script was used for administrative purposes and upon currency, judicial and non-judicial stamp papers, postcards, and postage stamps. It was also used for literary activities. The first book published in the standard Dogra was a translation of a Sanskrit mathematical treatise by Bhāskarācārya titled Līlavatī. The work was commissioned by Ranbir Singh and printed in 1872 at Vidya Vilas Press, the first printing press in Jammu.
The old and standard forms of Dogra differ primarily in the shapes of letterforms and the representation of consonant clusters. Specific differences are described throughout this proposal. As mentioned by Grierson, standard Dogra was influenced by Devanagari and Gurmukhi. The motivation behind such a move is not clear, but it could be related to the idea of providing the Dogra script with an air of ‘authority’ by giving it a form that resembled scripts which were already endowed with ‘official’ and ‘literary’ attributes. The standardization of Dogra in the 1860s established two distinctive forms of the script. Throughout this document the common written form is called ‘Old Dogra’ and the standard form is referred to as ‘New Dogra’.
The Dogra script is no longer used actively and the Dogri language is now generally written in Devanagari. Nonetheless, the script continues to attract the interest of philatelists who collect ephemera from Jammu and Kashmir. Although there is very little scholarship on the script, it was described briefly by the prominent Sanskrit scholar Fritz Staal in his Stamps of Jammu & Kashmir (New York: The Collectors Club, 1984). Information on Dogra is also curated on a philately website called “Collecting Kashmir” maintained by Carol von der Lin.
3 Proposed Encoding
3.1 Approach
The justification for encoding Dogra as a separate script in Unicode is based upon the following criteria: 1) graphical differences between Dogra, Chamba, and other forms of Takri; 2) usage of the script in an official capacity; 3) cultural and political recognition of it as a distinctive script; 4) differentiation into ‘old’ and ‘standard’ forms. Of these, the fourth point requires a means for handling the internal distinctions of the script. A separate block for Dogra would provide a means to maintain the script’s external identity from other related scripts and to also maintain the identity of styles internally.
3.2 Structure
The Dogra script is a Brahmi-based alphasyllabary that is written from left to right. Independent and initial vowels are written using letters, while dependent vowels are expressed using combining signs. Consonant letters possess the inherent vowel a, which is changed by the attachment of a vowel sign. Consonant clusters are written in various ways, such as with visible halanta, as an atomic ligature, and with half-forms; repha is attested but rarely used. There are no formal conventions regarding representation of conjuncts.
3.3 Script name
The designation ‘Dogra’ has been assigned to the script block in Unicode. The script is referred to as both ‘Dogra’ and ‘Dogri’ in English secondary sources. The name ‘Dogra’ is preferred over ‘Dogri’ because it links the script with the Dogra dynasty of Jammu and Kashmir, under whose government it was established as the official script. Secondly, ‘Dogri’ suggests a connection with the language of the same name; however, the language is no longer written in the script. The script was officially referred to as dvigarta akṣara or “Dogra letters”, where ‘Dvigarta’ is presented as the Sanskritic root of the name ‘Dogra’. Another indigenous name for the script is name dogra akkhar or “New Dogra letters”.
3.4 Representative glyphs
The representative glyphs for Dogra characters are based primarily upon forms used in New Dogra. Characters from Old Dogra have been used when necessary, sourced primarily from figures 1 and 2. The font has been designed by the proposal author.
3.5 Vowel letters
There are 10 vowel letters:
71680
71681
71682
71683
71684
71685
71686
71687
71688
71689
In Old Dogra the values of the vowel letters 71683, 71682, and are variable. As shown in the accompanying sources, the letter 71683 was used for both i and ī, 71682 for u and ū, and for u, ū, o, and au. In New Dogra the values of these three letters became fixed and new letters were introduced in order to provide distinctive representations for all basic vowels. It is apparent that the attempt to stabilize the vowel repertoire resulted in overlap and reassignment of several letters. The Old Dogra 71683 (i, ī) was retained solely for ī and 71682 (u, ū) was reassigned for i. A modified form of (u, o) came to represent u, while the form 71685 was introduced for ū, which is certainly an adaptation of the letter ū from Devanagari or Sharada. Distinct letters for 71688 o and 71689 au were provided by placing the corresponding dependent vowel signs upon the modified u.
Old New
71683 71682
mdash 71683
71682 71684
mdash 71685
71688
71689
The proposed vowel letters are based upon the repertoire of New Dogra. This approach completely accommodates vowels in the old and new forms of the script. Although the names of New Dogra vowel letters do not strictly correlate with values of Old Dogra vowels, the proposed letters can be used for representing the old script. The 71682 can be used for u, 71683 can be used for i, and 71684 can be used for o and au; the height and curvature of the left stroke may be considered a stylistic attribute.
3.6 Vowel Signs
There are 10 dependent vowel signs:
71724
71725
71726
71727
71728
71729
71730
71731
71732
71733
The 71729 appears to have been introduced as part of New Dogra. It is used for representing words of Sanskrit origin that contain the vocalic sound r. Its shape is derived from either U+11190 or U+0943. There is no corresponding independent vowel letter.
There is an alphabetic aspect to the representation of consonant-vowel syllables in written Dogra. The dependent vowel may be represented using the independent vowel letter as well as the vowel sign, e.g. 7169071686 may be used in place of 7169071730 for the syllable ke.
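The equivalence between the two spellings can be illustrated with a minimal Python sketch. It assumes the code points listed in section 4.1 of this proposal (which may differ from any published assignment), and the function name `fold_vowel_letters` is hypothetical. It replaces an independent vowel letter that directly follows a consonant with the corresponding vowel sign, which could be useful for search folding; it is not a normalization defined by the proposal.

```python
# Code points are taken from section 4.1 of this proposal (an assumption
# about the encoding; published assignments may differ).
CONSONANTS = range(0x1180A, 0x1182C)  # KA .. RRA

# Independent vowel letter -> dependent vowel sign.
# LETTER A (0x11800) has no dependent form and is left untouched.
LETTER_TO_SIGN = {
    0x11801: 0x1182C,  # AA
    0x11802: 0x1182D,  # I
    0x11803: 0x1182E,  # II
    0x11804: 0x1182F,  # U
    0x11805: 0x11830,  # UU
    0x11806: 0x11832,  # E
    0x11807: 0x11833,  # AI
    0x11808: 0x11834,  # O
    0x11809: 0x11835,  # AU
}


def fold_vowel_letters(text: str) -> str:
    """Replace a vowel letter that directly follows a consonant with its sign."""
    out = []
    for ch in text:
        prev = out[-1] if out else ""
        if prev and ord(prev) in CONSONANTS and ord(ch) in LETTER_TO_SIGN:
            out.append(chr(LETTER_TO_SIGN[ord(ch)]))
        else:
            out.append(ch)
    return "".join(out)


# The syllable "ke" written alphabetically (KA + LETTER E) folds to
# KA + VOWEL SIGN E, i.e. 71690 71686 becomes 71690 71730.
ke_alphabetic = chr(71690) + chr(71686)
ke_with_sign = chr(71690) + chr(71730)
```

A word-initial vowel letter is not preceded by a consonant and is therefore left unchanged by this sketch.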
There are differences in the shape of certain vowel signs between the old and new script. The typical Takri forms of the vowel signs -u and -ū, which also resemble corresponding forms in Gurmukhi, were replaced with forms resembling Devanagari signs, 71727 and 71728 respectively. The typical Takri style of the vowel sign o was applied for writing 71733 au, while a simplified form of o, 71732, was used for o. The duplication of o as for writing au was eliminated. A comparison of vowel signs in the old and new scripts, as they align with the proposed characters, is as follows:
Old New
71725
71726
71727
71728
71732
71733
3.7 Consonants
There are 34 consonant letters:
71690
71691
71692
71693
71694
71695
71696
71697
71698
71699
71700
71701
71702
71703
71704
71705
71706
71707
71708
71709
71710
71711
71712
71713
71714
71715
71716
71717
71718
71719
71720
71721
71722
71723
The 71720 was introduced into New Dogra for representing the retroflex sibilant ṣa, which occurs in words of Sanskrit origin. It resembles 70064 U+111B0.
There are glyphic variants of several consonant letters:
Regular Variant
71692
71693
71696
71698
71706
71708
71716
There is evidence of usage of different forms of the same letter in the available sources. For instance, 71696 and are both used for
3.8 Anusvara
The 71734 is used for marking nasalization.
3.9 Visarga
The 71735 is used for indicating post-vocalic aspiration in words of Sanskrit origin. It is shown below in the word 717097171471735 namaḥ “homage to”.
3.10 Nukta
The 71737 is used for representing sounds that are not native to Dogri and related languages.
3.11 Virama
The 71736 has two functions, similar to the corresponding character of Devanagari. It is used as a halanta for marking the absence of the inherent vowel of a consonant letter. It is also a control character that is used for producing conjuncts.
3.12 Consonant Conjuncts
Consonant clusters are rendered in three ways: 1) with visible halanta beneath the full form of each bare consonant; 2) with a half-form of the initial consonant followed by the full form of the following letter; and 3) as ligatures. There are no formal conventions for the written expression of conjuncts. The 71736 is used for the encoded representation of conjuncts.
3.12.1 Types of conjuncts
Some representations of Dogra conjuncts, as found in the available sources, are illustrated below.
• Visible halanta: In both Old and New Dogra the most common method of representing consonant clusters is by placing a halanta beneath each bare consonant.
• Half-forms: Another method is to use half-forms of letters, if the graphical structure provides such an opportunity.
The above shows the conjunct 71716 śra in the word 7171671726 śrī (a Sanskrit honorific). The half form is produced by dropping the right vertical bar of 71719. The conjunct 71716 is used consistently in New Dogra for representing the cluster śra; the form 717197173671716, with visible halanta beneath a full form of 71719, does not occur in the available sources.
• Ligatures: There are two types of ligature. In the first, the shapes of individual letters are visible. In the excerpt below, the boxed text contains the conjunct sṭa from the word 717347172471710 sṭāṃp, a transcription of the English “stamp”.
The second type is an atomic (akhaṇḍa) ligature, whose form may be considered a distinctive letter. Such a ligature occurs in the excerpt below.
The boxed text above contains the atomic ligature kṣa, which occurs in the word 7168071716 akṣaroṃ “in letters” (oblique form of 7168071716 akṣara “letter”). The conjunct kṣa is used consistently in New Dogra for representing the cluster kṣa. Its shape may be influenced by Devanagari kṣa.
• Forms of ra: Special conjunct forms of 71716 do not generally occur, e.g. repha or vattu. Cluster-initial ra is represented as a full form with halanta, as in rṇa in the word saṃpūrṇa below,
and non-initial ra is treated similarly, as in pra in the word prabhu below.
However, there is one instance in the available sources in which cluster-initial ra is represented as a repha. The occurrence is curious because the word caturbhuja “square”, in which repha occurs, appears three times in proximity but is spelled differently in all three instances.
In the boxed text at the top, the ra in caturbhuja is written with halanta: 7169571705717277171671736717137172771697 caturbhuja. In the text at bottom left, ra occurs with its inherent vowel intact, without halanta: 71695717057172771716717137172771697 caturabhuja. At bottom right, ra is represented as repha: 716957170571727717137172771697 caturbhuja.
The usage of repha in Dogra is to be considered anomalous. There is no present need to support this form of ra in encoded representation.
3.12.2 Encoded representation of conjuncts
The Devanagari model for the representation of conjuncts is appropriate for Dogra. In encoded text, conjuncts are produced by placing the control character 71736 after each non-final consonant in a cluster, i.e. <(C 71736) C>.
Representations of conjuncts rendered as atomic ligatures are:
kṣa <71690 71736 71720>
sṭa <71721 71736 71700>
Examples of conjuncts rendered using half-forms are:
śra 71716 <71719 71736 71716>
Conjuncts displayed using visible halanta are:
pra 717107173671716 <71710 71736 71716>
rṇa 717167173671704 <71710 71736 71704>
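The encoded sequences listed above can be produced with a short Python sketch. It assumes the code points given in section 4.1 of this proposal (published assignments may differ), and the helper name `encode_cluster` is hypothetical. The helper places the virama after every consonant except the last, following the pattern <(C 71736) C>.

```python
# Code points as listed in section 4.1 of this proposal (assumed values).
VIRAMA = 0x11838  # 71736, DOGRA SIGN VIRAMA

KA, SSA, SA, TTA = 0x1180A, 0x11828, 0x11829, 0x11814
SHA, RA, PA, NNA = 0x11827, 0x11824, 0x1181E, 0x11818


def encode_cluster(consonants):
    """Join consonant code points, inserting VIRAMA after each non-final one."""
    parts = []
    last = len(consonants) - 1
    for i, cp in enumerate(consonants):
        parts.append(chr(cp))
        if i < last:
            parts.append(chr(VIRAMA))
    return "".join(parts)


def code_points(text):
    """Return the decimal code points of a string, as printed in this proposal."""
    return [ord(ch) for ch in text]


ksa = encode_cluster([KA, SSA])   # atomic ligature in New Dogra
sta = encode_cluster([SA, TTA])   # ligature with visible components
shra = encode_cluster([SHA, RA])  # half-form
pra = encode_cluster([PA, RA])    # visible halanta
```

Whether a given sequence is displayed as a ligature, with a half-form, or with visible halanta is decided by the font, not by the encoded text.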
3.12.3 Modifying conjunct formation
As is the case in Devanagari and other Indic scripts, a ligature may be broken using the generic control character U+200C ZERO WIDTH NON-JOINER:
kṣa <71690 71736 71720>
kṣa 716907173671720 <71690 71736 ZWNJ 71720>
The rendering of a conjunct as a ligature or with visible halanta is a matter of font design.
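As an illustration of the mechanism, the following Python sketch inserts U+200C after each virama that is followed by a consonant, which requests the visible-halanta form instead of a ligature. The Dogra code points are those of section 4.1 of this proposal (assumed values), and the function name `break_ligatures` is hypothetical.

```python
# Dogra code points follow section 4.1 of this proposal (assumed values).
VIRAMA = "\U00011838"  # DOGRA SIGN VIRAMA
ZWNJ = "\u200C"        # ZERO WIDTH NON-JOINER
FIRST_CONSONANT, LAST_CONSONANT = 0x1180A, 0x1182B  # KA .. RRA


def break_ligatures(text: str) -> str:
    """Insert ZWNJ after every VIRAMA that is directly followed by a consonant."""
    out = []
    for i, ch in enumerate(text):
        out.append(ch)
        nxt = text[i + 1] if i + 1 < len(text) else ""
        if ch == VIRAMA and nxt and FIRST_CONSONANT <= ord(nxt) <= LAST_CONSONANT:
            out.append(ZWNJ)
    return "".join(out)


ksa = "\U0001180A\U00011838\U00011828"  # KA, VIRAMA, SSA
ksa_broken = break_ligatures(ksa)       # KA, VIRAMA, ZWNJ, SSA
```

The function leaves a word-final virama alone and does not add a second ZWNJ where one is already present, so applying it twice gives the same result as applying it once.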
3.13 Punctuation
Abbreviation sign: The 71738 is used for indicating truncation of text. In the excerpt below, the highlighted text 717177172671738 7171871738 717137172471738 (lī va bhā 23) is an abbreviation for līla vatī bhāga 23.
Danda-s: The daṇḍā and double daṇḍā are commonly used in Dogra documents for marking the ends of sentences and paragraphs. The below excerpt shows usage of the daṇḍā,
and the following shows use of the double daṇḍā.
These daṇḍā-s are not proposed for encoding as separate characters in the ‘Dogra’ block. They resemble U+0964 DEVANAGARI DANDA and U+0965 DEVANAGARI DOUBLE DANDA and may be represented using these characters. The Devanagari daṇḍā-s are to be specified as extensions for Dogra.
Other signs: Other marks of punctuation, such as dashes, are also used.
A hyphen is used in some cases for indicating breaks in a word at end of line (see section 3.16).
3.14 Digits
Digits are commonly used in Dogra documents (see figure 11). The style of digits varies across written and printed sources. Some digits in Old Dogra sources resemble those of Takri (see figure 4), while those in New Dogra sources are similar to Devanagari. The excerpt below shows digits 1–9 and the number 10 as represented in the Līlavatī, written in New Dogra.
The majority of the digits shown here resemble Devanagari forms; the exception is ‘four’. The form of Dogra ‘four’ does not resemble 71364 U+116C4, which is nearly identical to the Devanagari digit four. Information about its origins is not known at this time, but it may be related to 70100 U+111D4. The forms of ‘eight’ and ‘nine’ are well-known variant forms of the regular Devanagari digits.
Additional research into Dogra digits is required before they may be considered for encoding. The available sources suggest that there may have been a cultural understanding that Dogra possessed unique digits. For instance, on the bank notes shown in figure 20 the digits written in Devanagari are rather similar, but are nonetheless displayed twice in order to align the display of the currency value with each script used for the inscriptions.
At present, script-specific digits are not proposed for inclusion in the Dogra repertoire. Devanagari digits should be used normatively and should be extended for usage with Dogra. If future research yields distinctive digits for Dogra, there is sufficient space in the block to accommodate their encoding.
3.15 Number forms
Fraction signs and currency marks are commonly used in Dogra sources (see figure 12). These may be represented using the fraction and currency signs already encoded in the ‘Common Indic Number Forms’ block (U+A830). These characters are specified as extensions for Dogra. The ‘Common Indic Number Forms’ are described briefly in The Unicode Standard (chapter 22.3 ‘Numerals’, pp. 770–771) and in more detail in the original encoding proposal (L2/07-354).
3.16 Linebreaking
Linebreaking occurs after an orthographic syllable. Breaks are generally not indicated in informal sources, but are marked in literary materials using hyphens.
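To make the notion of an orthographic syllable concrete, here is a rough Python sketch that splits Dogra text into syllables with a regular expression. It is an approximation under stated assumptions, not a rule set from this proposal: the code points are those of section 4.1, the pattern is modelled loosely on the usual Indic structure (consonant cluster, optional vowel sign or final virama, optional anusvara or visarga), and the name `syllables` is hypothetical.

```python
import re

# Character classes built from the code points in section 4.1 (assumed values).
CONSONANT = "[\U0001180A-\U0001182B]\U00011839?"  # letter plus optional NUKTA
VIRAMA = "\U00011838"
VOWEL_SIGN = "[\U0001182C-\U00011835]"
MODIFIER = "[\U00011836\U00011837]"               # ANUSVARA, VISARGA
VOWEL_LETTER = "[\U00011800-\U00011809]"
ZWNJ = "\u200C"

SYLLABLE = re.compile(
    # (C VIRAMA ZWNJ?)* C, then a vowel sign or a final VIRAMA, then modifiers
    f"(?:{CONSONANT}{VIRAMA}{ZWNJ}?)*{CONSONANT}"
    f"(?:{VIRAMA}|{VOWEL_SIGN})?{MODIFIER}*"
    # or an independent vowel letter with optional modifiers
    f"|{VOWEL_LETTER}{MODIFIER}*"
)


def syllables(text: str) -> list:
    """Return the orthographic syllables found in text; a break may follow each."""
    return [m.group() for m in SYLLABLE.finditer(text)]


namah = chr(71709) + chr(71714) + chr(71735)             # NA, MA, VISARGA
shri = chr(71719) + chr(71736) + chr(71716) + chr(71726)  # SHA, VIRAMA, RA, II
```

Characters outside the Dogra classes (spaces, digits, punctuation) are skipped. Note that a consonant followed by an independent vowel letter, the alphabetic spelling described in section 3.6, is treated by this sketch as two syllables.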
3.17 Collation
The primary collating order for Dogra is as follows:
71680 < 71681 < 71682 < 71683 < 71684 < 71685 < 71686 < 71687 < 71688 < 71689 <
71690 < 71691 < 71692 < 71693 < 71694 < 71695 < 71696 < 71697 <
71698 < 71699 < 71700 < 71701 < 71702 < 71703 < 71704 < 71705 <
71706 < 71707 < 71708 < 71709 < 71710 < 71711 < 71712 < 71713 <
71714 < 71715 < 71716 < 71717 < 71718 < 71719 < 71720 < 71721 < 71722 <
71723 < 71724 < 71725 < 71726 <
71727 < 71728 < 71729 < 71730 <
71731 < 71732 < 71733
The following characters have secondary weights: 71734, 71735, 71736, 71737.
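A two-level sort key along these lines could be sketched in Python as follows. This is a simplification and not an implementation of the Unicode Collation Algorithm: it assumes that the primary order coincides with code point order over 71680–71733, as the list above shows, and it compares the secondary-weight characters only to break ties. The name `sort_key` is hypothetical.

```python
# Ranges follow the collation order above; code points are those proposed
# in section 4.1 (assumed values).
PRIMARY = range(71680, 71734)    # letters and vowel signs, in listed order
SECONDARY = range(71734, 71738)  # anusvara, visarga, virama, nukta


def sort_key(word: str):
    """Primary weights first; secondary-weight characters only break ties."""
    primary = tuple(ord(ch) for ch in word if ord(ch) in PRIMARY)
    secondary = tuple(ord(ch) for ch in word if ord(ch) in SECONDARY)
    return (primary, secondary)


ka = chr(71690)                # KA
kam = chr(71690) + chr(71734)  # KA + ANUSVARA
kha = chr(71691)               # KHA

ordered = sorted([kha, kam, ka], key=sort_key)
```

With this key, a word carrying an anusvara sorts immediately after the same word without it, and before any word that differs at the primary level.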
4 Character data
4.1 Character properties
The properties for Dogra in the Unicode Character Database format are:
11800;DOGRA LETTER A;Lo;0;L;;;;;N;;;;;
11801;DOGRA LETTER AA;Lo;0;L;;;;;N;;;;;
11802;DOGRA LETTER I;Lo;0;L;;;;;N;;;;;
11804;DOGRA LETTER U;Lo;0;L;;;;;N;;;;;
11806;DOGRA LETTER E;Lo;0;L;;;;;N;;;;;
11807;DOGRA LETTER AI;Lo;0;L;;;;;N;;;;;
11808;DOGRA LETTER O;Lo;0;L;;;;;N;;;;;
1180A;DOGRA LETTER KA;Lo;0;L;;;;;N;;;;;
1180B;DOGRA LETTER KHA;Lo;0;L;;;;;N;;;;;
1180C;DOGRA LETTER GA;Lo;0;L;;;;;N;;;;;
1180D;DOGRA LETTER GHA;Lo;0;L;;;;;N;;;;;
1180E;DOGRA LETTER NGA;Lo;0;L;;;;;N;;;;;
1180F;DOGRA LETTER CA;Lo;0;L;;;;;N;;;;;
11810;DOGRA LETTER CHA;Lo;0;L;;;;;N;;;;;
11811;DOGRA LETTER JA;Lo;0;L;;;;;N;;;;;
11812;DOGRA LETTER JHA;Lo;0;L;;;;;N;;;;;
11813;DOGRA LETTER NYA;Lo;0;L;;;;;N;;;;;
11814;DOGRA LETTER TTA;Lo;0;L;;;;;N;;;;;
11815;DOGRA LETTER TTHA;Lo;0;L;;;;;N;;;;;
11816;DOGRA LETTER DDA;Lo;0;L;;;;;N;;;;;
11817;DOGRA LETTER DDHA;Lo;0;L;;;;;N;;;;;
11818;DOGRA LETTER NNA;Lo;0;L;;;;;N;;;;;
11819;DOGRA LETTER TA;Lo;0;L;;;;;N;;;;;
1181A;DOGRA LETTER THA;Lo;0;L;;;;;N;;;;;
1181B;DOGRA LETTER DA;Lo;0;L;;;;;N;;;;;
1181C;DOGRA LETTER DHA;Lo;0;L;;;;;N;;;;;
1181D;DOGRA LETTER NA;Lo;0;L;;;;;N;;;;;
1181E;DOGRA LETTER PA;Lo;0;L;;;;;N;;;;;
1181F;DOGRA LETTER PHA;Lo;0;L;;;;;N;;;;;
11820;DOGRA LETTER BA;Lo;0;L;;;;;N;;;;;
11821;DOGRA LETTER BHA;Lo;0;L;;;;;N;;;;;
11822;DOGRA LETTER MA;Lo;0;L;;;;;N;;;;;
11823;DOGRA LETTER YA;Lo;0;L;;;;;N;;;;;
11824;DOGRA LETTER RA;Lo;0;L;;;;;N;;;;;
11825;DOGRA LETTER LA;Lo;0;L;;;;;N;;;;;
11826;DOGRA LETTER VA;Lo;0;L;;;;;N;;;;;
11827;DOGRA LETTER SHA;Lo;0;L;;;;;N;;;;;
11828;DOGRA LETTER SSA;Lo;0;L;;;;;N;;;;;
11829;DOGRA LETTER SA;Lo;0;L;;;;;N;;;;;
1182A;DOGRA LETTER HA;Lo;0;L;;;;;N;;;;;
1182B;DOGRA LETTER RRA;Lo;0;L;;;;;N;;;;;
1182C;DOGRA VOWEL SIGN AA;Mc;0;L;;;;;N;;;;;
1182D;DOGRA VOWEL SIGN I;Mc;0;L;;;;;N;;;;;
1182E;DOGRA VOWEL SIGN II;Mc;0;L;;;;;N;;;;;
1182F;DOGRA VOWEL SIGN U;Mn;0;NSM;;;;;N;;;;;
11830;DOGRA VOWEL SIGN UU;Mn;0;NSM;;;;;N;;;;;
11831;DOGRA VOWEL SIGN VOCALIC R;Mn;0;NSM;;;;;N;;;;;
11832;DOGRA VOWEL SIGN E;Mn;0;NSM;;;;;N;;;;;
11833;DOGRA VOWEL SIGN AI;Mn;0;NSM;;;;;N;;;;;
11834;DOGRA VOWEL SIGN O;Mn;0;NSM;;;;;N;;;;;
11835;DOGRA VOWEL SIGN AU;Mn;0;NSM;;;;;N;;;;;
11836;DOGRA SIGN ANUSVARA;Mn;0;NSM;;;;;N;;;;;
11837;DOGRA SIGN VISARGA;Mc;0;L;;;;;N;;;;;
11838;DOGRA SIGN VIRAMA;Mn;9;NSM;;;;;N;;;;;
11839;DOGRA SIGN NUKTA;Mn;7;NSM;;;;;N;;;;;
1183A;DOGRA ABBREVIATION SIGN;Po;0;L;;;;;N;;;;;
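Records in the Unicode Character Database are semicolon-delimited, with the code point, name, general category, canonical combining class, and bidirectional class in the first five fields. The following Python sketch reads those five fields from one record; the `Record` type and the `parse_record` function are hypothetical names used only for illustration, and the sample line is the virama entry from the list above.

```python
from typing import NamedTuple


class Record(NamedTuple):
    code_point: int
    name: str
    general_category: str
    combining_class: int
    bidi_class: str


def parse_record(line: str) -> Record:
    """Parse the first five fields of a UnicodeData.txt-style record."""
    fields = line.strip().split(";")
    return Record(
        code_point=int(fields[0], 16),   # hexadecimal code point
        name=fields[1],
        general_category=fields[2],
        combining_class=int(fields[3]),  # decimal canonical combining class
        bidi_class=fields[4],
    )


virama = parse_record("11838;DOGRA SIGN VIRAMA;Mn;9;NSM;;;;;N;;;;;")
```

The combining classes 9 (virama) and 7 (nukta) are the values conventionally used for these signs in other Indic scripts, which places a nukta before a virama under canonical ordering.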
4.2 Linebreaking
Linebreaking properties, given in the data format of LineBreak.txt:
11800..1182B;AL # Lo [44] DOGRA LETTER A..DOGRA LETTER RRA
1182C..1182E;CM # Mc [3] DOGRA VOWEL SIGN AA..DOGRA VOWEL SIGN II
1182F..11835;CM # Mn [7] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN AU
11836;CM # Mn DOGRA SIGN ANUSVARA
11837;CM # Mc DOGRA SIGN VISARGA
11838..11839;CM # Mn [2] DOGRA SIGN VIRAMA..DOGRA SIGN NUKTA
1183A;AL # Po DOGRA ABBREVIATION SIGN
4.3 Syllabic categories
Syllabic categories, given in the format of IndicSyllabicCategory.txt:
# Indic_Syllabic_Category=Bindu
11836 ; Bindu # Mn DOGRA SIGN ANUSVARA
# Indic_Syllabic_Category=Visarga
11837 ; Visarga # Mc DOGRA SIGN VISARGA
# Indic_Syllabic_Category=Nukta
11839 ; Nukta # Mn DOGRA SIGN NUKTA
# Indic_Syllabic_Category=Virama
11838 ; Virama # Mn DOGRA SIGN VIRAMA
# Indic_Syllabic_Category=Vowel_Independent
11800..11809 ; Vowel_Independent # Lo [10] DOGRA LETTER A..DOGRA LETTER AU
# Indic_Syllabic_Category=Vowel_Dependent
1182C..1182E ; Vowel_Dependent # Mc [3] DOGRA VOWEL SIGN AA..DOGRA VOWEL SIGN II
1182F..11835 ; Vowel_Dependent # Mn [7] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN AU
# Indic_Syllabic_Category=Consonant
1180A..1182B ; Consonant # Lo [34] DOGRA LETTER KA..DOGRA LETTER RRA
4.4 Positional categories
Positional data for characters, in the format of IndicPositionalCategory.txt:
# Indic_Positional_Category=Right
1182C ; Right # Mc DOGRA VOWEL SIGN AA
1182E ; Right # Mc DOGRA VOWEL SIGN II
11837 ; Right # Mc DOGRA SIGN VISARGA
# Indic_Positional_Category=Left
1182D ; Left # Mc DOGRA VOWEL SIGN I
# Indic_Positional_Category=Top
11836 ; Top # Mn DOGRA SIGN ANUSVARA
11832..11835 ; Top # Mn [4] DOGRA VOWEL SIGN E..DOGRA VOWEL SIGN AU
# Indic_Positional_Category=Bottom
1182F..11831 ; Bottom # Mn [3] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN VOCALIC R
11838 ; Bottom # Mn DOGRA SIGN VIRAMA
11839 ; Bottom # Mn DOGRA SIGN NUKTA
4.5 Script extensions
The following characters should be extended for usage with the Dogra script in ScriptExtensions.txt:
0964 # Po DEVANAGARI DANDA
0965 # Po DEVANAGARI DOUBLE DANDA
0966..096F # Nd [10] DEVANAGARI DIGIT ZERO..DEVANAGARI DIGIT NINE
A830..A835 # No [6] NORTH INDIC FRACTION ONE QUARTER..NORTH INDIC FRACTION THREE SIXTEENTHS
A836..A837 # So [2] NORTH INDIC QUARTER MARK..NORTH INDIC PLACEHOLDER MARK
A838 # Sc NORTH INDIC RUPEE MARK
A839 # So NORTH INDIC QUANTITY MARK
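In practice, a text-processing tool might combine the Dogra block with these extended characters when deciding whether a character belongs to Dogra text. The Python sketch below shows one way to do this; the block range 11800–1184F is taken from the code chart in this proposal, the extended ranges are the ones listed above, and the function name `usable_with_dogra` is hypothetical.

```python
# Block range from the code chart of this proposal; extended ranges from
# the script extensions listed above.
DOGRA_BLOCK = range(0x11800, 0x11850)
EXTENDED_RANGES = (
    range(0x0964, 0x0970),  # Devanagari danda, double danda, digits 0-9
    range(0xA830, 0xA83A),  # Common Indic Number Forms
)


def usable_with_dogra(ch: str) -> bool:
    """True if ch is in the Dogra block or is extended for use with Dogra."""
    cp = ord(ch)
    return cp in DOGRA_BLOCK or any(cp in r for r in EXTENDED_RANGES)


checks = {
    "danda": usable_with_dogra("\u0964"),
    "devanagari_digit_four": usable_with_dogra("\u096A"),
    "rupee_mark": usable_with_dogra("\uA838"),
    "dogra_ka": usable_with_dogra("\U0001180A"),
    "devanagari_ka": usable_with_dogra("\u0915"),
}
```

Devanagari letters such as U+0915 are not part of the extension set, so they are reported as not usable.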
4.6 ‘Confusable’ characters
Some Dogra characters that resemble characters encoded in other script blocks are:
DOGRA LETTER U    11187 SHARADA LETTER U
DOGRA LETTER UU    11188 SHARADA LETTER UU
DOGRA LETTER O    11684 TAKRI LETTER U
DOGRA LETTER KA    0A15 GURMUKHI LETTER KA
DOGRA LETTER CA    11196 SHARADA LETTER CA
DOGRA LETTER TTHA    1119C SHARADA LETTER TTHA
DOGRA LETTER TA    11699 TAKRI LETTER TA
DOGRA LETTER DHA    11680 TAKRI LETTER A
DOGRA LETTER PHA    1169F TAKRI LETTER PHA
DOGRA LETTER BHA    116A1 TAKRI LETTER BHA
DOGRA LETTER YA    116A3 TAKRI LETTER YA
DOGRA LETTER SHA    0936 DEVANAGARI LETTER SHA
DOGRA LETTER SSA    111B0 SHARADA LETTER SSA
DOGRA VOWEL SIGN U    0941 DEVANAGARI VOWEL SIGN U
DOGRA VOWEL SIGN UU    0942 DEVANAGARI VOWEL SIGN UU
DOGRA VOWEL SIGN E    116B2 TAKRI VOWEL SIGN E
DOGRA VOWEL SIGN AI    116B3 TAKRI VOWEL SIGN AI
DOGRA VOWEL SIGN O    116B4 TAKRI VOWEL SIGN O
DOGRA VOWEL SIGN AU    116B5 TAKRI VOWEL SIGN AU
DOGRA SIGN ANUSVARA    0902 DEVANAGARI SIGN ANUSVARA
DOGRA SIGN VIRAMA    094D DEVANAGARI SIGN VIRAMA
DOGRA SIGN NUKTA    093C DEVANAGARI SIGN NUKTA
DOGRA ABBREVIATION SIGN    0970 DEVANAGARI ABBREVIATION SIGN
5 References
Bhāskarācārya. 1872. Līlavatī. Translation into Dogri. Jammu: Vidya Vilas Press.
Grierson, George A. 1916. The Linguistic Survey of India. Vol. IX. Indo-Aryan Family, Central Group. Part I. Specimens of Western Hindī and Pañjābī. Calcutta: Office of the Superintendent of Government Printing, India.
Leitner, Gottlieb William. 1882. “A Collection of Specimens of Commercial and Other Alphabets and Handwritings as also of Multiplication Tables Current in Various Parts of the Panjab, Sind and the North West Provinces”. Published as Appendix V of History of Indigenous Education in the Punjab. Lahore: Anjuman-i-Punjab Press.
Pandey, Anshuman. 2007. “Proposal to Encode North Indic Number Forms in ISO/IEC 10646” (L2/07-354). http://www.unicode.org/L2/L2007/07354-north-indic.pdf
Pandey, Anshuman. 2009. “Proposal to Encode the Takri Script in ISO/IEC 10646” (L2/09-424). http://www.unicode.org/L2/L2009/09424-takri.pdf
Pandey, Anshuman. 2015. “Preliminary Proposal to Encode the Dogra Script in Unicode” (L2/15-213). http://www.unicode.org/L2/L2015/15213-dogra.pdf
Staal, Fritz. 1984. Stamps of Jammu & Kashmir. New York: The Collectors Club.
von der Lin, Carol A. “Collection Kashmir”. “Philatelic Glossary Dōgrī-Takrī Scripts”. httpwwwkashmirstampscomDogriGlosshtml
6 Acknowledgments
I would like to thank Christopher Shackle (School of Oriental and African Studies, London) for reviewing this proposal.
This project was made possible in part through a Google Research Award, granted to Deborah Anderson for the Script Encoding Initiative, and a grant from the United States National Endowment for the Humanities (PR-50205-15), which funds the Universal Scripts Project (part of the Script Encoding Initiative at the University of California, Berkeley). Any views, findings, conclusions or recommendations expressed in this publication do not necessarily reflect those of Google or the National Endowment for the Humanities.
Printed using UniBook™ (http://www.unicode.org/unibook/)
Printed: 04-Nov-2015
[Code chart: Dogra, 11800–1184F. Glyph grid with columns 1180–1184 and rows 0–F, covering code points 11800–1183A; the glyph images are not recoverable here.]
Printed using UniBooktrade(httpwwwunicodeorgunibook)
Printed 04-Nov-2015 2
1183ADogra11800
Independent vowels
11800  DOGRA LETTER A
11801  DOGRA LETTER AA
11802  DOGRA LETTER I
11803  DOGRA LETTER II
11804  DOGRA LETTER U
11805  DOGRA LETTER UU
11806  DOGRA LETTER E
11807  DOGRA LETTER AI
11808  DOGRA LETTER O
11809  DOGRA LETTER AU
Consonants
1180A  DOGRA LETTER KA
1180B  DOGRA LETTER KHA
1180C  DOGRA LETTER GA
1180D  DOGRA LETTER GHA
1180E  DOGRA LETTER NGA
1180F  DOGRA LETTER CA
11810  DOGRA LETTER CHA
11811  DOGRA LETTER JA
11812  DOGRA LETTER JHA
11813  DOGRA LETTER NYA
11814  DOGRA LETTER TTA
11815  DOGRA LETTER TTHA
11816  DOGRA LETTER DDA
11817  DOGRA LETTER DDHA
11818  DOGRA LETTER NNA
11819  DOGRA LETTER TA
1181A  DOGRA LETTER THA
1181B  DOGRA LETTER DA
1181C  DOGRA LETTER DHA
1181D  DOGRA LETTER NA
1181E  DOGRA LETTER PA
1181F  DOGRA LETTER PHA
11820  DOGRA LETTER BA
11821  DOGRA LETTER BHA
11822  DOGRA LETTER MA
11823  DOGRA LETTER YA
11824  DOGRA LETTER RA
11825  DOGRA LETTER LA
11826  DOGRA LETTER VA
11827  DOGRA LETTER SHA
11828  DOGRA LETTER SSA
11829  DOGRA LETTER SA
1182A  DOGRA LETTER HA
1182B  DOGRA LETTER RRA
Dependent vowel signs
1182C  DOGRA VOWEL SIGN AA
1182D  DOGRA VOWEL SIGN I
1182E  DOGRA VOWEL SIGN II
1182F  DOGRA VOWEL SIGN U
11830  DOGRA VOWEL SIGN UU
11831  DOGRA VOWEL SIGN VOCALIC R
11832  DOGRA VOWEL SIGN E
11833  DOGRA VOWEL SIGN AI
11834  DOGRA VOWEL SIGN O
11835  DOGRA VOWEL SIGN AU
Various signs
11836  DOGRA SIGN ANUSVARA
11837  DOGRA SIGN VISARGA
11838  DOGRA SIGN VIRAMA
11839  DOGRA SIGN NUKTA
Punctuation
1183A  DOGRA ABBREVIATION SIGN
Figure 1: Comparison of Old Dogra with Takri and other scripts (from Leitner 1882: Set 2).
Figure 2: Comparison of Old Dogra with Takri and other scripts (from Leitner 1882: Set 2).
Figure 3: Specimen of the Dogri language written in Old Dogra (from Grierson 1916: 760).
Figure 4: Another specimen of the Dogri language written in Old Dogra (from Grierson 1916: 772).
Figure 5: Chart of the Old Dogra script (from Grierson 1916: 641).
Figure 6: Comparison of Gurmukhi, Kangra, and Old Dogra scripts (from Grierson 1916: 642).
Figure 7: Comparison of New Dogra (‘Dogri’) and Devanagari vowel and consonant letters (from Staal 1984: 33).
Figure 8: Comparison of New Dogra (‘Dogri’) and Devanagari vowel signs (from Staal 1984: 34).
Figure 9: The first page of a Dogri translation of the Līlavatī by Bhāskarācārya, a Sanskrit treatise on mathematics, printed in the New Dogra script in 1872 (p. 1). Figure 10 contains the second page of this book.
Figure 10: Another page from the Līlavatī in New Dogra (p. 2).
Figure 11: Page from the Līlavatī showing the use of printed New Dogra digits (p. 3).
Figure 12: Usage of fractions for denoting currency in New Dogra (from Staal 1984: 75). The text appears to be typeset using metal fonts. The fraction signs and currency marks may be represented using characters from the ‘Common Indic Number Forms’ block (U+A830) in Unicode.
Figure 13: Two specimens of a trilingual stamp for expedited mail from Jammu State in Old Dogra, Persian, and Devanagari scripts. The Dogra text reads <U+1180A, U+1180A, U+11825> <U+11811, U+11824, U+1182F, U+11824, U+1182E> kakal jarurī. The Persian reads خط ضروری ḵẖat zarūrī; the Devanagari reads आवश्यक पत्र āvaśyaka patra.
Figure 14: Four bilingual regular postage stamps from Jammu State with text in the Old Dogra and Persian scripts. The set includes half ‘anna’ stamps and a one ‘anna’ stamp. In all stamps, the Dogra text at the top reads <U+11811, U+11822, U+11804> jamu ‘Jammu’ and <U+1180A, U+11810, U+11822, U+1182E, U+11824> kachamīr (= kaśmīr) ‘Kashmir’. In the half ‘anna’ stamps, the first line of the center is Persian ۱۹۲۳ نم انه 1923 [samvat] nim ānah ‘half ānnā’; the second line of the center is Dogra <U+11800, U+1181B, U+11800, U+1181D> ada ana ‘half ānnā’. In the one ‘anna’ stamp, the first line of the center text is Persian یك انه yak ānah ‘one ānnā’; the second line of the center is Dogra <U+11806, U+1180A, U+11800, U+1181D> ekana ‘one ānnā’. In all four stamps, the third line of the center is the date 1923 [samvat] written in Devanagari-like digits, १९२३. The Persian text at the bottom of the stamp reads قلمرو سرکار جمون و کشمیر qalamrau sarkār jammūn o kaśmīr ‘dominions of the ruler of Jammu and Kashmir’.
Detail of top left (Devanagari)
Detail of top right (New Dogra)
Detail of center (New Dogra, Devanagari, and Urdu)
Figure 15: The mast of a two rupee non-judicial stamp paper from Kashmir State.
Figure 16: The mast of a five rupee non-judicial stamp paper from Kashmir State containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.
Figure 17: A one anna stamp from Jammu and Kashmir State with text in New Dogra, Devanagari, and Arabic (Urdu).
Figure 18: Telegraph stamps from Jammu and Kashmir State with text in Dogra, Devanagari, Urdu, and English. The Dogra denominations show usage of Common Indic fractions and the rupee mark.
postakarda kalamrau jammu kasmir va tibbataha: “Postcard of the realm of Jammu, Kashmir, and Tibet”
isa tarapha siva pate ke kucha na likho: “Do not write anything on this side except for the address”
jamu, sinagar: “Jammu”, “Srinagar”
bhaisakh (4), 4 vaisakh: “Vaisakh 4”, “4 Vaisakh”
Figure 19: A state postcard from Jammu and Kashmir with postmark of 19 April 1891. The inscription at top is in New Dogra and the content is in Persian. The postmarks are in English and Dogra.
One rupee banknote
Twenty rupee banknote
Figure 20: Banknotes from Jammu and Kashmir State containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.
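Vowel letters
Letter  Chamba (Takri block)  Dogra
A       U+11680               U+11800
AA      U+11681               U+11801
I       U+11682               U+11802
II      U+11683               (none)
U       U+11684               U+11804
UU      U+11685               (none)
E       U+11686               U+11806
AI      U+11687               U+11807
O       U+11688               U+11808
AU      U+11689               (none)
Vowel signs
Sign    Chamba (Takri block)  Dogra
A       no dependent form
AA      U+116AD               U+1182C
I       U+116AE               U+1182D
II      U+116AF               U+1182E
U       U+116B0               U+1182F
UU      U+116B1               U+11830
E       U+116B2               U+11832
AI      U+116B3               U+11833
O       U+116B4               U+11834
AU      U+116B5               U+11835
Table 2: Comparison of Chamba and Dogra vowel letters and signs.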
Letter  Chamba (Takri block)  Dogra
KA      U+1168A               U+1180A
KHA     U+1168B               U+1180B
GA      U+1168C               U+1180C
GHA     U+1168D               U+1180D
NGA     U+1168E               U+1180E
CA      U+1168F               U+1180F
CHA     U+11690               U+11810
JA      U+11691               U+11811
JHA     U+11692               U+11812
NYA     U+11693               U+11813
TTA     U+11694               U+11814
TTHA    U+11695               U+11815
DDA     U+11696               U+11816
DDHA    U+11697               U+11817
NNA     U+11698               U+11818
TA      U+11699               U+11819
THA     U+1169A               U+1181A
DA      U+1169B               U+1181B
DHA     U+1169C               U+1181C
NA      U+1169D               U+1181D
PA      U+1169E               U+1181E
PHA     U+1169F               U+1181F
BA      U+116A0               U+11820
BHA     U+116A1               U+11821
MA      U+116A2               U+11822
YA      U+116A3               U+11823
RA      U+116A4               U+11824
LA      U+116A5               U+11825
VA      U+116A6               U+11826
SHA     U+116A7               U+11827
SSA     (none)                U+11828
SA      U+116A8               U+11829
HA      U+116A9               U+1182A
RRA     U+116AA               U+1182B
Table 3: Comparison of Chamba and Dogra consonant letters.
ISO/IEC JTC 1/SC 2/WG 2
PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646
Please fill all the sections A, B and C below. Please read Principles and Procedures Document (P & P) from http://std.dkuug.dk/JTC1/SC2/WG2/docs/principles.html for guidelines and details before filling this form. Please ensure you are using the latest Form from http://std.dkuug.dk/JTC1/SC2/WG2/docs/summaryform.html. See also http://std.dkuug.dk/JTC1/SC2/WG2/docs/roadmaps.html for latest Roadmaps.
Form number: N4502-F (Original 1994-10-14; Revised 1995-01, 1995-04, 1996-04, 1996-08, 1999-03, 2001-05, 2001-09, 2003-11, 2005-01, 2005-09, 2005-10, 2007-03, 2008-05, 2009-11, 2011-03, 2012-01)
A. Administrative
1. Title: Proposal to encode the Dogra script in Unicode
2. Requester's name: Anshuman Pandey, Script Encoding Initiative
3. Requester type (Member body/Liaison/Individual contribution): Liaison contribution
4. Submission date: 4 November 2015
5. Requester's reference (if applicable):
6. Choose one of the following:
   This is a complete proposal: Yes
   (or) More information will be provided later:
B. Technical – General
1. Choose one of the following:
   a. This proposal is for a new script (set of characters): Yes
      Proposed name of script: Dogra
   b. The proposal is for addition of character(s) to an existing block:
      Name of the existing block:
2. Number of characters in proposal: 59
3. Proposed category (select one from below; see section 2.2 of P&P document): E-Minor extinct
   Options: A-Contemporary; B.1-Specialized (small collection); B.2-Specialized (large collection); C-Major extinct; D-Attested extinct; E-Minor extinct; F-Archaic Hieroglyphic or Ideographic; G-Obscure or questionable usage symbols
4. Is a repertoire including character names provided? Yes
   a. If YES, are the names in accordance with the “character naming guidelines” in Annex L of P&P document?
   b. Are the character shapes attached in a legible form suitable for review?
5. Fonts related:
   a. Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the standard? Anshuman Pandey
   b. Identify the party granting a license for use of the font by the editors (include address, e-mail, ftp-site, etc.): Anshuman Pandey
6. References:
   a. Are references (to other character sets, dictionaries, descriptive texts etc.) provided? Yes
   b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? Yes
7. Special encoding issues: Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration etc. (if yes please enclose information)? Yes
8. Additional Information: Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org for such information on other scripts. Also see Unicode Character Database (http://www.unicode.org/reports/tr44/) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard.
C. Technical - Justification
1. Has this proposal for addition of character(s) been submitted before? No
   If YES, explain:
2. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? Yes
   If YES, with whom? Christopher Shackle (SOAS, London)
   If YES, available relevant documents:
3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? No
   Reference: Historical script, currently not used
4. The context of use for the proposed characters (type of use; common or rare): Common
   Reference:
5. Are the proposed characters in current use by the user community? No
   If YES, where? Reference:
6. After giving due considerations to the principles in the P&P document, must the proposed characters be entirely in the BMP? N/A
   If YES, is a rationale provided? If YES, reference:
7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? Yes
8. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? No
   If YES, is a rationale for its inclusion provided? If YES, reference:
9. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? No
   If YES, is a rationale for its inclusion provided? If YES, reference:
10. Can any of the proposed character(s) be considered to be similar (in appearance or function) to, or could be confused with, an existing character? No
   If YES, is a rationale for its inclusion provided? If YES, reference:
11. Does the proposal include use of combining characters and/or use of composite sequences? Yes
   If YES, is a rationale for such use provided? Yes
   If YES, reference: See text of proposal
   Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided? If YES, reference:
12. Does the proposal contain characters with any special properties such as control function or similar semantics? Yes
   If YES, describe in detail (include attachment if necessary): Virama; see text of proposal
13. Does the proposal contain any Ideographic compatibility characters? No
   If YES, are the equivalent corresponding unified ideographic characters identified? If YES, reference:
letters”, where ‘Dvigarta’ is presented as the Sanskritic root of the name ‘Dogra’. Another indigenous name for the script is name dogra akkhar, or “New Dogra letters”.
3.4 Representative glyphs
The representative glyphs for Dogra characters are based primarily upon forms used in New Dogra. Characters from Old Dogra have been used when necessary, sourced primarily from figures 1 and 2. The font has been designed by the proposal author.
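3.5 Vowel letters
There are 10 vowel letters:
U+11800  DOGRA LETTER A
U+11801  DOGRA LETTER AA
U+11802  DOGRA LETTER I
U+11803  DOGRA LETTER II
U+11804  DOGRA LETTER U
U+11805  DOGRA LETTER UU
U+11806  DOGRA LETTER E
U+11807  DOGRA LETTER AI
U+11808  DOGRA LETTER O
U+11809  DOGRA LETTER AU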
In Old Dogra the values of three vowel letters are variable. As shown in the accompanying sources, the letter with the shape of U+11803 was used for both i and ī; the letter with the shape of U+11802 for u and ū; and a third letter for u, ū, o, and au. In New Dogra the values of these three letters became fixed, and new letters were introduced in order to provide distinctive representations for all basic vowels. It is apparent that the attempt to stabilize the vowel repertoire resulted in overlap and reassignment of several letters. The Old Dogra U+11803 (i, ī) was retained solely for ī, and U+11802 (u, ū) was reassigned for i. A modified form of the third letter (u, o) came to represent u, while the form U+11805 was introduced for ū, which is certainly an adaptation of the letter ū from Devanagari or Sharada. Distinct letters for U+11808 o and U+11809 au were provided by placing the corresponding dependent vowel signs upon the modified u.
Value  Old               New
i      U+11803           U+11802
ī      (none)            U+11803
u      U+11802           U+11804
ū      (none)            U+11805
o      [Old Dogra form]  U+11808
au     [Old Dogra form]  U+11809
The proposed vowel letters are based upon the repertoire of New Dogra. This approach completely accommodates vowels in the old and new forms of the script. Although the names of New Dogra vowel letters do not strictly correlate with values of Old Dogra vowels, the proposed letters can be used for representing the old script. DOGRA LETTER I (U+11802) can be used for u, DOGRA LETTER II (U+11803) can be used for i, and DOGRA LETTER U (U+11804) can be used for o and au; the height and curvature of the left stroke may be considered a stylistic attribute.
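3.6 Vowel Signs
There are 10 dependent vowel signs:
U+1182C  DOGRA VOWEL SIGN AA
U+1182D  DOGRA VOWEL SIGN I
U+1182E  DOGRA VOWEL SIGN II
U+1182F  DOGRA VOWEL SIGN U
U+11830  DOGRA VOWEL SIGN UU
U+11831  DOGRA VOWEL SIGN VOCALIC R
U+11832  DOGRA VOWEL SIGN E
U+11833  DOGRA VOWEL SIGN AI
U+11834  DOGRA VOWEL SIGN O
U+11835  DOGRA VOWEL SIGN AU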
DOGRA VOWEL SIGN VOCALIC R (U+11831) appears to have been introduced as part of New Dogra. It is used for representing words of Sanskrit origin that contain the vocalic sound r. Its shape is derived from either U+11190 or U+0943 DEVANAGARI VOWEL SIGN VOCALIC R. There is no corresponding independent vowel letter.
There is an alphabetic aspect to the representation of consonant-vowel syllables in written Dogra. The dependent vowel may be represented using the independent vowel letter as well as the vowel sign, e.g. <U+1180A, U+11806> may be used in place of <U+1180A, U+11832> for the syllable ke.
There are differences in the shape of certain vowel signs between the old and new script. The typical Takri forms of the vowel signs -u and -ū, which also resemble corresponding forms in Gurmukhi, were replaced with forms resembling the Devanagari signs, U+1182F and U+11830 respectively. The typical Takri style of the vowel sign o was applied for writing U+11835 au, while a simplified form of o, U+11834, was used for o. The duplication of o for writing au was eliminated. A comparison of vowel signs in the old and new scripts, as they align with the proposed characters, is as follows:
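Old               New
[Old Dogra form]  U+1182D DOGRA VOWEL SIGN I
[Old Dogra form]  U+1182E DOGRA VOWEL SIGN II
[Old Dogra form]  U+1182F DOGRA VOWEL SIGN U
[Old Dogra form]  U+11830 DOGRA VOWEL SIGN UU
[Old Dogra form]  U+11834 DOGRA VOWEL SIGN O
[Old Dogra form]  U+11835 DOGRA VOWEL SIGN AU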
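3.7 Consonants
There are 34 consonant letters:
U+1180A KA, U+1180B KHA, U+1180C GA, U+1180D GHA, U+1180E NGA
U+1180F CA, U+11810 CHA, U+11811 JA, U+11812 JHA, U+11813 NYA
U+11814 TTA, U+11815 TTHA, U+11816 DDA, U+11817 DDHA, U+11818 NNA
U+11819 TA, U+1181A THA, U+1181B DA, U+1181C DHA, U+1181D NA
U+1181E PA, U+1181F PHA, U+11820 BA, U+11821 BHA, U+11822 MA
U+11823 YA, U+11824 RA, U+11825 LA, U+11826 VA
U+11827 SHA, U+11828 SSA, U+11829 SA, U+1182A HA, U+1182B RRA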
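DOGRA LETTER SSA (U+11828) was introduced into New Dogra for representing the retroflex sibilant ṣa, which occurs in words of Sanskrit origin. It resembles SHARADA LETTER SSA (U+111B0).
There are glyphic variants of several consonant letters:
Regular                    Variant
U+1180C DOGRA LETTER GA    [variant form]
U+1180D DOGRA LETTER GHA   [variant form]
U+11810 DOGRA LETTER CHA   [variant form]
U+11812 DOGRA LETTER JHA   [variant form]
U+1181A DOGRA LETTER THA   [variant form]
U+1181C DOGRA LETTER DHA   [variant form]
U+11824 DOGRA LETTER RA    [variant form]
There is evidence of usage of different forms of the same letter in the available sources. For instance, both the regular form U+11810 and its variant are used for DOGRA LETTER CHA.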
3.8 Anusvara
DOGRA SIGN ANUSVARA (U+11836) is used for marking nasalization.
3.9 Visarga
DOGRA SIGN VISARGA (U+11837) is used for indicating post-vocalic aspiration in words of Sanskrit origin. It is shown below in the word <U+1181D, U+11822, U+11837> namaḥ “homage to”.
3.10 Nukta
DOGRA SIGN NUKTA (U+11839) is used for representing sounds that are not native to Dogri and related languages.
3.11 Virama
DOGRA SIGN VIRAMA (U+11838) has two functions, similar to the corresponding character of Devanagari. It is used as a halanta for marking the absence of the inherent vowel of a consonant letter. It is also a control character that is used for producing conjuncts.
3.12 Consonant Conjuncts
Consonant clusters are rendered in three ways: 1) with visible halanta beneath the full form of each bare consonant; 2) with a half-form of the initial consonant followed by the full form of the following letter; and 3) as ligatures. There are no formal conventions for the written expression of conjuncts. DOGRA SIGN VIRAMA (U+11838) is used for the encoded representation of conjuncts.
3.12.1 Types of conjuncts
Some representations of Dogra conjuncts as found in the available sources are illustrated below:
• Visible halanta: In both Old and New Dogra, the most common method of representing consonant clusters is by placing a halanta beneath each bare consonant.
• Half-forms: Another method is to use half-forms of letters, if the graphical structure provides such an opportunity.
The above shows the conjunct śra in the word śrī (a Sanskrit honorific). The half-form is produced by dropping the right vertical bar of DOGRA LETTER SHA (U+11827). This conjunct is used consistently in New Dogra for representing the cluster śra; the form <U+11827, U+11838, U+11824>, with visible halanta beneath a full form of DOGRA LETTER SHA, does not occur in the available sources.
• Ligatures: There are two types of ligature. In the first, the shapes of individual letters are visible. In the below, the boxed text contains the conjunct sṭa from the word sṭāṃp, a transcription of the English “stamp”.
The second type is an atomic (akhaṇḍa) ligature, whose form may be considered a distinctive letter. Such a ligature occurs in the excerpt below.
The boxed text above contains the atomic ligature kṣa, which occurs in the word akṣaroṃ “in letters” (oblique form of akṣara “letter”). The conjunct kṣa is used consistently in New Dogra for representing the cluster kṣa. Its shape may be influenced by Devanagari kṣa.
• Forms of DOGRA LETTER RA (U+11824): Special conjunct forms of DOGRA LETTER RA, e.g. repha or vattu, do not generally occur. Cluster-initial RA is represented as a full form with halanta, as in rṇa in the word saṃpūrṇa below,
and non-initial RA is treated similarly, as in pra in the word prabhu below.
However, there is one instance in the available sources in which cluster-initial RA is represented as a repha. The occurrence is curious because the word caturbhuja “square”, in which repha occurs, appears three times in proximity but is spelled differently in all three instances.
In the boxed text at the top, the RA in caturbhuja is written with halanta: <U+1180F, U+11819, U+1182F, U+11824, U+11838, U+11821, U+1182F, U+11811> caturbhuja. In the text at bottom left, RA occurs with its inherent vowel intact, without halanta: <U+1180F, U+11819, U+1182F, U+11824, U+11821, U+1182F, U+11811> caturabhuja. At bottom right, ra is represented as repha in caturbhuja.
The usage of repha in Dogra is to be considered anomalous. There is no present need to support this form of DOGRA LETTER RA in encoded representation.
3.12.2 Encoded representation of conjuncts
The Devanagari model for the representation of conjuncts is appropriate for Dogra. In encoded text, conjuncts are produced by placing the control character DOGRA SIGN VIRAMA (U+11838) after each non-final consonant in a cluster, i.e. <(C, U+11838), C>.
Representations of conjuncts rendered as atomic ligatures are:
kṣa  <U+1180A, U+11838, U+11828>
sṭa  <U+11829, U+11838, U+11814>
Examples of conjuncts rendered using half-forms are:
śra  <U+11827, U+11838, U+11824>
Conjuncts displayed using visible halanta are:
pra  <U+1181E, U+11838, U+11824>
rṇa  <U+11824, U+11838, U+11818>
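The encoding model above can be sketched in a few lines of Python. This is only an illustration: the helper names `encode_conjunct` and `show` are hypothetical, and the code points are the ones proposed in this document, which may differ from the values eventually published in the Unicode Standard.

```python
# Code points as proposed in this document (assumption: not necessarily
# the values in the published standard).
VIRAMA = 0x11838
KA, SSA, SA, TTA = 0x1180A, 0x11828, 0x11829, 0x11814
SHA, RA, PA, NNA = 0x11827, 0x11824, 0x1181E, 0x11818


def encode_conjunct(*consonants: int) -> str:
    """Return the encoded cluster: VIRAMA follows every consonant but the last."""
    if len(consonants) < 2:
        raise ValueError("a conjunct needs at least two consonants")
    codepoints = []
    for consonant in consonants[:-1]:
        codepoints.extend((consonant, VIRAMA))
    codepoints.append(consonants[-1])
    return "".join(chr(cp) for cp in codepoints)


def show(text: str) -> str:
    """Format a string as a space-separated list of U+XXXX code points."""
    return " ".join(f"U+{ord(ch):04X}" for ch in text)


print(show(encode_conjunct(KA, SSA)))   # kṣa
print(show(encode_conjunct(SHA, RA)))   # śra
print(show(encode_conjunct(RA, NNA)))   # rṇa
```

Whether a given sequence is displayed as a ligature, a half-form, or with visible halanta is decided by the font, not by the encoded text.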
3.12.3 Modifying conjunct formation
As is the case in Devanagari and other Indic scripts, a ligature may be broken using the generic control character U+200C ZERO WIDTH NON-JOINER:
kṣa (ligature)         <U+1180A, U+11838, U+11828>
kṣa (visible halanta)  <U+1180A, U+11838, U+200C, U+11828>
The rendering of a conjunct as a ligature or with visible halanta is a matter of font design.
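As a rough sketch of how an application might request visible halanta throughout a string, the following Python function inserts U+200C after each virama that is followed by another character. The function name `break_ligatures` is hypothetical, and the virama code point is the one proposed in this document.

```python
ZWNJ = "\u200C"        # U+200C ZERO WIDTH NON-JOINER
VIRAMA = "\U00011838"  # DOGRA SIGN VIRAMA as proposed here (assumed code point)


def break_ligatures(text: str) -> str:
    """Insert ZWNJ after each virama that is followed by another character.

    A virama already followed by ZWNJ, or one at the end of the string,
    is left untouched, so applying the function twice changes nothing.
    """
    out = []
    for i, ch in enumerate(text):
        out.append(ch)
        if ch == VIRAMA and i + 1 < len(text) and text[i + 1] != ZWNJ:
            out.append(ZWNJ)
    return "".join(out)


ksa = "\U0001180A\U00011838\U00011828"  # <KA, VIRAMA, SSA>
print([f"U+{ord(c):04X}" for c in break_ligatures(ksa)])
```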
3.13 Punctuation
Abbreviation sign: DOGRA ABBREVIATION SIGN (U+1183A) is used for indicating truncation of text. In the excerpt below, the highlighted text <U+11825, U+1182E, U+1183A> <U+11826, U+1183A> <U+11821, U+1182C, U+1183A> (lī va bhā 23) is an abbreviation for līla vatī bhāga 23.
Danda-s: The daṇḍā and double daṇḍā are commonly used in Dogra documents for marking the ends of sentences and paragraphs. The below excerpt shows usage of the daṇḍā,
and the following shows use of the double daṇḍā.
These daṇḍā-s are not proposed for encoding as separate characters in the ‘Dogra’ block. They resemble U+0964 DEVANAGARI DANDA and U+0965 DEVANAGARI DOUBLE DANDA and may be represented using these characters. The Devanagari daṇḍā-s are to be specified as extensions for Dogra.
Other signs: Other marks of punctuation, such as dashes, are also used.
A hyphen is used in some cases for indicating breaks in a word at end of line (see section 3.16).
3.14 Digits
Digits are commonly used in Dogra documents (see figure 11). The style of digits varies across written and printed sources. Some digits in Old Dogra sources resemble those of Takri (see figure 4), while those in New Dogra sources are similar to Devanagari. The excerpt below shows digits 1–9 and the number 10 as represented in the Līlavatī, written in New Dogra.
The majority of the digits shown here resemble Devanagari forms; the exception is ‘four’. The form of Dogra ‘four’ does not resemble TAKRI DIGIT FOUR (U+116C4), which is nearly identical to the Devanagari digit four. Information about its origins is not known at this time, but it may be related to SHARADA DIGIT FOUR (U+111D4). The forms of ‘eight’ and ‘nine’ are well-known variant forms of the regular Devanagari digits.
Additional research into Dogra digits is required before they may be considered for encoding. The available sources suggest that there may have been a cultural understanding that Dogra possessed unique digits. For instance, on the banknotes shown in figure 20, the digits written in Dogra and Devanagari are rather similar, but are nonetheless displayed twice in order to align the display of the currency value with each script used for the inscriptions.
At present, script-specific digits are not proposed for inclusion in the Dogra repertoire. Devanagari digits should be used normatively and should be extended for usage with Dogra. If future research yields distinctive digits for Dogra, there is sufficient space in the block to accommodate their encoding.
3.15 Number forms
Fraction signs and currency marks are commonly used in Dogra sources (see figure 12). These may be represented using the fraction and currency signs already encoded in the ‘Common Indic Number Forms’ block (U+A830). These characters are specified as extensions for Dogra. The ‘Common Indic Number Forms’ are described briefly in The Unicode Standard (chapter 22.3, ‘Numerals’, pp. 770–771) and in more detail in the original encoding proposal (L2/07-354).
3.16 Linebreaking
Linebreaking occurs after an orthographic syllable. Breaks are generally not indicated in informal sources, but are marked in literary materials using hyphens.
3.17 Collation
The primary collating order for Dogra is as follows:
U+11800 < U+11801 < U+11802 < U+11803 < U+11804 < U+11805 < U+11806 < U+11807 < U+11808 < U+11809 <
U+1180A < U+1180B < U+1180C < U+1180D < U+1180E < U+1180F < U+11810 < U+11811 <
U+11812 < U+11813 < U+11814 < U+11815 < U+11816 < U+11817 < U+11818 < U+11819 <
U+1181A < U+1181B < U+1181C < U+1181D < U+1181E < U+1181F < U+11820 < U+11821 <
U+11822 < U+11823 < U+11824 < U+11825 < U+11826 < U+11827 < U+11828 < U+11829 < U+1182A <
U+1182B < U+1182C < U+1182D < U+1182E <
U+1182F < U+11830 < U+11831 < U+11832 <
U+11833 < U+11834 < U+11835
The following characters have secondary weights: U+11836, U+11837, U+11838, U+11839.
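Because the primary order listed above coincides with code point order for U+11800..U+11835, a two-level sort key is easy to sketch. The following Python is a deliberate simplification, not an implementation of the Unicode Collation Algorithm: the name `sort_key` is hypothetical, the code points are those proposed in this document, and the secondary level simply collects the four signs in order of appearance.

```python
# Characters given secondary weights in the proposal:
# anusvara, visarga, virama, nukta (proposed code points, an assumption).
SECONDARY = range(0x11836, 0x1183A)


def sort_key(word: str):
    """Two-level key: primary letters and vowel signs first, then the signs."""
    primary = tuple(ord(ch) for ch in word if ord(ch) not in SECONDARY)
    secondary = tuple(ord(ch) for ch in word if ord(ch) in SECONDARY)
    return (primary, secondary)


ka = "\U0001180A"             # ka
kam = "\U0001180A\U00011836"  # kaṃ
kaa = "\U0001180A\U0001182C"  # kā
kha = "\U0001180B"            # kha

ordered = sorted([kha, kaa, kam, ka], key=sort_key)
```

With this key, kaṃ sorts directly after ka and before kā, whereas a plain code point sort would place kā first because U+1182C is lower than U+11836.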
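4 Character data
4.1 Character properties
The properties for Dogra in the Unicode Character Database format are:
11800;DOGRA LETTER A;Lo;0;L;;;;;N;;;;;
11801;DOGRA LETTER AA;Lo;0;L;;;;;N;;;;;
11802;DOGRA LETTER I;Lo;0;L;;;;;N;;;;;
11803;DOGRA LETTER II;Lo;0;L;;;;;N;;;;;
11804;DOGRA LETTER U;Lo;0;L;;;;;N;;;;;
11805;DOGRA LETTER UU;Lo;0;L;;;;;N;;;;;
11806;DOGRA LETTER E;Lo;0;L;;;;;N;;;;;
11807;DOGRA LETTER AI;Lo;0;L;;;;;N;;;;;
11808;DOGRA LETTER O;Lo;0;L;;;;;N;;;;;
11809;DOGRA LETTER AU;Lo;0;L;;;;;N;;;;;
1180A;DOGRA LETTER KA;Lo;0;L;;;;;N;;;;;
1180B;DOGRA LETTER KHA;Lo;0;L;;;;;N;;;;;
1180C;DOGRA LETTER GA;Lo;0;L;;;;;N;;;;;
1180D;DOGRA LETTER GHA;Lo;0;L;;;;;N;;;;;
1180E;DOGRA LETTER NGA;Lo;0;L;;;;;N;;;;;
1180F;DOGRA LETTER CA;Lo;0;L;;;;;N;;;;;
11810;DOGRA LETTER CHA;Lo;0;L;;;;;N;;;;;
11811;DOGRA LETTER JA;Lo;0;L;;;;;N;;;;;
11812;DOGRA LETTER JHA;Lo;0;L;;;;;N;;;;;
11813;DOGRA LETTER NYA;Lo;0;L;;;;;N;;;;;
11814;DOGRA LETTER TTA;Lo;0;L;;;;;N;;;;;
11815;DOGRA LETTER TTHA;Lo;0;L;;;;;N;;;;;
11816;DOGRA LETTER DDA;Lo;0;L;;;;;N;;;;;
11817;DOGRA LETTER DDHA;Lo;0;L;;;;;N;;;;;
11818;DOGRA LETTER NNA;Lo;0;L;;;;;N;;;;;
11819;DOGRA LETTER TA;Lo;0;L;;;;;N;;;;;
1181A;DOGRA LETTER THA;Lo;0;L;;;;;N;;;;;
1181B;DOGRA LETTER DA;Lo;0;L;;;;;N;;;;;
1181C;DOGRA LETTER DHA;Lo;0;L;;;;;N;;;;;
1181D;DOGRA LETTER NA;Lo;0;L;;;;;N;;;;;
1181E;DOGRA LETTER PA;Lo;0;L;;;;;N;;;;;
1181F;DOGRA LETTER PHA;Lo;0;L;;;;;N;;;;;
11820;DOGRA LETTER BA;Lo;0;L;;;;;N;;;;;
11821;DOGRA LETTER BHA;Lo;0;L;;;;;N;;;;;
11822;DOGRA LETTER MA;Lo;0;L;;;;;N;;;;;
11823;DOGRA LETTER YA;Lo;0;L;;;;;N;;;;;
11824;DOGRA LETTER RA;Lo;0;L;;;;;N;;;;;
11825;DOGRA LETTER LA;Lo;0;L;;;;;N;;;;;
11826;DOGRA LETTER VA;Lo;0;L;;;;;N;;;;;
11827;DOGRA LETTER SHA;Lo;0;L;;;;;N;;;;;
11828;DOGRA LETTER SSA;Lo;0;L;;;;;N;;;;;
11829;DOGRA LETTER SA;Lo;0;L;;;;;N;;;;;
1182A;DOGRA LETTER HA;Lo;0;L;;;;;N;;;;;
1182B;DOGRA LETTER RRA;Lo;0;L;;;;;N;;;;;
1182C;DOGRA VOWEL SIGN AA;Mc;0;L;;;;;N;;;;;
1182D;DOGRA VOWEL SIGN I;Mc;0;L;;;;;N;;;;;
1182E;DOGRA VOWEL SIGN II;Mc;0;L;;;;;N;;;;;
1182F;DOGRA VOWEL SIGN U;Mn;0;NSM;;;;;N;;;;;
11830;DOGRA VOWEL SIGN UU;Mn;0;NSM;;;;;N;;;;;
11831;DOGRA VOWEL SIGN VOCALIC R;Mn;0;NSM;;;;;N;;;;;
11832;DOGRA VOWEL SIGN E;Mn;0;NSM;;;;;N;;;;;
11833;DOGRA VOWEL SIGN AI;Mn;0;NSM;;;;;N;;;;;
11834;DOGRA VOWEL SIGN O;Mn;0;NSM;;;;;N;;;;;
11835;DOGRA VOWEL SIGN AU;Mn;0;NSM;;;;;N;;;;;
11836;DOGRA SIGN ANUSVARA;Mn;0;NSM;;;;;N;;;;;
11837;DOGRA SIGN VISARGA;Mc;0;L;;;;;N;;;;;
11838;DOGRA SIGN VIRAMA;Mn;9;NSM;;;;;N;;;;;
11839;DOGRA SIGN NUKTA;Mn;7;NSM;;;;;N;;;;;
1183A;DOGRA ABBREVIATION SIGN;Po;0;L;;;;;N;;;;;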
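For readers who want to inspect these records programmatically, the following Python sketch parses lines in the 15-field, semicolon-separated UnicodeData.txt layout and keeps the first five fields. The names `UcdRecord` and `parse_unicode_data_line` are hypothetical, and the sample lines use the code points proposed in this document.

```python
from typing import NamedTuple


class UcdRecord(NamedTuple):
    codepoint: int
    name: str
    general_category: str
    combining_class: int
    bidi_class: str


def parse_unicode_data_line(line: str) -> UcdRecord:
    """Parse one UnicodeData.txt-style line (15 semicolon-separated fields)."""
    fields = line.strip().split(";")
    if len(fields) != 15:
        raise ValueError(f"expected 15 fields, got {len(fields)}")
    return UcdRecord(
        codepoint=int(fields[0], 16),
        name=fields[1],
        general_category=fields[2],
        combining_class=int(fields[3]),
        bidi_class=fields[4],
    )


SAMPLE = """\
1180A;DOGRA LETTER KA;Lo;0;L;;;;;N;;;;;
1182F;DOGRA VOWEL SIGN U;Mn;0;NSM;;;;;N;;;;;
11838;DOGRA SIGN VIRAMA;Mn;9;NSM;;;;;N;;;;;
11839;DOGRA SIGN NUKTA;Mn;7;NSM;;;;;N;;;;;
"""

records = [parse_unicode_data_line(line) for line in SAMPLE.splitlines()]
# Marks with a non-zero canonical combining class (here virama and nukta).
reordering_marks = [r.name for r in records if r.combining_class != 0]
```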
4.2 Linebreaking
Linebreaking properties, given in the data format of LineBreak.txt:
11800..1182B;AL # Lo [44] DOGRA LETTER A..DOGRA LETTER RRA
1182C..1182E;CM # Mc  [3] DOGRA VOWEL SIGN AA..DOGRA VOWEL SIGN II
1182F..11835;CM # Mn  [7] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN AU
11836;CM # Mn      DOGRA SIGN ANUSVARA
11837;CM # Mc      DOGRA SIGN VISARGA
11838..11839;CM # Mn  [2] DOGRA SIGN VIRAMA..DOGRA SIGN NUKTA
1183A;AL # Po      DOGRA ABBREVIATION SIGN
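The bracketed counts in such listings can be checked mechanically. The Python sketch below parses a `start..end;value` line and computes the size of the range; the names `LINE_RE` and `parse_property_line` are hypothetical, and the sample lines use the code points proposed in this document.

```python
import re

# Matches "XXXX;VALUE" or "XXXX..YYYY;VALUE", with optional spaces around ';'.
LINE_RE = re.compile(
    r"^(?P<start>[0-9A-F]{4,6})(?:\.\.(?P<end>[0-9A-F]{4,6}))?\s*;\s*(?P<value>\w+)"
)


def parse_property_line(line: str):
    """Return (first code point, last code point, property value)."""
    m = LINE_RE.match(line)
    if m is None:
        raise ValueError(f"not a property line: {line!r}")
    start = int(m["start"], 16)
    end = int(m["end"], 16) if m["end"] else start
    return start, end, m["value"]


letters = parse_property_line(
    "11800..1182B;AL # Lo [44] DOGRA LETTER A..DOGRA LETTER RRA"
)
letter_count = letters[1] - letters[0] + 1  # size of the range
```

The same function accepts the spaced `code ; value` layout used by the Indic property files.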
4.3 Syllabic categories
Syllabic categories, given in the format of IndicSyllabicCategory.txt:
# Indic_Syllabic_Category=Bindu
11836 ; Bindu # Mn DOGRA SIGN ANUSVARA
# Indic_Syllabic_Category=Visarga
11837 ; Visarga # Mc DOGRA SIGN VISARGA
# Indic_Syllabic_Category=Nukta
11839 ; Nukta # Mn DOGRA SIGN NUKTA
# Indic_Syllabic_Category=Virama
11838 ; Virama # Mn DOGRA SIGN VIRAMA
# Indic_Syllabic_Category=Vowel_Independent
11800..11809 ; Vowel_Independent # Lo [10] DOGRA LETTER A..DOGRA LETTER AU
# Indic_Syllabic_Category=Vowel_Dependent
1182C..1182E ; Vowel_Dependent # Mc [3] DOGRA VOWEL SIGN AA..DOGRA VOWEL SIGN II
1182F..11835 ; Vowel_Dependent # Mn [7] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN AU
# Indic_Syllabic_Category=Consonant
1180A..1182B ; Consonant # Lo [34] DOGRA LETTER KA..DOGRA LETTER RRA
4.4 Positional categories
Positional data for characters, in the format of IndicPositionalCategory.txt:
# Indic_Positional_Category=Right
1182C ; Right # Mc DOGRA VOWEL SIGN AA
1182E ; Right # Mc DOGRA VOWEL SIGN II
11837 ; Right # Mc DOGRA SIGN VISARGA
# Indic_Positional_Category=Left
1182D ; Left # Mc DOGRA VOWEL SIGN I
# Indic_Positional_Category=Top
11836 ; Top # Mn DOGRA SIGN ANUSVARA
11832..11835 ; Top # Mn [4] DOGRA VOWEL SIGN E..DOGRA VOWEL SIGN AU
# Indic_Positional_Category=Bottom
1182F..11831 ; Bottom # Mn [3] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN VOCALIC R
11838 ; Bottom # Mn DOGRA SIGN VIRAMA
11839 ; Bottom # Mn DOGRA SIGN NUKTA
4.5 Script extensions
The following characters should be extended for usage with the Dogra script in ScriptExtensions.txt:
0964        # Po      DEVANAGARI DANDA
0965        # Po      DEVANAGARI DOUBLE DANDA
0966..096F  # Nd [10] DEVANAGARI DIGIT ZERO..DEVANAGARI DIGIT NINE
A830..A835  # No  [6] NORTH INDIC FRACTION ONE QUARTER..NORTH INDIC FRACTION THREE SIXTEENTHS
A836..A837  # So  [2] NORTH INDIC QUARTER MARK..NORTH INDIC PLACEHOLDER MARK
A838        # Sc      NORTH INDIC RUPEE MARK
A839        # So      NORTH INDIC QUANTITY MARK
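One way an application could use this list is to decide whether a character belongs in Dogra text, either as a proposed Dogra character or as one of the extended characters. The Python sketch below is illustrative only: the function name `usable_in_dogra` is hypothetical, and the Dogra range reflects the code points proposed in this document.

```python
# Proposed Dogra characters U+11800..U+1183A (assumption: proposal values).
DOGRA_RANGE = range(0x11800, 0x1183B)

# Characters the proposal asks to extend for use with Dogra.
EXTENDED_RANGES = [
    range(0x0964, 0x0966),  # DEVANAGARI DANDA, DOUBLE DANDA
    range(0x0966, 0x0970),  # DEVANAGARI DIGIT ZERO..NINE
    range(0xA830, 0xA83A),  # Common Indic Number Forms, U+A830..U+A839
]


def usable_in_dogra(ch: str) -> bool:
    """True if ch is a proposed Dogra character or an extended character."""
    cp = ord(ch)
    return cp in DOGRA_RANGE or any(cp in r for r in EXTENDED_RANGES)


print(usable_in_dogra("\u0964"))  # danda
print(usable_in_dogra("\u0915"))  # DEVANAGARI LETTER KA, not in the list
```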
4.6 ‘Confusable’ characters
Some Dogra characters that resemble characters encoded in other script blocks are:
DOGRA LETTER U           U+11187 SHARADA LETTER U
DOGRA LETTER UU          U+11188 SHARADA LETTER UU
DOGRA LETTER O           U+11684 TAKRI LETTER U
DOGRA LETTER KA          U+0A15 GURMUKHI LETTER KA
DOGRA LETTER CA          U+11196 SHARADA LETTER CA
DOGRA LETTER TTHA        U+1119C SHARADA LETTER TTHA
DOGRA LETTER TA          U+11699 TAKRI LETTER TA
DOGRA LETTER DHA         U+11680 TAKRI LETTER A
DOGRA LETTER PHA         U+1169F TAKRI LETTER PHA
DOGRA LETTER BHA         U+116A1 TAKRI LETTER BHA
DOGRA LETTER YA          U+116A3 TAKRI LETTER YA
DOGRA LETTER SHA         U+0936 DEVANAGARI LETTER SHA
DOGRA LETTER SSA         U+111B0 SHARADA LETTER SSA
DOGRA VOWEL SIGN U       U+0941 DEVANAGARI VOWEL SIGN U
DOGRA VOWEL SIGN UU      U+0942 DEVANAGARI VOWEL SIGN UU
DOGRA VOWEL SIGN E       U+116B2 TAKRI VOWEL SIGN E
DOGRA VOWEL SIGN AI      U+116B3 TAKRI VOWEL SIGN AI
DOGRA VOWEL SIGN O       U+116B4 TAKRI VOWEL SIGN O
DOGRA VOWEL SIGN AU      U+116B5 TAKRI VOWEL SIGN AU
DOGRA SIGN ANUSVARA      U+0902 DEVANAGARI SIGN ANUSVARA
DOGRA SIGN VIRAMA        U+094D DEVANAGARI SIGN VIRAMA
DOGRA SIGN NUKTA         U+093C DEVANAGARI SIGN NUKTA
DOGRA ABBREVIATION SIGN  U+0970 DEVANAGARI ABBREVIATION SIGN
5 References
Bhāskarācārya. 1872. Līlavatī. Translation into Dogri. Jammu: Vidya Vilas Press.
Grierson, George A. 1916. The Linguistic Survey of India. Vol. IX. Indo-Aryan Family. Central Group. Part I. Specimens of Western Hindī and Pañjābī. Calcutta: Office of the Superintendent of Government Printing, India.
Leitner, Gottlieb William. 1882. “A Collection of Specimens of Commercial and Other Alphabets and Handwritings as also of Multiplication Tables Current in Various Parts of the Panjab, Sind and the North West Provinces”. Published as Appendix V of History of Indigenous Education in the Punjab. Lahore: Anjuman-i-Punjab Press.
Pandey, Anshuman. 2007. “Proposal to Encode North Indic Number Forms in ISO/IEC 10646” (L2/07-354). http://www.unicode.org/L2/L2007/07354-north-indic.pdf
Pandey, Anshuman. 2009. “Proposal to Encode the Takri Script in ISO/IEC 10646” (L2/09-424). http://www.unicode.org/L2/L2009/09424-takri.pdf
Pandey, Anshuman. 2015. “Preliminary Proposal to Encode the Dogra Script in Unicode” (L2/15-213). http://www.unicode.org/L2/L2015/15213-dogra.pdf
Staal, Fritz. 1984. Stamps of Jammu & Kashmir. New York: The Collectors Club.
von der Lin, Carol A. “Collection Kashmir”. “Philatelic Glossary: Dōgrī-Takrī Scripts”. http://www.kashmirstamps.com/DogriGloss.html
6 Acknowledgments
I would like to thank Christopher Shackle (School of Oriental and African Studies, London) for reviewing this proposal.

This project was made possible in part through a Google Research Award, granted to Deborah Anderson for the Script Encoding Initiative, and a grant from the United States National Endowment for the Humanities (PR-50205-15), which funds the Universal Scripts Project (part of the Script Encoding Initiative at the University of California, Berkeley). Any views, findings, conclusions or recommendations expressed in this publication do not necessarily reflect those of Google or the National Endowment for the Humanities.
[Code chart: Dogra, range 11800–1184F, with proposed characters at 11800–1183A. Printed using UniBook™ (http://www.unicode.org/unibook/), 04-Nov-2015.]
Independent vowels
11800  DOGRA LETTER A
11801  DOGRA LETTER AA
11802  DOGRA LETTER I
11803  DOGRA LETTER II
11804  DOGRA LETTER U
11805  DOGRA LETTER UU
11806  DOGRA LETTER E
11807  DOGRA LETTER AI
11808  DOGRA LETTER O
11809  DOGRA LETTER AU

Consonants
1180A  DOGRA LETTER KA
1180B  DOGRA LETTER KHA
1180C  DOGRA LETTER GA
1180D  DOGRA LETTER GHA
1180E  DOGRA LETTER NGA
1180F  DOGRA LETTER CA
11810  DOGRA LETTER CHA
11811  DOGRA LETTER JA
11812  DOGRA LETTER JHA
11813  DOGRA LETTER NYA
11814  DOGRA LETTER TTA
11815  DOGRA LETTER TTHA
11816  DOGRA LETTER DDA
11817  DOGRA LETTER DDHA
11818  DOGRA LETTER NNA
11819  DOGRA LETTER TA
1181A  DOGRA LETTER THA
1181B  DOGRA LETTER DA
1181C  DOGRA LETTER DHA
1181D  DOGRA LETTER NA
1181E  DOGRA LETTER PA
1181F  DOGRA LETTER PHA
11820  DOGRA LETTER BA
11821  DOGRA LETTER BHA
11822  DOGRA LETTER MA
11823  DOGRA LETTER YA
11824  DOGRA LETTER RA
11825  DOGRA LETTER LA
11826  DOGRA LETTER VA
11827  DOGRA LETTER SHA
11828  DOGRA LETTER SSA
11829  DOGRA LETTER SA
1182A  DOGRA LETTER HA
1182B  DOGRA LETTER RRA

Dependent vowel signs
1182C  DOGRA VOWEL SIGN AA
1182D  DOGRA VOWEL SIGN I
1182E  DOGRA VOWEL SIGN II
1182F  DOGRA VOWEL SIGN U
11830  DOGRA VOWEL SIGN UU
11831  DOGRA VOWEL SIGN VOCALIC R
11832  DOGRA VOWEL SIGN E
11833  DOGRA VOWEL SIGN AI
11834  DOGRA VOWEL SIGN O
11835  DOGRA VOWEL SIGN AU

Various signs
11836  DOGRA SIGN ANUSVARA
11837  DOGRA SIGN VISARGA
11838  DOGRA SIGN VIRAMA
11839  DOGRA SIGN NUKTA

Punctuation
1183A  DOGRA ABBREVIATION SIGN
Figure 1: Comparison of Old Dogra with Takri and other scripts (from Leitner 1882: Set 2).

Figure 2: Comparison of Old Dogra with Takri and other scripts (from Leitner 1882: Set 2).

Figure 3: Specimen of the Dogri language written in Old Dogra (from Grierson 1916: 760).

Figure 4: Another specimen of the Dogri language written in Old Dogra (from Grierson 1916: 772).

Figure 5: Chart of the Old Dogra script (from Grierson 1916: 641).

Figure 6: Comparison of Gurmukhi, Kangra, and Old Dogra scripts (from Grierson 1916: 642).

Figure 7: Comparison of New Dogra (‘Dogri’) and Devanagari vowel and consonant letters (from Staal 1984: 33).

Figure 8: Comparison of New Dogra (‘Dogri’) and Devanagari vowel signs (from Staal 1984: 34).

Figure 9: The first page of a Dogri translation of the Līlavatī by Bhāskarācārya, a Sanskrit treatise on mathematics, printed in the New Dogra script in 1872 (p. 1). Figure 10 contains the second page of this book.

Figure 10: Another page from the Līlavatī in New Dogra (p. 2).

Figure 11: Page from the Līlavatī showing the use of printed New Dogra digits (p. 3).

Figure 12: Usage of fractions for denoting currency in New Dogra (from Staal 1984: 75). The text appears to be typeset using metal fonts. The fraction signs and currency marks may be represented using characters from the ‘Common Indic Number Forms’ block (U+A830) in Unicode.
Figure 13: Two specimens of a trilingual stamp for expedited mail from Jammu State in Old Dogra, Persian, and Devanagari scripts. The Dogra text reads kakal jarurī (proposed code points 1180A 1180A 11825, 11811 11824 1182F 11824 1182E). The Persian reads خط ضروری ḵẖat zarūrī; the Devanagari reads आवश्यक पत्र āvaśyaka patra.

Figure 14: Four bilingual regular postage stamps from Jammu State with text in the Old Dogra and Persian scripts. The set contains half ‘anna’ stamps and a one ‘anna’ stamp. In all stamps the Dogra text at the top reads jamu ‘Jammu’ (11811 11822 11804) and kachamīr (= kaśmīr) ‘Kashmir’ (1180A 11810 11822 1182E 11824). In the half anna stamps the first line of the center is Persian, ۱۹۲۳ نم انه 1923 [samvat] nim ānah ‘half ānnā’; the second line of the center is Dogra, ada ana ‘half ānnā’ (11800 1181B 11800 1181D). In the one anna stamp the first line of the center text is Persian, یك انه yak ānah ‘one ānnā’; the second line of the center is Dogra, ekana ‘one ānnā’ (11806 1180A 11800 1181D). In all four stamps the third line of the center is the date 1923 [samvat] written in Devanagari-like digits, १९२३. The Persian text at the bottom of the stamp reads قلمرو سرکار جمون و کشمیر qalamrau sarkār jammūn o kaśmīr ‘dominions of the ruler of Jammu and Kashmir’.
Detail of top left (Devanagari)

Detail of top right (New Dogra)

Detail of center (New Dogra, Devanagari, and Urdu)

Figure 15: The mast of a two rupee non-judicial stamp paper from Kashmir State.

Figure 16: The mast of a five rupee non-judicial stamp paper from Kashmir State containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.

Figure 17: A one anna stamp from Jammu and Kashmir State with text in New Dogra, Devanagari, and Arabic (Urdu).

Figure 18: Telegraph stamps from Jammu and Kashmir State with text in Dogra, Devanagari, Urdu, and English. The Dogra denominations show usage of Common Indic fractions and the rupee mark.

postakarda kalamrau jammu kasmir va tibbataha: “Postcard of the realm of Jammu, Kashmir, and Tibet”

isa tarapha siva pate ke kucha na likho: “Do not write anything on this side except for the address”

jamu, sinagar: “Jammu”, “Srinagar”

bhaisakh (4), 4 vaisakh: “Vaisakh 4”, “4 Vaisakh”

Figure 19: A state postcard from Jammu and Kashmir with post mark of 19 April 1891. The inscription at top is in New Dogra and the content is in Persian. The postmarks are in English and Dogra.

One rupee banknote

Twenty rupee banknote

Figure 20: Bank notes from Jammu and Kashmir State containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.
Vowel letter   Chamba    Dogra
A              U+11680   U+11800
AA             U+11681   U+11801
I              U+11682   U+11802
II             U+11683   (none)
U              U+11684   U+11804
UU             U+11685   (none)
E              U+11686   U+11806
AI             U+11687   U+11807
O              U+11688   U+11808
AU             U+11689   (none)

Vowel sign     Chamba    Dogra
A              no dependent form
AA             U+116AD   U+1182C
I              U+116AE   U+1182D
II             U+116AF   U+1182E
U              U+116B0   U+1182F
UU             U+116B1   U+11830
E              U+116B2   U+11832
AI             U+116B3   U+11833
O              U+116B4   U+11834
AU             U+116B5   U+11835

Table 2: Comparison of Chamba and Dogra vowel letters and signs
Consonant   Chamba    Dogra
KA          U+1168A   U+1180A
KHA         U+1168B   U+1180B
GA          U+1168C   U+1180C
GHA         U+1168D   U+1180D
NGA         U+1168E   U+1180E
CA          U+1168F   U+1180F
CHA         U+11690   U+11810
JA          U+11691   U+11811
JHA         U+11692   U+11812
NYA         U+11693   U+11813
TTA         U+11694   U+11814
TTHA        U+11695   U+11815
DDA         U+11696   U+11816
DDHA        U+11697   U+11817
NNA         U+11698   U+11818
TA          U+11699   U+11819
THA         U+1169A   U+1181A
DA          U+1169B   U+1181B
DHA         U+1169C   U+1181C
NA          U+1169D   U+1181D
PA          U+1169E   U+1181E
PHA         U+1169F   U+1181F
BA          U+116A0   U+11820
BHA         U+116A1   U+11821
MA          U+116A2   U+11822
YA          U+116A3   U+11823
RA          U+116A4   U+11824
LA          U+116A5   U+11825
VA          U+116A6   U+11826
SHA         U+116A7   U+11827
SSA         (none)    U+11828
SA          U+116A8   U+11829
HA          U+116A9   U+1182A
RRA         U+116AA   U+1182B

Table 3: Comparison of Chamba and Dogra consonant letters
ISO/IEC JTC 1/SC 2/WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646

Please fill all the sections A, B and C below. Please read Principles and Procedures Document (P & P) from http://std.dkuug.dk/JTC1/SC2/WG2/docs/principles.html for guidelines and details before filling this form. Please ensure you are using the latest Form from http://std.dkuug.dk/JTC1/SC2/WG2/docs/summaryform.html. See also http://std.dkuug.dk/JTC1/SC2/WG2/docs/roadmaps.html for latest Roadmaps.
A. Administrative

1. Title: Proposal to encode the Dogra script in Unicode
2. Requester's name: Anshuman Pandey, Script Encoding Initiative
3. Requester type (Member body/Liaison/Individual contribution): Liaison contribution
4. Submission date: 4 November 2015
5. Requester's reference (if applicable):
6. Choose one of the following:
   This is a complete proposal: Yes
   (or) More information will be provided later:
B. Technical – General

1. Choose one of the following:
   a. This proposal is for a new script (set of characters): Yes
      Proposed name of script: Dogra
   b. The proposal is for addition of character(s) to an existing block:
      Name of the existing block:
2. Number of characters in proposal: 59
3. Proposed category (select one from below; see section 2.2 of P&P document): A-Contemporary; B.1-Specialized (small collection); B.2-Specialized (large collection); C-Major extinct; D-Attested extinct; E-Minor extinct X; F-Archaic Hieroglyphic or Ideographic; G-Obscure or questionable usage symbols
4. Is a repertoire including character names provided? Yes
   a. If YES, are the names in accordance with the “character naming guidelines” in Annex L of P&P document?
   b. Are the character shapes attached in a legible form suitable for review?
5. Fonts related:
   a. Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the standard? Anshuman Pandey
   b. Identify the party granting a license for use of the font by the editors (include address, e-mail, ftp-site, etc.): Anshuman Pandey
6. References:
   a. Are references (to other character sets, dictionaries, descriptive texts etc.) provided? Yes
   b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? Yes
7. Special encoding issues: Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration etc. (if yes please enclose information)? Yes
8. Additional Information: Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org for such information on other scripts. Also see Unicode Character Database (http://www.unicode.org/reports/tr44/) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard.
Form number: N4502-F (Original 1994-10-14; Revised 1995-01, 1995-04, 1996-04, 1996-08, 1999-03, 2001-05, 2001-09, 2003-11, 2005-01, 2005-09, 2005-10, 2007-03, 2008-05, 2009-11, 2011-03, 2012-01)
C. Technical - Justification

1. Has this proposal for addition of character(s) been submitted before? No
   If YES, explain:
2. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? Yes
   If YES, with whom? Christopher Shackle (SOAS, London)
   If YES, available relevant documents:
3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? No
   Reference: Historical script; currently not used
4. The context of use for the proposed characters (type of use; common or rare): Common
   Reference:
5. Are the proposed characters in current use by the user community? No
   If YES, where? Reference:
6. After giving due considerations to the principles in the P&P document must the proposed characters be entirely in the BMP? N/A
   If YES, is a rationale provided? If YES, reference:
7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? Yes
8. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? No
   If YES, is a rationale for its inclusion provided? If YES, reference:
9. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? No
   If YES, is a rationale for its inclusion provided? If YES, reference:
10. Can any of the proposed character(s) be considered to be similar (in appearance or function) to, or could be confused with, an existing character? No
    If YES, is a rationale for its inclusion provided? If YES, reference:
11. Does the proposal include use of combining characters and/or use of composite sequences? Yes
    If YES, is a rationale for such use provided? Yes
    If YES, reference: See text of proposal
    Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided? If YES, reference:
12. Does the proposal contain characters with any special properties such as control function or similar semantics? Yes
    If YES, describe in detail (include attachment if necessary): Virama; see text of proposal
13. Does the proposal contain any Ideographic compatibility characters? No
    If YES, are the equivalent corresponding unified ideographic characters identified? If YES, reference:
not strictly correlate with values of Old Dogra vowels, the proposed letters can be used for representing the old script. U+11802 can be used for u, U+11803 can be used for i, and U+11804 can be used for o and au; the height and curvature of the left stroke may be considered a stylistic attribute.
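3.6 Vowel Signs

There are 10 dependent vowel signs:

U+1182C DOGRA VOWEL SIGN AA
U+1182D DOGRA VOWEL SIGN I
U+1182E DOGRA VOWEL SIGN II
U+1182F DOGRA VOWEL SIGN U
U+11830 DOGRA VOWEL SIGN UU
U+11831 DOGRA VOWEL SIGN VOCALIC R
U+11832 DOGRA VOWEL SIGN E
U+11833 DOGRA VOWEL SIGN AI
U+11834 DOGRA VOWEL SIGN O
U+11835 DOGRA VOWEL SIGN AU

U+11831 DOGRA VOWEL SIGN VOCALIC R appears to have been introduced as part of New Dogra. It is used for representing words of Sanskrit origin that contain the vocalic sound r. Its shape is derived from either U+11190 or U+0943. There is no corresponding independent vowel letter.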
There is an alphabetic aspect to the representation of consonant-vowel syllables in written Dogra. The dependent vowel may be represented using the independent vowel letter as well as the vowel sign, e.g. <U+1180A KA, U+11806 LETTER E> may be used in place of <U+1180A KA, U+11832 VOWEL SIGN E> for the syllable ke.

There are differences in the shape of certain vowel letters between the old and new script. The typical Takri forms of the vowel signs -u and -ū, which also resemble corresponding forms in Gurmukhi, were replaced with forms resembling Devanagari signs, U+1182F DOGRA VOWEL SIGN U and U+11830 DOGRA VOWEL SIGN UU, respectively. The typical Takri style of the vowel sign o was applied for writing U+11835 DOGRA VOWEL SIGN AU, while a simplified form of o, U+11834 DOGRA VOWEL SIGN O, was used for o. The duplication of o for writing au was eliminated. A comparison of vowel letters in the old and new scripts, as they align with the proposed characters, is as follows:

Old        New
[glyph]    U+1182D DOGRA VOWEL SIGN I
[glyph]    U+1182E DOGRA VOWEL SIGN II
[glyph]    U+1182F DOGRA VOWEL SIGN U
[glyph]    U+11830 DOGRA VOWEL SIGN UU
[glyph]    U+11834 DOGRA VOWEL SIGN O
[glyph]    U+11835 DOGRA VOWEL SIGN AU
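3.7 Consonants

There are 34 consonant letters:

U+1180A KA     U+1180B KHA    U+1180C GA     U+1180D GHA    U+1180E NGA
U+1180F CA     U+11810 CHA    U+11811 JA     U+11812 JHA    U+11813 NYA
U+11814 TTA    U+11815 TTHA   U+11816 DDA    U+11817 DDHA   U+11818 NNA
U+11819 TA     U+1181A THA    U+1181B DA     U+1181C DHA    U+1181D NA
U+1181E PA     U+1181F PHA    U+11820 BA     U+11821 BHA    U+11822 MA
U+11823 YA     U+11824 RA     U+11825 LA     U+11826 VA
U+11827 SHA    U+11828 SSA    U+11829 SA     U+1182A HA
U+1182B RRA

U+11828 DOGRA LETTER SSA was introduced into New Dogra for representing the retroflex sibilant ṣa, which occurs in words of Sanskrit origin. It resembles U+111B0 SHARADA LETTER SSA.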
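There are glyphic variants of several consonant letters:

Regular                      Variant
U+1180C DOGRA LETTER GA      [glyph]
U+1180D DOGRA LETTER GHA     [glyph]
U+11810 DOGRA LETTER CHA     [glyph]
U+11812 DOGRA LETTER JHA     [glyph]
U+1181A DOGRA LETTER THA     [glyph]
U+1181C DOGRA LETTER DHA     [glyph]
U+11824 DOGRA LETTER RA      [glyph]

There is evidence of usage of different forms of the same letter in the available sources. For instance, both U+11810 DOGRA LETTER CHA and its variant form are used for cha.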
3.8 Anusvara

U+11836 DOGRA SIGN ANUSVARA is used for marking nasalization.

3.9 Visarga

U+11837 DOGRA SIGN VISARGA is used for indicating post-vocalic aspiration in words of Sanskrit origin. It occurs in the word namaḥ “homage to”, <U+1181D NA, U+11822 MA, U+11837 VISARGA>.

3.10 Nukta

U+11839 DOGRA SIGN NUKTA is used for representing sounds that are not native to Dogri and related languages.

3.11 Virama

U+11838 DOGRA SIGN VIRAMA has two functions, similar to the corresponding character of Devanagari. It is used as a halanta for marking the absence of the inherent vowel of a consonant letter. It is also a control character that is used for producing conjuncts.
3.12 Consonant Conjuncts

Consonant clusters are rendered in three ways: 1) with visible halanta beneath the full form of each bare consonant, 2) with a half-form of the initial consonant followed by the full form of the following letter, and 3) as ligatures. There are no formal conventions for the written expression of conjuncts. U+11838 DOGRA SIGN VIRAMA is used for the encoded representation of conjuncts.

3.12.1 Types of conjuncts

Some representations of Dogra conjuncts as found in the available sources are illustrated below.

• Visible halanta: In both Old and New Dogra, the most common method of representing consonant clusters is by placing a halanta beneath each bare consonant.
• Half-forms: Another method is to use half-forms of letters if the graphical structure provides such an opportunity.

The above shows the conjunct śra in the word śrī (a Sanskrit honorific). The half form is produced by dropping the right vertical bar of U+11827 DOGRA LETTER SHA. The conjunct śra is used consistently in New Dogra for representing the cluster śra; the form <U+11827 SHA, U+11838 VIRAMA, U+11824 RA> with visible halanta beneath a full form of U+11827 DOGRA LETTER SHA does not occur in the available sources.

• Ligatures: There are two types of ligature. In the first, the shapes of individual letters are visible. In the below, the boxed text contains the conjunct sṭa from the word sṭāṃp, a transcription of the English “stamp”.

The second type is an atomic (akhaṇḍa) ligature, whose form may be considered a distinctive letter. Such a ligature occurs in the excerpt below.

The boxed text above contains the atomic ligature kṣa, which occurs in the word akṣaroṃ “in letters” (oblique form of akṣara “letter”). The conjunct kṣa is used consistently in New Dogra for representing the cluster kṣa. Its shape may be influenced by Devanagari kṣa.

• Forms of RA: Special conjunct forms of U+11824 DOGRA LETTER RA, e.g. repha or vattu, do not generally occur. Cluster-initial ra is represented as a full form with halanta, as in rṇa in the word saṃpūrṇa below, and non-initial ra is treated similarly, as in pra in the word prabhu below.

However, there is one instance in the available sources in which cluster-initial ra is represented as a repha. The occurrence is curious because the word caturbhuja “square”, in which repha occurs, appears three times in proximity but is spelled differently in all three instances.

In the boxed text at the top, the ra in caturbhuja is written with halanta: <CA, TA, VOWEL SIGN U, RA, VIRAMA, BHA, VOWEL SIGN U, JA> caturbhuja. In the text at bottom left, ra occurs with its inherent vowel intact, without halanta: <CA, TA, VOWEL SIGN U, RA, BHA, VOWEL SIGN U, JA> caturabhuja. At bottom right, ra is represented as repha: caturbhuja.

The usage of repha in Dogra is to be considered anomalous. There is no present need to support this form of ra in encoded representation.
3.12.2 Encoded representation of conjuncts

The Devanagari model for the representation of conjuncts is appropriate for Dogra. In encoded text, conjuncts are produced by placing the control character U+11838 DOGRA SIGN VIRAMA after each non-final consonant in a cluster, i.e. <(C, VIRAMA), C>.

Representations of conjuncts rendered as atomic ligatures are:

kṣa   <U+1180A KA, U+11838 VIRAMA, U+11828 SSA>
sṭa   <U+11829 SA, U+11838 VIRAMA, U+11814 TTA>

Examples of conjuncts rendered using half-forms are:

śra   <U+11827 SHA, U+11838 VIRAMA, U+11824 RA>

Conjuncts displayed using visible halanta are:

pra   <U+1181E PA, U+11838 VIRAMA, U+11824 RA>
rṇa   <U+11824 RA, U+11838 VIRAMA, U+11818 NNA>
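The encoding rule above (a virama between consecutive consonants of a cluster) can be sketched in a few lines of Python. This is an illustration only: the function names `encode_conjunct` and `as_hex` and the `DOGRA` table are hypothetical, and the code points are the ones assigned in this proposal, which may differ from the values in the published standard.

```python
# Code points as assigned in this proposal (assumption; published values may differ).
VIRAMA = 0x11838

# A few consonants from the proposed repertoire, keyed by the name suffix.
DOGRA = {
    "KA": 0x1180A, "TTA": 0x11814, "NNA": 0x11818, "PA": 0x1181E,
    "RA": 0x11824, "SHA": 0x11827, "SSA": 0x11828, "SA": 0x11829,
}


def encode_conjunct(*consonants):
    """Return the code point sequence for a cluster.

    VIRAMA is placed after every consonant except the last one,
    following the Devanagari model described in the text.
    """
    if not consonants:
        raise ValueError("a cluster needs at least one consonant")
    sequence = []
    for index, name in enumerate(consonants):
        sequence.append(DOGRA[name])
        if index < len(consonants) - 1:
            sequence.append(VIRAMA)
    return sequence


def as_hex(sequence):
    """Format a code point sequence the way the proposal lists it."""
    return " ".join(f"{cp:04X}" for cp in sequence)


print(as_hex(encode_conjunct("KA", "SSA")))   # kṣa
print(as_hex(encode_conjunct("SA", "TTA")))   # sṭa
print(as_hex(encode_conjunct("SHA", "RA")))   # śra
```

Whether a given sequence is displayed as a ligature, a half-form, or with visible halanta is left to the font; the encoded sequence is the same in all three cases.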
3.12.3 Modifying conjunct formation

As is the case in Devanagari and other Indic scripts, a ligature may be broken using the generic control character U+200C ZERO WIDTH NON-JOINER (ZWNJ):

kṣa (ligature)          <U+1180A KA, U+11838 VIRAMA, U+11828 SSA>
kṣa (visible halanta)   <U+1180A KA, U+11838 VIRAMA, U+200C ZWNJ, U+11828 SSA>

The rendering of a conjunct as a ligature or with visible halanta is a matter of font design.
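As a sketch of how a ligature could be broken in existing text, the function below inserts ZWNJ after each virama that is directly followed by a consonant. The name `force_visible_halanta` is hypothetical, and the virama value and consonant range are those of this proposal (an assumption; published code points may differ).

```python
ZWNJ = 0x200C                        # ZERO WIDTH NON-JOINER
VIRAMA = 0x11838                     # as assigned in this proposal (assumption)
CONSONANTS = range(0x1180A, 0x1182C) # DOGRA LETTER KA..RRA in this proposal


def force_visible_halanta(code_points):
    """Insert ZWNJ after every VIRAMA that directly precedes a consonant."""
    result = []
    for index, cp in enumerate(code_points):
        result.append(cp)
        has_next = index + 1 < len(code_points)
        if cp == VIRAMA and has_next and code_points[index + 1] in CONSONANTS:
            result.append(ZWNJ)
    return result


ksa = [0x1180A, VIRAMA, 0x11828]
print([f"{cp:04X}" for cp in force_visible_halanta(ksa)])
```

Applying the function twice leaves the sequence unchanged, because a virama already followed by ZWNJ is no longer directly followed by a consonant.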
3.13 Punctuation

Abbreviation sign: U+1183A DOGRA ABBREVIATION SIGN is used for indicating truncation of text. In the excerpt below, the highlighted text <LA, VOWEL SIGN II, ABBREVIATION SIGN> <VA, ABBREVIATION SIGN> <BHA, VOWEL SIGN AA, ABBREVIATION SIGN> (lī va bhā 23) is an abbreviation for līla vatī bhāga 23.

Danda-s: The daṇḍā and double daṇḍā are commonly used in Dogra documents for marking the ends of sentences and paragraphs. The below excerpt shows usage of the daṇḍā

and the following shows use of the double daṇḍā

These daṇḍā-s are not proposed for encoding as separate characters in the ‘Dogra’ block. They resemble U+0964 DEVANAGARI DANDA and U+0965 DEVANAGARI DOUBLE DANDA and may be represented using these characters. The Devanagari daṇḍā-s are to be specified as extensions for Dogra.

Other signs: Other marks of punctuation, such as dashes, are also used.

A hyphen is used in some cases for indicating breaks in a word at end of line (see section 3.16).
3.14 Digits

Digits are commonly used in Dogra documents (see figure 11). The style of digits varies across written and printed sources. Some digits in Old Dogra sources resemble those of Takri (see figure 4), while those in New Dogra sources are similar to Devanagari. The excerpt below shows digits 1–9 and the number 10 as represented in the Līlavatī, written in New Dogra:

The majority of the digits shown here resemble Devanagari forms; the exception is ‘four’. The form of Dogra ‘four’ does not resemble U+116C4 TAKRI DIGIT FOUR, which is nearly identical to the Devanagari digit four. Information about its origins is not known at this time, but it may be related to U+111D4 SHARADA DIGIT FOUR. The forms of ‘eight’ and ‘nine’ are well-known variant forms of the regular Devanagari digits.

Additional research into Dogra digits is required before they may be considered for encoding. The available sources suggest that there may have been a cultural understanding that Dogra possessed unique digits. For instance, on the bank notes shown in figure 20, the digits written in Devanagari are rather similar, but are nonetheless displayed twice in order to align the display of the currency value with each script used for the inscriptions.

At present, script-specific digits are not proposed for inclusion in the Dogra repertoire. Devanagari digits should be used normatively and should be extended for usage with Dogra. If future research yields distinctive digits for Dogra, there is sufficient space in the block to accommodate their encoding.
3.15 Number forms

Fraction signs and currency marks are commonly used in Dogra sources (see figure 12). These may be represented using the fraction and currency signs already encoded in the ‘Common Indic Number Forms’ block (U+A830). These characters are specified as extensions for Dogra. The ‘Common Indic Number Forms’ are described briefly in The Unicode Standard (chapter 22.3, ‘Numerals’, pp. 770–771) and in more detail in the original encoding proposal (L2/07-354).

3.16 Linebreaking

Linebreaking occurs after an orthographic syllable. Breaks are generally not indicated in informal sources, but are marked in literary materials using hyphens.
3.17 Collation

The primary collating order for Dogra is as follows (characters U+11800 through U+11835, in code point order):

A < AA < I < II < U < UU < E < AI < O < AU <
KA < KHA < GA < GHA < NGA < CA < CHA < JA <
JHA < NYA < TTA < TTHA < DDA < DDHA < NNA < TA <
THA < DA < DHA < NA < PA < PHA < BA < BHA <
MA < YA < RA < LA < VA < SHA < SSA < SA < HA <
RRA < VOWEL SIGN AA < VOWEL SIGN I < VOWEL SIGN II <
VOWEL SIGN U < VOWEL SIGN UU < VOWEL SIGN VOCALIC R < VOWEL SIGN E <
VOWEL SIGN AI < VOWEL SIGN O < VOWEL SIGN AU

The following characters have secondary weights: U+11836 ANUSVARA, U+11837 VISARGA, U+11838 VIRAMA, U+11839 NUKTA.
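A two-level sort key following this order might look like the sketch below. It is a simplification for illustration, not the Unicode Collation Algorithm: the name `sort_key` is hypothetical, words are lists of code points, and the code point ranges are those of this proposal (an assumption; published values may differ).

```python
# Ranges as assigned in this proposal (assumption).
PRIMARY = range(0x11800, 0x11836)    # letters and vowel signs, in collating order
SECONDARY = range(0x11836, 0x1183A)  # anusvara, visarga, virama, nukta


def sort_key(code_points):
    """Build a (primary, secondary) key for a word given as code points.

    Primary weights follow code point order within U+11800..U+11835.
    Characters with secondary weights are ignored at the primary level
    and only break ties between otherwise equal words.
    """
    primary = tuple(cp - 0x11800 + 1 for cp in code_points if cp in PRIMARY)
    secondary = tuple(
        (cp - 0x11836 + 1) if cp in SECONDARY else 0
        for cp in code_points
        if cp in PRIMARY or cp in SECONDARY
    )
    return (primary, secondary)


KA, KHA, SIGN_AA, ANUSVARA = 0x1180A, 0x1180B, 0x1182C, 0x11836
words = [[KHA], [KA, SIGN_AA], [KA, ANUSVARA], [KA]]
for word in sorted(words, key=sort_key):
    print(" ".join(f"{cp:04X}" for cp in word))
```

A production implementation would instead express these weights as a tailoring for a collation library; the sketch only shows how the two levels interact.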
4 Character data
4.1 Character properties

The properties for Dogra in the Unicode Character Database format are:

11800;DOGRA LETTER A;Lo;0;L;;;;;N;;;;;
11801;DOGRA LETTER AA;Lo;0;L;;;;;N;;;;;
11802;DOGRA LETTER I;Lo;0;L;;;;;N;;;;;
11804;DOGRA LETTER U;Lo;0;L;;;;;N;;;;;
11806;DOGRA LETTER E;Lo;0;L;;;;;N;;;;;
11807;DOGRA LETTER AI;Lo;0;L;;;;;N;;;;;
11808;DOGRA LETTER O;Lo;0;L;;;;;N;;;;;
1180A;DOGRA LETTER KA;Lo;0;L;;;;;N;;;;;
1180B;DOGRA LETTER KHA;Lo;0;L;;;;;N;;;;;
1180C;DOGRA LETTER GA;Lo;0;L;;;;;N;;;;;
1180D;DOGRA LETTER GHA;Lo;0;L;;;;;N;;;;;
1180E;DOGRA LETTER NGA;Lo;0;L;;;;;N;;;;;
1180F;DOGRA LETTER CA;Lo;0;L;;;;;N;;;;;
11810;DOGRA LETTER CHA;Lo;0;L;;;;;N;;;;;
11811;DOGRA LETTER JA;Lo;0;L;;;;;N;;;;;
11812;DOGRA LETTER JHA;Lo;0;L;;;;;N;;;;;
11813;DOGRA LETTER NYA;Lo;0;L;;;;;N;;;;;
11814;DOGRA LETTER TTA;Lo;0;L;;;;;N;;;;;
11815;DOGRA LETTER TTHA;Lo;0;L;;;;;N;;;;;
11816;DOGRA LETTER DDA;Lo;0;L;;;;;N;;;;;
11817;DOGRA LETTER DDHA;Lo;0;L;;;;;N;;;;;
11818;DOGRA LETTER NNA;Lo;0;L;;;;;N;;;;;
11819;DOGRA LETTER TA;Lo;0;L;;;;;N;;;;;
1181A;DOGRA LETTER THA;Lo;0;L;;;;;N;;;;;
1181B;DOGRA LETTER DA;Lo;0;L;;;;;N;;;;;
1181C;DOGRA LETTER DHA;Lo;0;L;;;;;N;;;;;
1181D;DOGRA LETTER NA;Lo;0;L;;;;;N;;;;;
1181E;DOGRA LETTER PA;Lo;0;L;;;;;N;;;;;
1181F;DOGRA LETTER PHA;Lo;0;L;;;;;N;;;;;
11820;DOGRA LETTER BA;Lo;0;L;;;;;N;;;;;
11821;DOGRA LETTER BHA;Lo;0;L;;;;;N;;;;;
11822;DOGRA LETTER MA;Lo;0;L;;;;;N;;;;;
11823;DOGRA LETTER YA;Lo;0;L;;;;;N;;;;;
11824;DOGRA LETTER RA;Lo;0;L;;;;;N;;;;;
11825;DOGRA LETTER LA;Lo;0;L;;;;;N;;;;;
11826;DOGRA LETTER VA;Lo;0;L;;;;;N;;;;;
11827;DOGRA LETTER SHA;Lo;0;L;;;;;N;;;;;
11828;DOGRA LETTER SSA;Lo;0;L;;;;;N;;;;;
11829;DOGRA LETTER SA;Lo;0;L;;;;;N;;;;;
1182A;DOGRA LETTER HA;Lo;0;L;;;;;N;;;;;
1182B;DOGRA LETTER RRA;Lo;0;L;;;;;N;;;;;
1182C;DOGRA VOWEL SIGN AA;Mc;0;L;;;;;N;;;;;
1182D;DOGRA VOWEL SIGN I;Mc;0;L;;;;;N;;;;;
1182E;DOGRA VOWEL SIGN II;Mc;0;L;;;;;N;;;;;
1182F;DOGRA VOWEL SIGN U;Mn;0;NSM;;;;;N;;;;;
11830;DOGRA VOWEL SIGN UU;Mn;0;NSM;;;;;N;;;;;
11831;DOGRA VOWEL SIGN VOCALIC R;Mn;0;NSM;;;;;N;;;;;
11832;DOGRA VOWEL SIGN E;Mn;0;NSM;;;;;N;;;;;
11833;DOGRA VOWEL SIGN AI;Mn;0;NSM;;;;;N;;;;;
11834;DOGRA VOWEL SIGN O;Mn;0;NSM;;;;;N;;;;;
11835;DOGRA VOWEL SIGN AU;Mn;0;NSM;;;;;N;;;;;
11836;DOGRA SIGN ANUSVARA;Mn;0;NSM;;;;;N;;;;;
11837;DOGRA SIGN VISARGA;Mc;0;L;;;;;N;;;;;
11838;DOGRA SIGN VIRAMA;Mn;9;NSM;;;;;N;;;;;
11839;DOGRA SIGN NUKTA;Mn;7;NSM;;;;;N;;;;;
1183A;DOGRA ABBREVIATION SIGN;Po;0;L;;;;;N;;;;;
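Records of this kind can be read with a short parser. The sketch below assumes the standard semicolon-separated, 15-field layout of UnicodeData.txt; the names `UcdRecord` and `parse_unicode_data_line` are hypothetical, and the sample lines restate a few of the proposed entries.

```python
from typing import NamedTuple


class UcdRecord(NamedTuple):
    code_point: int
    name: str
    general_category: str
    combining_class: int
    bidi_class: str
    mirrored: bool


def parse_unicode_data_line(line):
    """Parse one UnicodeData.txt-style record (15 semicolon-separated fields)."""
    fields = line.strip().split(";")
    if len(fields) != 15:
        raise ValueError(f"expected 15 fields, got {len(fields)}")
    return UcdRecord(
        code_point=int(fields[0], 16),
        name=fields[1],
        general_category=fields[2],
        combining_class=int(fields[3]),
        bidi_class=fields[4],
        mirrored=fields[9] == "Y",
    )


# Sample entries restated from the proposal's character properties.
SAMPLE = """\
11800;DOGRA LETTER A;Lo;0;L;;;;;N;;;;;
1182F;DOGRA VOWEL SIGN U;Mn;0;NSM;;;;;N;;;;;
11838;DOGRA SIGN VIRAMA;Mn;9;NSM;;;;;N;;;;;
11839;DOGRA SIGN NUKTA;Mn;7;NSM;;;;;N;;;;;
"""

records = [parse_unicode_data_line(line) for line in SAMPLE.splitlines()]
for record in records:
    print(f"U+{record.code_point:04X} {record.name} ccc={record.combining_class}")
```

The combining classes 9 (virama) and 7 (nukta) matter for canonical ordering, which is why the parser keeps that field as an integer.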
4.2 Linebreaking

Linebreaking properties, given in the data format of LineBreak.txt:

11800..1182B;AL # Lo [44] DOGRA LETTER A..DOGRA LETTER RRA
1182C..1182E;CM # Mc  [3] DOGRA VOWEL SIGN AA..DOGRA VOWEL SIGN II
1182F..11835;CM # Mn  [7] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN AU
11836;CM        # Mn      DOGRA SIGN ANUSVARA
11837;CM        # Mc      DOGRA SIGN VISARGA
11838..11839;CM # Mn  [2] DOGRA SIGN VIRAMA..DOGRA SIGN NUKTA
1183A;AL        # Po      DOGRA ABBREVIATION SIGN
4.3 Syllabic categories

Syllabic categories, given in the format of IndicSyllabicCategory.txt:

# Indic_Syllabic_Category=Bindu
11836        ; Bindu             # Mn      DOGRA SIGN ANUSVARA

# Indic_Syllabic_Category=Visarga
11837        ; Visarga           # Mc      DOGRA SIGN VISARGA

# Indic_Syllabic_Category=Nukta
11839        ; Nukta             # Mn      DOGRA SIGN NUKTA

# Indic_Syllabic_Category=Virama
11838        ; Virama            # Mn      DOGRA SIGN VIRAMA

# Indic_Syllabic_Category=Vowel_Independent
11800..11809 ; Vowel_Independent # Lo [10] DOGRA LETTER A..DOGRA LETTER AU

# Indic_Syllabic_Category=Vowel_Dependent
1182C..1182E ; Vowel_Dependent   # Mc  [3] DOGRA VOWEL SIGN AA..DOGRA VOWEL SIGN II
1182F..11835 ; Vowel_Dependent   # Mn  [7] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN AU

# Indic_Syllabic_Category=Consonant
1180A..1182B ; Consonant         # Lo [34] DOGRA LETTER KA..DOGRA LETTER RRA
4.4 Positional categories

Positional data for characters, in the format of IndicPositionalCategory.txt:

# Indic_Positional_Category=Right
1182C        ; Right  # Mc     DOGRA VOWEL SIGN AA
1182E        ; Right  # Mc     DOGRA VOWEL SIGN II
11837        ; Right  # Mc     DOGRA SIGN VISARGA

# Indic_Positional_Category=Left
1182D        ; Left   # Mc     DOGRA VOWEL SIGN I

# Indic_Positional_Category=Top
11836        ; Top    # Mn     DOGRA SIGN ANUSVARA
11832..11835 ; Top    # Mn [4] DOGRA VOWEL SIGN E..DOGRA VOWEL SIGN AU

# Indic_Positional_Category=Bottom
1182F..11831 ; Bottom # Mn [3] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN VOCALIC R
11838        ; Bottom # Mn     DOGRA SIGN VIRAMA
11839        ; Bottom # Mn     DOGRA SIGN NUKTA
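Range-based property data such as the positional categories above can be expanded into a per-character lookup. The sketch below assumes the usual UCD layout of `code point or range ; value # comment`; the function name `parse_property_lines` is hypothetical, and the sample data restates the proposed positional categories.

```python
def parse_property_lines(text):
    """Map each code point to its property value.

    Accepts lines of the form 'XXXX ; Value # comment' or
    'XXXX..YYYY ; Value # comment'; blank and comment-only lines are skipped.
    """
    mapping = {}
    for raw_line in text.splitlines():
        data = raw_line.split("#", 1)[0].strip()  # drop trailing comment
        if not data:
            continue
        cp_field, value = (part.strip() for part in data.split(";", 1))
        first, _, last = cp_field.partition("..")
        start = int(first, 16)
        end = int(last, 16) if last else start
        for cp in range(start, end + 1):
            mapping[cp] = value
    return mapping


# Positional categories restated from the proposal (code points are the
# proposal's assignments and may differ from the published standard).
POSITIONAL = """\
# Indic_Positional_Category=Right
1182C        ; Right  # Mc     DOGRA VOWEL SIGN AA
1182E        ; Right  # Mc     DOGRA VOWEL SIGN II
11837        ; Right  # Mc     DOGRA SIGN VISARGA
1182D        ; Left   # Mc     DOGRA VOWEL SIGN I
11836        ; Top    # Mn     DOGRA SIGN ANUSVARA
11832..11835 ; Top    # Mn [4] DOGRA VOWEL SIGN E..DOGRA VOWEL SIGN AU
1182F..11831 ; Bottom # Mn [3] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN VOCALIC R
11838..11839 ; Bottom # Mn [2] DOGRA SIGN VIRAMA..DOGRA SIGN NUKTA
"""

positions = parse_property_lines(POSITIONAL)
print(positions[0x1182D])  # position of DOGRA VOWEL SIGN I
```

The same function works unchanged for the line-breaking and syllabic-category listings, since they share the range-and-value layout.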
4.5 Script extensions

The following characters should be extended for usage with the Dogra script in ScriptExtensions.txt:

0964        # Po      DEVANAGARI DANDA
0965        # Po      DEVANAGARI DOUBLE DANDA
0966..096F  # Nd [10] DEVANAGARI DIGIT ZERO..DEVANAGARI DIGIT NINE
A830..A835  # No  [6] NORTH INDIC FRACTION ONE QUARTER..NORTH INDIC FRACTION THREE SIXTEENTHS
A836..A837  # So  [2] NORTH INDIC QUARTER MARK..NORTH INDIC PLACEHOLDER MARK
A838        # Sc      NORTH INDIC RUPEE MARK
A839        # So      NORTH INDIC QUANTITY MARK
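One practical reading of these extensions is a check for which code points are expected in Dogra text: the proposed block plus the shared Devanagari and Common Indic characters. The sketch below is an illustration under that assumption; the names `usable_in_dogra` and `unexpected_code_points` are hypothetical, the block range is the one proposed here, and general-purpose characters such as spaces or ZWNJ are deliberately left out.

```python
# Proposed Dogra characters, U+11800..U+1183A (assumption: this proposal's range).
DOGRA_RANGE = range(0x11800, 0x1183B)

# Characters listed above as script extensions for Dogra.
EXTENSION_RANGES = (
    range(0x0964, 0x0966),  # DEVANAGARI DANDA, DOUBLE DANDA
    range(0x0966, 0x0970),  # DEVANAGARI DIGIT ZERO..NINE
    range(0xA830, 0xA83A),  # NORTH INDIC fractions and marks
)


def usable_in_dogra(cp):
    """True if the code point is a proposed Dogra character or a listed extension."""
    return cp in DOGRA_RANGE or any(cp in r for r in EXTENSION_RANGES)


def unexpected_code_points(code_points):
    """Return the code points that are neither Dogra nor a listed extension."""
    return [cp for cp in code_points if not usable_in_dogra(cp)]


sample = [0x1180A, 0x0964, 0x0967, 0xA838, 0x0915]
print([f"{cp:04X}" for cp in unexpected_code_points(sample)])
```

In the sample, only U+0915 DEVANAGARI LETTER KA is reported, since Devanagari letters are not among the listed extensions.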
13
Proposal to encode the Dogra script in Unicode Anshuman Pandey
46 lsquoConfusablersquo characters
Some Dogra characters that resemble characters encoded in other script blocks are
DOGRA LETTER U 11187 SHARADA LETTER UDOGRA LETTER UU 11188 SHARADA LETTER UUDOGRA LETTER O 11684 TAKRI LETTER UDOGRA LETTER KA 0A15 GURMUKHI LETTER KADOGRA LETTER CA 11196 SHARADA LETTER CADOGRA LETTER TTHA 1119C SHARADA LETTER TTHADOGRA LETTER TA 11699 TAKRI LETTER TADOGRA LETTER DHA 11680 TAKRI LETTER ADOGRA LETTER PHA 1169F TAKRI LETTER PHADOGRA LETTER BHA 116A1 TAKRI LETTER BHADOGRA LETTER YA 116A3 TAKRI LETTER YADOGRA LETTER SHA 0936 DEVANAGARI LETTER SHADOGRA LETTER SSA 111B0 SHARADA LETTER SSADOGRA VOWEL SIGN U 0941 DEVANAGARI VOWEL SIGN UDOGRA VOWEL SIGN UU 0942 DEVANAGARI VOWEL SIGN UUDOGRA VOWEL SIGN E 116B2 TAKRI VOWEL SIGN EDOGRA VOWEL SIGN AI 116B3 TAKRI VOWEL SIGN AIDOGRA VOWEL SIGN O 116B4 TAKRI VOWEL SIGN ODOGRA VOWEL SIGN AU 116B5 TAKRI VOWEL SIGN AUDOGRA SIGN ANUSVARA 0902 DEVANAGARI SIGN ANUSVARADOGRA SIGN VIRAMA 094D DEVANAGARI SIGN VIRAMADOGRA SIGN NUKTA 093C DEVANAGARI SIGN NUKTADOGRA ABBREVIATION SIGN 0970 DEVANAGARI ABBREVIATION SIGN
5 References
Bhāskarācārya. 1872. Līlavatī. Translation into Dogri. Jammu: Vidya Vilas Press.
Grierson, George A. 1916. The Linguistic Survey of India. Vol. IX. Indo-Aryan Family. Central Group. Part I. Specimens of Western Hindī and Pañjābī. Calcutta: Office of the Superintendent of Government Printing, India.
Leitner, Gottlieb William. 1882. "A Collection of Specimens of Commercial and Other Alphabets and Handwritings as also of Multiplication Tables Current in Various Parts of the Panjab, Sind and the North West Provinces". Published as Appendix V of History of Indigenous Education in the Punjab. Lahore: Anjuman-i-Punjab Press.
Pandey, Anshuman. 2007. "Proposal to Encode North Indic Number Forms in ISO/IEC 10646" (L2/07-354). http://www.unicode.org/L2/L2007/07354-north-indic.pdf
Pandey, Anshuman. 2009. "Proposal to Encode the Takri Script in ISO/IEC 10646" (L2/09-424). http://www.unicode.org/L2/L2009/09424-takri.pdf
Pandey, Anshuman. 2015. "Preliminary Proposal to Encode the Dogra Script in Unicode" (L2/15-213). http://www.unicode.org/L2/L2015/15213-dogra.pdf
Staal, Fritz. 1984. Stamps of Jammu & Kashmir. New York: The Collectors Club.
von der Lin, Carol A. "Collection Kashmir". "Philatelic Glossary: Dōgrī-Takrī Scripts". httpwwwkashmirstampscomDogriGlosshtml
6 Acknowledgments
I would like to thank Christopher Shackle (School of Oriental and African Studies, London) for reviewing this proposal.
This project was made possible in part through a Google Research Award, granted to Deborah Anderson for the Script Encoding Initiative, and a grant from the United States National Endowment for the Humanities (PR-50205-15), which funds the Universal Scripts Project (part of the Script Encoding Initiative at the University of California, Berkeley). Any views, findings, conclusions or recommendations expressed in this publication do not necessarily reflect those of Google or the National Endowment for the Humanities.
Code chart: Dogra, block range 11800..1184F, with characters proposed at 11800..1183A. Printed using UniBook (http://www.unicode.org/unibook/), 04-Nov-2015.
Names list: Dogra, 11800..1183A

Independent vowels
11800 DOGRA LETTER A
11801 DOGRA LETTER AA
11802 DOGRA LETTER I
11803 DOGRA LETTER II
11804 DOGRA LETTER U
11805 DOGRA LETTER UU
11806 DOGRA LETTER E
11807 DOGRA LETTER AI
11808 DOGRA LETTER O
11809 DOGRA LETTER AU

Consonants
1180A DOGRA LETTER KA
1180B DOGRA LETTER KHA
1180C DOGRA LETTER GA
1180D DOGRA LETTER GHA
1180E DOGRA LETTER NGA
1180F DOGRA LETTER CA
11810 DOGRA LETTER CHA
11811 DOGRA LETTER JA
11812 DOGRA LETTER JHA
11813 DOGRA LETTER NYA
11814 DOGRA LETTER TTA
11815 DOGRA LETTER TTHA
11816 DOGRA LETTER DDA
11817 DOGRA LETTER DDHA
11818 DOGRA LETTER NNA
11819 DOGRA LETTER TA
1181A DOGRA LETTER THA
1181B DOGRA LETTER DA
1181C DOGRA LETTER DHA
1181D DOGRA LETTER NA
1181E DOGRA LETTER PA
1181F DOGRA LETTER PHA
11820 DOGRA LETTER BA
11821 DOGRA LETTER BHA
11822 DOGRA LETTER MA
11823 DOGRA LETTER YA
11824 DOGRA LETTER RA
11825 DOGRA LETTER LA
11826 DOGRA LETTER VA
11827 DOGRA LETTER SHA
11828 DOGRA LETTER SSA
11829 DOGRA LETTER SA
1182A DOGRA LETTER HA
1182B DOGRA LETTER RRA

Dependent vowel signs
1182C DOGRA VOWEL SIGN AA
1182D DOGRA VOWEL SIGN I
1182E DOGRA VOWEL SIGN II
1182F DOGRA VOWEL SIGN U
11830 DOGRA VOWEL SIGN UU
11831 DOGRA VOWEL SIGN VOCALIC R
11832 DOGRA VOWEL SIGN E
11833 DOGRA VOWEL SIGN AI
11834 DOGRA VOWEL SIGN O
11835 DOGRA VOWEL SIGN AU

Various signs
11836 DOGRA SIGN ANUSVARA
11837 DOGRA SIGN VISARGA
11838 DOGRA SIGN VIRAMA
11839 DOGRA SIGN NUKTA

Punctuation
1183A DOGRA ABBREVIATION SIGN
Figure 1: Comparison of Old Dogra with Takri and other scripts (from Leitner 1882: Set 2).
Figure 2: Comparison of Old Dogra with Takri and other scripts (from Leitner 1882: Set 2).
Figure 3: Specimen of the Dogri language written in Old Dogra (from Grierson 1916: 760).
Figure 4: Another specimen of the Dogri language written in Old Dogra (from Grierson 1916: 772).
Figure 5: Chart of the Old Dogra script (from Grierson 1916: 641).
Figure 6: Comparison of Gurmukhi, Kangra, and Old Dogra scripts (from Grierson 1916: 642).
Figure 7: Comparison of New Dogra ('Dogri') and Devanagari vowel and consonant letters (from Staal 1984: 33).
Figure 8: Comparison of New Dogra ('Dogri') and Devanagari vowel signs (from Staal 1984: 34).
Figure 9: The first page of a Dogri translation of the Līlavatī by Bhāskarācārya, a Sanskrit treatise on mathematics, printed in the New Dogra script in 1872 (p. 1). Figure 10 contains the second page of this book.
Figure 10: Another page from the Līlavatī in New Dogra (p. 2).
Figure 11: Page from the Līlavatī showing the use of printed New Dogra digits (p. 3).
Figure 12: Usage of fractions for denoting currency in New Dogra (from Staal 1984: 75). The text appears to be typeset using metal fonts. The fraction signs and currency marks may be represented using characters from the 'Common Indic Number Forms' block (U+A830) in Unicode.
Figure 13: Two specimens of a trilingual stamp for expedited mail from Jammu State in Old Dogra, Persian, and Devanagari scripts. The Dogra text reads <1180A KA, 1180A KA, 11825 LA> <11811 JA, 11824 RA, 1182F VOWEL SIGN U, 11824 RA, 1182E VOWEL SIGN II> kakal jarurī. The Persian reads خط ضروری ḵẖat zarūrī; the Devanagari reads आवश्यक पत्र āvaśyaka patra.
Figure 14: Four bilingual regular postage stamps from Jammu State with text in the Old Dogra and Persian scripts. The set contains half 'anna' stamps and a one 'anna' stamp. In all stamps the Dogra text at the top reads <11811 JA, 11822 MA, 11804 U> jamu 'Jammu' and <1180A KA, 11810 CHA, 11822 MA, 1182E VOWEL SIGN II, 11824 RA> kachamīr (= kaśmīr) 'Kashmir'. On the half anna stamps the first line of the center is Persian ۱۹۲۳ نم انه 1923 [samvat] nim ānah 'half ānnā'; the second line of the center is Dogra <11800 A, 1181B DA, 11800 A, 1181D NA> ada ana 'half ānnā'. On the one anna stamp the first line of the center text is Persian یك انه yak ānah 'one ānnā'; the second line of the center is Dogra <11806 E, 1180A KA, 11800 A, 1181D NA> ekana 'one ānnā'. In all four stamps the third line of the center is the date 1923 [samvat] written in Devanagari-like digits १९२३. The Persian text at the bottom of the stamp reads قلمرو سرکار جمون و کشمیر qalamrau sarkār jammūn o kaśmīr 'dominions of the ruler of Jammu and Kashmir'.
Figure 15: The mast of a two rupee non-judicial stamp paper from Kashmir State. Details shown: top left (Devanagari); top right (New Dogra); center (New Dogra, Devanagari, and Urdu).
Figure 16: The mast of a five rupee non-judicial stamp paper from Kashmir State containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.
Figure 17: A one anna stamp from Jammu and Kashmir State with text in New Dogra, Devanagari, and Arabic (Urdu).
Figure 18: Telegraph stamps from Jammu and Kashmir State with text in Dogra, Devanagari, Urdu, and English. The Dogra denominations show usage of Common Indic fractions and the rupee mark.
Figure 19: A state postcard from Jammu and Kashmir with a postmark of 19 April 1891. The inscription at top is in New Dogra and the content is in Persian. The postmarks are in English and Dogra. The Dogra text reads:
postakarda kalamrau jammu kasmir va tibbataha: "Postcard of the realm of Jammu, Kashmir, and Tibet"
isa tarapha siva pate ke kucha na likho: "Do not write anything on this side except for the address"
jamu; sinagar: "Jammu"; "Srinagar"
bhaisakh (4); 4 vaisakh: "Vaisakh 4"; "4 Vaisakh"
Figure 20: Bank notes from Jammu and Kashmir State (a one rupee banknote and a twenty rupee banknote) containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.
Table 2: Comparison of Chamba and Dogra vowel letters and signs. Glyphs are identified by code point; Chamba forms are those of the 'Takri' block.

Vowel letter   Chamba    Dogra
A              U+11680   U+11800
AA             U+11681   U+11801
I              U+11682   U+11802
II             U+11683   (none)
U              U+11684   U+11804
UU             U+11685   (none)
E              U+11686   U+11806
AI             U+11687   U+11807
O              U+11688   U+11808
AU             U+11689   (none)

Vowel sign     Chamba    Dogra
A              no dependent form
AA             U+116AD   U+1182C
I              U+116AE   U+1182D
II             U+116AF   U+1182E
U              U+116B0   U+1182F
UU             U+116B1   U+11830
E              U+116B2   U+11832
AI             U+116B3   U+11833
O              U+116B4   U+11834
AU             U+116B5   U+11835

Table 3: Comparison of Chamba and Dogra consonant letters.

Letter   Chamba    Dogra
KA       U+1168A   U+1180A
KHA      U+1168B   U+1180B
GA       U+1168C   U+1180C
GHA      U+1168D   U+1180D
NGA      U+1168E   U+1180E
CA       U+1168F   U+1180F
CHA      U+11690   U+11810
JA       U+11691   U+11811
JHA      U+11692   U+11812
NYA      U+11693   U+11813
TTA      U+11694   U+11814
TTHA     U+11695   U+11815
DDA      U+11696   U+11816
DDHA     U+11697   U+11817
NNA      U+11698   U+11818
TA       U+11699   U+11819
THA      U+1169A   U+1181A
DA       U+1169B   U+1181B
DHA      U+1169C   U+1181C
NA       U+1169D   U+1181D
PA       U+1169E   U+1181E
PHA      U+1169F   U+1181F
BA       U+116A0   U+11820
BHA      U+116A1   U+11821
MA       U+116A2   U+11822
YA       U+116A3   U+11823
RA       U+116A4   U+11824
LA       U+116A5   U+11825
VA       U+116A6   U+11826
SHA      U+116A7   U+11827
SSA      (none)    U+11828
SA       U+116A8   U+11829
HA       U+116A9   U+1182A
RRA      U+116AA   U+1182B
ISO/IEC JTC 1/SC 2/WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646
(Form number: N4502-F. Original 1994-10-14; Revised 1995-01, 1995-04, 1996-04, 1996-08, 1999-03, 2001-05, 2001-09, 2003-11, 2005-01, 2005-09, 2005-10, 2007-03, 2008-05, 2009-11, 2011-03, 2012-01.)

Please fill all the sections A, B and C below. Please read the Principles and Procedures Document (P & P) from http://std.dkuug.dk/JTC1/SC2/WG2/docs/principles.html for guidelines and details before filling this form. Please ensure you are using the latest Form from http://std.dkuug.dk/JTC1/SC2/WG2/docs/summaryform.html. See also http://std.dkuug.dk/JTC1/SC2/WG2/docs/roadmaps.html for latest Roadmaps.

A. Administrative
1. Title: Proposal to encode the Dogra script in Unicode
2. Requester's name: Anshuman Pandey, Script Encoding Initiative
3. Requester type (Member body/Liaison/Individual contribution): Liaison contribution
4. Submission date: 4 November 2015
5. Requester's reference (if applicable):
6. Choose one of the following: This is a complete proposal: Yes; (or) More information will be provided later:

B. Technical - General
1. Choose one of the following:
   a. This proposal is for a new script (set of characters): Yes. Proposed name of script: Dogra
   b. The proposal is for addition of character(s) to an existing block. Name of the existing block:
2. Number of characters in proposal: 59
3. Proposed category (select one from below; see section 2.2 of P&P document): A-Contemporary; B.1-Specialized (small collection); B.2-Specialized (large collection); C-Major extinct; D-Attested extinct; E-Minor extinct: X; F-Archaic Hieroglyphic or Ideographic; G-Obscure or questionable usage symbols
4. Is a repertoire including character names provided? Yes
   a. If YES, are the names in accordance with the "character naming guidelines" in Annex L of P&P document?
   b. Are the character shapes attached in a legible form suitable for review?
5. Fonts related:
   a. Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the standard? Anshuman Pandey
   b. Identify the party granting a license for use of the font by the editors (include address, e-mail, ftp-site, etc.): Anshuman Pandey
6. References:
   a. Are references (to other character sets, dictionaries, descriptive texts etc.) provided? Yes
   b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? Yes
7. Special encoding issues: Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration etc. (if yes please enclose information)? Yes
8. Additional Information: Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org for such information on other scripts. Also see Unicode Character Database (http://www.unicode.org/reports/tr44/) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard.

C. Technical - Justification
1. Has this proposal for addition of character(s) been submitted before? No. If YES explain:
2. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? Yes. If YES, with whom? Christopher Shackle (SOAS, London). If YES, available relevant documents:
3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? No. Reference: Historical script, currently not used
4. The context of use for the proposed characters (type of use; common or rare): Common. Reference:
5. Are the proposed characters in current use by the user community? No. If YES, where? Reference:
6. After giving due considerations to the principles in the P&P document must the proposed characters be entirely in the BMP? N/A. If YES, is a rationale provided? If YES, reference:
7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? Yes
8. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? No. If YES, is a rationale for its inclusion provided? If YES, reference:
9. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? No. If YES, is a rationale for its inclusion provided? If YES, reference:
10. Can any of the proposed character(s) be considered to be similar (in appearance or function) to, or could be confused with, an existing character? No. If YES, is a rationale for its inclusion provided? If YES, reference:
11. Does the proposal include use of combining characters and/or use of composite sequences? Yes. If YES, is a rationale for such use provided? Yes. If YES, reference: See text of proposal. Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided? If YES, reference:
12. Does the proposal contain characters with any special properties such as control function or similar semantics? Yes. If YES, describe in detail (include attachment if necessary): Virama; see text of proposal
13. Does the proposal contain any Ideographic compatibility characters? No. If YES, are the equivalent corresponding unified ideographic characters identified? If YES, reference:
3.7 Consonants
There are 34 consonant letters, encoded at U+1180A..U+1182B:
KA, KHA, GA, GHA, NGA, CA, CHA, JA, JHA, NYA, TTA, TTHA, DDA, DDHA, NNA, TA, THA, DA, DHA, NA, PA, PHA, BA, BHA, MA, YA, RA, LA, VA, SHA, SSA, SA, HA, RRA
The letter U+11828 DOGRA LETTER SSA was introduced into New Dogra for representing the retroflex sibilant ṣa, which occurs in words of Sanskrit origin. It resembles U+111B0 SHARADA LETTER SSA.
There are glyphic variants of several consonant letters. Variant forms are attested for U+1180C GA, U+1180D GHA, U+11810 CHA, U+11812 JHA, U+1181A THA, U+1181C DHA, and U+11824 RA.
The available sources show different forms of the same letter in use. For instance, both the regular and the variant form of U+11810 DOGRA LETTER CHA are used for cha.
3.8 Anusvara
U+11836 DOGRA SIGN ANUSVARA is used for marking nasalization.
3.9 Visarga
U+11837 DOGRA SIGN VISARGA is used for indicating post-vocalic aspiration in words of Sanskrit origin. It occurs in the word namaḥ "homage to", encoded as <1181D NA, 11822 MA, 11837 VISARGA>.
3.10 Nukta
U+11839 DOGRA SIGN NUKTA is used for representing sounds that are not native to Dogri and related languages.
3.11 Virama
U+11838 DOGRA SIGN VIRAMA has two functions, similar to the corresponding character of Devanagari. It is used as a halanta for marking the absence of the inherent vowel of a consonant letter. It is also a control character that is used for producing conjuncts.
3.12 Consonant Conjuncts
Consonant clusters are rendered in three ways: 1) with visible halanta beneath the full form of each bare consonant; 2) with a half-form of the initial consonant followed by the full form of the following letter; and 3) as ligatures. There are no formal conventions for the written expression of conjuncts. U+11838 DOGRA SIGN VIRAMA is used for the encoded representation of conjuncts.
3.12.1 Types of conjuncts
Some representations of Dogra conjuncts, as found in the available sources, are illustrated below.
• Visible halanta: In both Old and New Dogra, the most common method of representing consonant clusters is by placing a halanta beneath each bare consonant.
• Half-forms: Another method is to use half-forms of letters, if the graphical structure provides such an opportunity. The sources show the conjunct śra in the word śrī (a Sanskrit honorific). The half-form is produced by dropping the right vertical bar of U+11827 DOGRA LETTER SHA. This conjunct is used consistently in New Dogra for representing the cluster śra; the form <11827 SHA, 11838 VIRAMA, 11824 RA> with visible halanta beneath a full form of SHA does not occur in the available sources.
• Ligatures: There are two types of ligature. In the first, the shapes of individual letters are visible. An example is the conjunct sṭa in the word sṭāṃp, a transcription of the English "stamp". The second type is an atomic (akhaṇḍa) ligature, whose form may be considered a distinctive letter. An example is the atomic ligature kṣa, which occurs in the word akṣaroṃ "in letters" (oblique form of akṣara "letter"). The conjunct kṣa is used consistently in New Dogra for representing the cluster kṣa. Its shape may be influenced by Devanagari kṣa.
• Forms of RA: Special conjunct forms of U+11824 DOGRA LETTER RA, e.g. repha or vattu, do not generally occur. Cluster-initial RA is represented as a full form with halanta, as in rṇa in the word saṃpūrṇa, and non-initial RA is treated similarly, as in pra in the word prabhu.
However, there is one instance in the available sources in which cluster-initial RA is represented as a repha. The occurrence is curious because the word caturbhuja "square", in which repha occurs, appears three times in proximity but is spelled differently in all three instances. In the first instance, the RA in caturbhuja is written with halanta: <1180F CA, 11819 TA, 1182F VOWEL SIGN U, 11824 RA, 11838 VIRAMA, 11821 BHA, 1182F VOWEL SIGN U, 11811 JA> caturbhuja. In the second, RA occurs with its inherent vowel intact, without halanta: <1180F CA, 11819 TA, 1182F VOWEL SIGN U, 11824 RA, 11821 BHA, 1182F VOWEL SIGN U, 11811 JA> caturabhuja. In the third, ra is represented as repha: caturbhuja.
The usage of repha in Dogra is to be considered anomalous. There is no present need to support this form of RA in encoded representation.
3.12.2 Encoded representation of conjuncts
The Devanagari model for the representation of conjuncts is appropriate for Dogra. In encoded text, conjuncts are produced by placing the control character U+11838 DOGRA SIGN VIRAMA after each non-final consonant in a cluster, i.e. <(C VIRAMA)* C>.
Representations of conjuncts rendered as atomic ligatures are:
kṣa <1180A KA, 11838 VIRAMA, 11828 SSA>
sṭa <11829 SA, 11838 VIRAMA, 11814 TTA>
Examples of conjuncts rendered using half-forms are:
śra <11827 SHA, 11838 VIRAMA, 11824 RA>
Conjuncts displayed using visible halanta are:
pra <1181E PA, 11838 VIRAMA, 11824 RA>
rṇa <11824 RA, 11838 VIRAMA, 11818 NNA>
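The encoding rule for conjuncts can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names `encode_conjunct` and `fmt` are hypothetical, and the code points are the ones proposed in this document (0x11838 for the virama), which need not match the assignments in any published version of Unicode.

```python
# Code points as proposed in this document (assumption: published
# assignments may differ, so plain integers are used, not literals).
VIRAMA = 0x11838                      # DOGRA SIGN VIRAMA (proposed)
KA, SSA, PA, RA = 0x1180A, 0x11828, 0x1181E, 0x11824

def encode_conjunct(consonants):
    """Return the code point sequence <(C VIRAMA)* C> for a cluster."""
    if not consonants:
        raise ValueError("a cluster needs at least one consonant")
    seq = []
    for cp in consonants[:-1]:
        seq.extend([cp, VIRAMA])      # each non-final consonant is followed by virama
    seq.append(consonants[-1])        # the final consonant keeps its inherent vowel
    return seq

def fmt(seq):
    """Format a code point sequence as space-separated hexadecimal."""
    return " ".join(f"{cp:04X}" for cp in seq)

print(fmt(encode_conjunct([KA, SSA])))   # 1180A 11838 11828
print(fmt(encode_conjunct([PA, RA])))    # 1181E 11838 11824
```

Whether such a sequence is displayed as a ligature, with a half-form, or with visible halanta is decided by the font, not by the encoded sequence.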
3.12.3 Modifying conjunct formation
As is the case in Devanagari and other Indic scripts, a ligature may be broken using the generic control character U+200C ZERO WIDTH NON-JOINER (ZWNJ):
kṣa (ligature) <1180A KA, 11838 VIRAMA, 11828 SSA>
kṣa (visible halanta) <1180A KA, 11838 VIRAMA, 200C ZWNJ, 11828 SSA>
The rendering of a conjunct as a ligature or with visible halanta is a matter of font design.
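As a rough illustration of the mechanism, the following Python sketch inserts U+200C after every virama in a sequence, which requests the visible-halanta form. The helper name `break_ligatures` is hypothetical, and the virama value 0x11838 is the code point proposed in this document, used here as an assumption.

```python
ZWNJ = 0x200C      # ZERO WIDTH NON-JOINER
VIRAMA = 0x11838   # DOGRA SIGN VIRAMA as proposed here (assumption)

def break_ligatures(seq, virama=VIRAMA):
    """Insert ZWNJ after each virama so that ligation is suppressed."""
    out = []
    for cp in seq:
        out.append(cp)
        if cp == virama:
            out.append(ZWNJ)
    return out

ksa = [0x1180A, VIRAMA, 0x11828]          # <KA, VIRAMA, SSA>
print([f"{cp:04X}" for cp in break_ligatures(ksa)])
# ['1180A', '11838', '200C', '11828']
```

A real text-processing pipeline would normally apply this only to the clusters an author wants to display with halanta, not to a whole text.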
3.13 Punctuation
Abbreviation sign: U+1183A DOGRA ABBREVIATION SIGN is used for indicating truncation of text. In the source excerpt, the text <11825 LA, 1182E VOWEL SIGN II, 1183A ABBREVIATION SIGN> <11826 VA, 1183A ABBREVIATION SIGN> <11821 BHA, 1182C VOWEL SIGN AA, 1183A ABBREVIATION SIGN> (lī va bhā 23) is an abbreviation for līla vatī bhāga 23.
Danda-s: The daṇḍā and double daṇḍā are commonly used in Dogra documents for marking the ends of sentences and paragraphs. These daṇḍā-s are not proposed for encoding as separate characters in the 'Dogra' block. They resemble U+0964 DEVANAGARI DANDA and U+0965 DEVANAGARI DOUBLE DANDA and may be represented using these characters. The Devanagari daṇḍā-s are to be specified as extensions for Dogra.
Other signs: Other marks of punctuation, such as dashes, are also used. A hyphen is used in some cases for indicating breaks in a word at the end of a line (see section 3.16).
3.14 Digits
Digits are commonly used in Dogra documents (see figure 11). The style of digits varies across written and printed sources. Some digits in Old Dogra sources resemble those of Takri (see figure 4), while those in New Dogra sources are similar to Devanagari. The Līlavatī, written in New Dogra, shows the digits 1–9 and the number 10.
The majority of these digits resemble Devanagari forms; the exception is 'four'. The form of Dogra 'four' does not resemble U+116C4 TAKRI DIGIT FOUR, which is nearly identical to the Devanagari digit four. Its origins are not known at this time, but it may be related to U+111D4 SHARADA DIGIT FOUR. The forms of 'eight' and 'nine' are well-known variant forms of the regular Devanagari digits.
Additional research into Dogra digits is required before they may be considered for encoding. The available sources suggest that there may have been a cultural understanding that Dogra possessed unique digits. For instance, on the bank notes shown in figure 20, the digits written in Devanagari are rather similar but are nonetheless displayed twice, in order to align the display of the currency value with each script used for the inscriptions.
At present, script-specific digits are not proposed for inclusion in the Dogra repertoire. Devanagari digits should be used normatively and should be extended for usage with Dogra. If future research yields distinctive digits for Dogra, there is sufficient space in the block to accommodate their encoding.
3.15 Number forms
Fraction signs and currency marks are commonly used in Dogra sources (see figure 12). These may be represented using the fraction and currency signs already encoded in the 'Common Indic Number Forms' block (U+A830). These characters are specified as extensions for Dogra. The 'Common Indic Number Forms' are described briefly in The Unicode Standard (chapter 22.3, 'Numerals', pp. 770–771) and in more detail in the original encoding proposal (L2/07-354).
3.16 Linebreaking
Linebreaking occurs after an orthographic syllable. Breaks are generally not indicated in informal sources, but are marked in literary materials using hyphens.
3.17 Collation
The primary collating order for Dogra is as follows:
11800 < 11801 < 11802 < 11803 < 11804 < 11805 < 11806 < 11807 < 11808 < 11809 <
1180A < 1180B < 1180C < 1180D < 1180E < 1180F < 11810 < 11811 <
11812 < 11813 < 11814 < 11815 < 11816 < 11817 < 11818 < 11819 <
1181A < 1181B < 1181C < 1181D < 1181E < 1181F < 11820 < 11821 <
11822 < 11823 < 11824 < 11825 < 11826 < 11827 < 11828 < 11829 < 1182A <
1182B < 1182C < 1182D < 1182E <
1182F < 11830 < 11831 < 11832 <
11833 < 11834 < 11835
The following characters have secondary weights: U+11836 DOGRA SIGN ANUSVARA, U+11837 DOGRA SIGN VISARGA, U+11838 DOGRA SIGN VIRAMA, U+11839 DOGRA SIGN NUKTA.
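A two-level ordering of this kind can be approximated with a simple sort key. The sketch below is a deliberate simplification and not an implementation of the Unicode Collation Algorithm: it assumes the proposed code points, it relies on the primary order coinciding with code point order (as in the list above), and the name `sort_key` is hypothetical.

```python
# Proposed code points (assumption); helper names are illustrative only.
PRIMARY = range(0x11800, 0x11836)                 # letters and vowel signs
SECONDARY = {0x11836, 0x11837, 0x11838, 0x11839}  # anusvara, visarga, virama, nukta

def sort_key(code_points):
    """Primary weights are compared first; secondary marks break ties."""
    primary = tuple(cp for cp in code_points if cp in PRIMARY)
    secondary = tuple(
        (i, cp) for i, cp in enumerate(code_points) if cp in SECONDARY
    )
    return primary, secondary

KA, KHA, ANUSVARA = 0x1180A, 0x1180B, 0x11836
words = [[KHA], [KA, ANUSVARA], [KA]]
ordered = sorted(words, key=sort_key)
print(ordered == [[KA], [KA, ANUSVARA], [KHA]])   # True
```

In this sketch a word with an anusvara sorts directly after the same word without it, and before any word that differs at the primary level.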
4 Character data
4.1 Character properties
The properties for Dogra in the Unicode Character Database format are:
11800;DOGRA LETTER A;Lo;0;L;;;;;N;;;;;
11801;DOGRA LETTER AA;Lo;0;L;;;;;N;;;;;
11802;DOGRA LETTER I;Lo;0;L;;;;;N;;;;;
11804;DOGRA LETTER U;Lo;0;L;;;;;N;;;;;
11806;DOGRA LETTER E;Lo;0;L;;;;;N;;;;;
11807;DOGRA LETTER AI;Lo;0;L;;;;;N;;;;;
11808;DOGRA LETTER O;Lo;0;L;;;;;N;;;;;
1180A;DOGRA LETTER KA;Lo;0;L;;;;;N;;;;;
1180B;DOGRA LETTER KHA;Lo;0;L;;;;;N;;;;;
1180C;DOGRA LETTER GA;Lo;0;L;;;;;N;;;;;
1180D;DOGRA LETTER GHA;Lo;0;L;;;;;N;;;;;
1180E;DOGRA LETTER NGA;Lo;0;L;;;;;N;;;;;
1180F;DOGRA LETTER CA;Lo;0;L;;;;;N;;;;;
11810;DOGRA LETTER CHA;Lo;0;L;;;;;N;;;;;
11811;DOGRA LETTER JA;Lo;0;L;;;;;N;;;;;
11812;DOGRA LETTER JHA;Lo;0;L;;;;;N;;;;;
11813;DOGRA LETTER NYA;Lo;0;L;;;;;N;;;;;
11814;DOGRA LETTER TTA;Lo;0;L;;;;;N;;;;;
11815;DOGRA LETTER TTHA;Lo;0;L;;;;;N;;;;;
11816;DOGRA LETTER DDA;Lo;0;L;;;;;N;;;;;
11817;DOGRA LETTER DDHA;Lo;0;L;;;;;N;;;;;
11818;DOGRA LETTER NNA;Lo;0;L;;;;;N;;;;;
11819;DOGRA LETTER TA;Lo;0;L;;;;;N;;;;;
1181A;DOGRA LETTER THA;Lo;0;L;;;;;N;;;;;
1181B;DOGRA LETTER DA;Lo;0;L;;;;;N;;;;;
1181C;DOGRA LETTER DHA;Lo;0;L;;;;;N;;;;;
1181D;DOGRA LETTER NA;Lo;0;L;;;;;N;;;;;
1181E;DOGRA LETTER PA;Lo;0;L;;;;;N;;;;;
1181F;DOGRA LETTER PHA;Lo;0;L;;;;;N;;;;;
11820;DOGRA LETTER BA;Lo;0;L;;;;;N;;;;;
11821;DOGRA LETTER BHA;Lo;0;L;;;;;N;;;;;
11822;DOGRA LETTER MA;Lo;0;L;;;;;N;;;;;
11823;DOGRA LETTER YA;Lo;0;L;;;;;N;;;;;
11824;DOGRA LETTER RA;Lo;0;L;;;;;N;;;;;
11825;DOGRA LETTER LA;Lo;0;L;;;;;N;;;;;
11826;DOGRA LETTER VA;Lo;0;L;;;;;N;;;;;
11827;DOGRA LETTER SHA;Lo;0;L;;;;;N;;;;;
11828;DOGRA LETTER SSA;Lo;0;L;;;;;N;;;;;
11829;DOGRA LETTER SA;Lo;0;L;;;;;N;;;;;
1182A;DOGRA LETTER HA;Lo;0;L;;;;;N;;;;;
1182B;DOGRA LETTER RRA;Lo;0;L;;;;;N;;;;;
1182C;DOGRA VOWEL SIGN AA;Mc;0;L;;;;;N;;;;;
1182D;DOGRA VOWEL SIGN I;Mc;0;L;;;;;N;;;;;
1182E;DOGRA VOWEL SIGN II;Mc;0;L;;;;;N;;;;;
1182F;DOGRA VOWEL SIGN U;Mn;0;NSM;;;;;N;;;;;
11830;DOGRA VOWEL SIGN UU;Mn;0;NSM;;;;;N;;;;;
11831;DOGRA VOWEL SIGN VOCALIC R;Mn;0;NSM;;;;;N;;;;;
11832;DOGRA VOWEL SIGN E;Mn;0;NSM;;;;;N;;;;;
11833;DOGRA VOWEL SIGN AI;Mn;0;NSM;;;;;N;;;;;
11834;DOGRA VOWEL SIGN O;Mn;0;NSM;;;;;N;;;;;
11835;DOGRA VOWEL SIGN AU;Mn;0;NSM;;;;;N;;;;;
11836;DOGRA SIGN ANUSVARA;Mn;0;NSM;;;;;N;;;;;
11837;DOGRA SIGN VISARGA;Mc;0;L;;;;;N;;;;;
11838;DOGRA SIGN VIRAMA;Mn;9;NSM;;;;;N;;;;;
11839;DOGRA SIGN NUKTA;Mn;7;NSM;;;;;N;;;;;
1183A;DOGRA ABBREVIATION SIGN;Po;0;L;;;;;N;;;;;
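Property records of this kind are straightforward to read programmatically. The sketch below assumes the standard 15-field, semicolon-delimited layout of UnicodeData.txt; the three sample records use the values proposed in this document, and the function name `parse_record` is hypothetical.

```python
# Three sample records with the property values proposed in this document.
SAMPLE = """\
11800;DOGRA LETTER A;Lo;0;L;;;;;N;;;;;
1182F;DOGRA VOWEL SIGN U;Mn;0;NSM;;;;;N;;;;;
11838;DOGRA SIGN VIRAMA;Mn;9;NSM;;;;;N;;;;;
"""

def parse_record(line):
    """Split one UnicodeData.txt-style record into the fields used here."""
    fields = line.rstrip("\n").split(";")
    if len(fields) != 15:
        raise ValueError(f"expected 15 fields, got {len(fields)}")
    return {
        "code_point": int(fields[0], 16),
        "name": fields[1],
        "general_category": fields[2],
        "combining_class": int(fields[3]),
        "bidi_class": fields[4],
        "mirrored": fields[9] == "Y",
    }

records = [parse_record(line) for line in SAMPLE.splitlines()]
for r in records:
    print(f"{r['code_point']:04X} {r['general_category']} ccc={r['combining_class']}")
```

The combining class of 9 on the virama and 7 on the nukta follow the usual convention for Indic scripts; all other Dogra characters in the list have class 0.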
4.2 Linebreaking
Linebreaking properties are given in the data format of LineBreak.txt:
11800..1182B;AL # Lo [44] DOGRA LETTER A..DOGRA LETTER RRA
1182C..1182E;CM # Mc [3] DOGRA VOWEL SIGN AA..DOGRA VOWEL SIGN II
1182F..11835;CM # Mn [7] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN AU
11836;CM # Mn DOGRA SIGN ANUSVARA
11837;CM # Mc DOGRA SIGN VISARGA
11838..11839;CM # Mn [2] DOGRA SIGN VIRAMA..DOGRA SIGN NUKTA
1183A;AL # Po DOGRA ABBREVIATION SIGN
4.3 Syllabic categories
Syllabic categories are given in the format of IndicSyllabicCategory.txt:
# Indic_Syllabic_Category=Bindu
11836 ; Bindu # Mn DOGRA SIGN ANUSVARA
# Indic_Syllabic_Category=Visarga
11837 ; Visarga # Mc DOGRA SIGN VISARGA
# Indic_Syllabic_Category=Nukta
11839 ; Nukta # Mn DOGRA SIGN NUKTA
# Indic_Syllabic_Category=Virama
11838 ; Virama # Mn DOGRA SIGN VIRAMA
# Indic_Syllabic_Category=Vowel_Independent
11800..11809 ; Vowel_Independent # Lo [10] DOGRA LETTER A..DOGRA LETTER AU
# Indic_Syllabic_Category=Vowel_Dependent
1182C..1182E ; Vowel_Dependent # Mc [3] DOGRA VOWEL SIGN AA..DOGRA VOWEL SIGN II
1182F..11835 ; Vowel_Dependent # Mn [7] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN AU
# Indic_Syllabic_Category=Consonant
1180A..1182B ; Consonant # Lo [34] DOGRA LETTER KA..DOGRA LETTER RRA
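Because range lines carry a bracketed count, a small consistency check can catch mismatches between a range and its declared size. The sketch below assumes the usual `first..last ; Value # Gc [count] names` layout of UCD range files; the regular expression and the names `parse_range_line` and `count_is_consistent` are hypothetical helpers, and the second test line uses a deliberately wrong count for demonstration.

```python
import re

# Matches "XXXX ; Value # Gc ..." and "XXXX..YYYY ; Value # Gc [n] ..."
LINE = re.compile(
    r"^([0-9A-F]{4,6})(?:\.\.([0-9A-F]{4,6}))?\s*;\s*(\w+)\s*#\s*(\w\w)\s*(?:\[(\d+)\])?"
)

def parse_range_line(line):
    """Return (first, last, value, general_category, declared_count)."""
    m = LINE.match(line)
    if m is None:
        raise ValueError(f"unrecognized line: {line!r}")
    first = int(m.group(1), 16)
    last = int(m.group(2), 16) if m.group(2) else first
    declared = int(m.group(5)) if m.group(5) else 1
    return first, last, m.group(3), m.group(4), declared

def count_is_consistent(line):
    """True when the bracketed count equals the size of the range."""
    first, last, _, _, declared = parse_range_line(line)
    return last - first + 1 == declared

good = "1180A..1182B ; Consonant # Lo [34] DOGRA LETTER KA..DOGRA LETTER RRA"
bad = "1182F..11835 ; Vowel_Dependent # Mn [6] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN AU"
print(count_is_consistent(good), count_is_consistent(bad))   # True False
```

The same check applies unchanged to lines in the LineBreak.txt and IndicPositionalCategory.txt layouts, since they share the range and comment conventions.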
4.4 Positional categories
Positional data for characters in the format of IndicPositionalCategory.txt:
# Indic_Positional_Category=Right
1182C ; Right # Mc DOGRA VOWEL SIGN AA
1182E ; Right # Mc DOGRA VOWEL SIGN II
11837 ; Right # Mc DOGRA SIGN VISARGA
# Indic_Positional_Category=Left
1182D ; Left # Mc DOGRA VOWEL SIGN I
# Indic_Positional_Category=Top
11836 ; Top # Mn DOGRA SIGN ANUSVARA
11832..11835 ; Top # Mn [4] DOGRA VOWEL SIGN E..DOGRA VOWEL SIGN AU
# Indic_Positional_Category=Bottom
1182F..11831 ; Bottom # Mn [3] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN VOCALIC R
11838 ; Bottom # Mn DOGRA SIGN VIRAMA
11839 ; Bottom # Mn DOGRA SIGN NUKTA
4.5 Script extensions
The following characters should be extended for usage with the Dogra script in ScriptExtensions.txt:
0964 ; Po # DEVANAGARI DANDA
0965 ; Po # DEVANAGARI DOUBLE DANDA
0966..096F ; Nd # [10] DEVANAGARI DIGIT ZERO..DEVANAGARI DIGIT NINE
A830..A835 ; No # [6] NORTH INDIC FRACTION ONE QUARTER..NORTH INDIC FRACTION THREE SIXTEENTHS
A836..A837 ; So # [2] NORTH INDIC QUARTER MARK..NORTH INDIC PLACEHOLDER MARK
A838 ; Sc # NORTH INDIC RUPEE MARK
A839 ; So # NORTH INDIC QUANTITY MARK
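The effect of these extensions can be sketched as a membership test: a character counts as usable in Dogra text if it lies in the Dogra block or in one of the extended ranges. The block range 11800..1184F and the extended ranges are taken from this proposal; the name `used_with_dogra` is a hypothetical helper and not part of any library.

```python
# Ranges as given in this proposal (assumption: published data may differ).
DOGRA_BLOCK = range(0x11800, 0x11850)   # 11800..1184F
EXTENDED = [
    range(0x0964, 0x0966),   # DEVANAGARI DANDA, DOUBLE DANDA
    range(0x0966, 0x0970),   # DEVANAGARI DIGIT ZERO..NINE
    range(0xA830, 0xA83A),   # Common Indic Number Forms A830..A839
]

def used_with_dogra(cp):
    """True if the code point is in the Dogra block or an extended range."""
    return cp in DOGRA_BLOCK or any(cp in r for r in EXTENDED)

for cp in (0x1180A, 0x0964, 0x0969, 0xA838, 0x0970, 0x0041):
    print(f"{cp:04X} {used_with_dogra(cp)}")
```

Note that U+0970 DEVANAGARI ABBREVIATION SIGN is not in the extended set, since Dogra has its own abbreviation sign.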
13
Proposal to encode the Dogra script in Unicode Anshuman Pandey
46 lsquoConfusablersquo characters
Some Dogra characters that resemble characters encoded in other script blocks are
DOGRA LETTER U 11187 SHARADA LETTER UDOGRA LETTER UU 11188 SHARADA LETTER UUDOGRA LETTER O 11684 TAKRI LETTER UDOGRA LETTER KA 0A15 GURMUKHI LETTER KADOGRA LETTER CA 11196 SHARADA LETTER CADOGRA LETTER TTHA 1119C SHARADA LETTER TTHADOGRA LETTER TA 11699 TAKRI LETTER TADOGRA LETTER DHA 11680 TAKRI LETTER ADOGRA LETTER PHA 1169F TAKRI LETTER PHADOGRA LETTER BHA 116A1 TAKRI LETTER BHADOGRA LETTER YA 116A3 TAKRI LETTER YADOGRA LETTER SHA 0936 DEVANAGARI LETTER SHADOGRA LETTER SSA 111B0 SHARADA LETTER SSADOGRA VOWEL SIGN U 0941 DEVANAGARI VOWEL SIGN UDOGRA VOWEL SIGN UU 0942 DEVANAGARI VOWEL SIGN UUDOGRA VOWEL SIGN E 116B2 TAKRI VOWEL SIGN EDOGRA VOWEL SIGN AI 116B3 TAKRI VOWEL SIGN AIDOGRA VOWEL SIGN O 116B4 TAKRI VOWEL SIGN ODOGRA VOWEL SIGN AU 116B5 TAKRI VOWEL SIGN AUDOGRA SIGN ANUSVARA 0902 DEVANAGARI SIGN ANUSVARADOGRA SIGN VIRAMA 094D DEVANAGARI SIGN VIRAMADOGRA SIGN NUKTA 093C DEVANAGARI SIGN NUKTADOGRA ABBREVIATION SIGN 0970 DEVANAGARI ABBREVIATION SIGN
5 References
Bhāskarācārya 1872 Līlavatī Translation into Dogri Jammu Vidya Vilas Press
Grierson George A 1916 The Linguistic Survey of India Vol IX Indo-Aryan Family Central GroupPart I Specimens of Western Hindī and Pantildejābī Calcutta Office of the Superintendent of GovernmentPrinting India
Leitner Gottlieb William 1882 ldquoA Collection of Specimens of Commercial and Other Alphabets andHandwritings as also of Multiplication Tables Current in Various Parts of the Panjab Sind and the NorthWest Provincesrdquo Published as Appendix V of History of Indigenous Education in the Punjab LahoreAnjuman-i-Punjab Press
Pandey Anshuman 2007 ldquoProposal to Encode North Indic Number Forms in ISOIEC 10646rdquo (L207-354)httpwwwunicodeorgL2L200707354-north-indicpdf
mdashmdashmdash 2009 ldquoProposal to Encode the Takri Script in ISOIEC 10646rdquo (L209-424)httpwwwunicodeorgL2L200909424-takripdf
mdashmdashmdash 2015 ldquoPreliminary Proposal to Encode the Dogra Script in Unicoderdquo (L215-213)httpwwwunicodeorgL2L201515213-dograpdf
Staal Fritz 1984 Stamps of Jammu amp Kashmir New York The Collectors Club
von der Lin Carol A ldquoCollection Kashmirrdquo ldquoPhilatelic Glossary Dōgrī-Takrī ScriptsrdquohttpwwwkashmirstampscomDogriGlosshtml
14
Proposal to encode the Dogra script in Unicode Anshuman Pandey
6 Acknowledgments
I would like to thank Christopher Shackle (School of Oriental and African Studies London) for reviewingthis proposal
This project was made possible in part through a Google Research Award granted to Deborah Anderson forthe Script Encoding Initiative and a grant from the United States National Endowment for the Humanities(PR-50205-15) which funds the Universal Scripts Project (part of the Script Encoding Initiative at the Uni-versity of California Berkeley) Any views findings conclusions or recommendations expressed in thispublication do not necessarily reflect those of Google or the National Endowment for the Humanities
Printed using UniBook™ (http://www.unicode.org/unibook/), 04-Nov-2015
Dogra: 11800–1184F
[Code chart: grid of proposed representative glyphs, columns 1180–1184, code points 11800–1183A. The glyph images are not reproduced here; the characters are listed by name below.]
Dogra (11800–1183A)
Punctuation
1183A DOGRA ABBREVIATION SIGN
Independent vowels
11800 DOGRA LETTER A
11801 DOGRA LETTER AA
11802 DOGRA LETTER I
11803 DOGRA LETTER II
11804 DOGRA LETTER U
11805 DOGRA LETTER UU
11806 DOGRA LETTER E
11807 DOGRA LETTER AI
11808 DOGRA LETTER O
11809 DOGRA LETTER AU
Consonants
1180A DOGRA LETTER KA
1180B DOGRA LETTER KHA
1180C DOGRA LETTER GA
1180D DOGRA LETTER GHA
1180E DOGRA LETTER NGA
1180F DOGRA LETTER CA
11810 DOGRA LETTER CHA
11811 DOGRA LETTER JA
11812 DOGRA LETTER JHA
11813 DOGRA LETTER NYA
11814 DOGRA LETTER TTA
11815 DOGRA LETTER TTHA
11816 DOGRA LETTER DDA
11817 DOGRA LETTER DDHA
11818 DOGRA LETTER NNA
11819 DOGRA LETTER TA
1181A DOGRA LETTER THA
1181B DOGRA LETTER DA
1181C DOGRA LETTER DHA
1181D DOGRA LETTER NA
1181E DOGRA LETTER PA
1181F DOGRA LETTER PHA
11820 DOGRA LETTER BA
11821 DOGRA LETTER BHA
11822 DOGRA LETTER MA
11823 DOGRA LETTER YA
11824 DOGRA LETTER RA
11825 DOGRA LETTER LA
11826 DOGRA LETTER VA
11827 DOGRA LETTER SHA
11828 DOGRA LETTER SSA
11829 DOGRA LETTER SA
1182A DOGRA LETTER HA
1182B DOGRA LETTER RRA
Dependent vowel signs
1182C DOGRA VOWEL SIGN AA
1182D DOGRA VOWEL SIGN I
1182E DOGRA VOWEL SIGN II
1182F DOGRA VOWEL SIGN U
11830 DOGRA VOWEL SIGN UU
11831 DOGRA VOWEL SIGN VOCALIC R
11832 DOGRA VOWEL SIGN E
11833 DOGRA VOWEL SIGN AI
11834 DOGRA VOWEL SIGN O
11835 DOGRA VOWEL SIGN AU
Various signs
11836 DOGRA SIGN ANUSVARA
11837 DOGRA SIGN VISARGA
11838 DOGRA SIGN VIRAMA
11839 DOGRA SIGN NUKTA
Figure 1: Comparison of Old Dogra with Takri and other scripts (from Leitner 1882: Set 2).
Figure 2: Comparison of Old Dogra with Takri and other scripts (from Leitner 1882: Set 2).
Figure 3: Specimen of the Dogri language written in Old Dogra (from Grierson 1916: 760).
Figure 4: Another specimen of the Dogri language written in Old Dogra (from Grierson 1916: 772).
Figure 5: Chart of the Old Dogra script (from Grierson 1916: 641).
Figure 6: Comparison of Gurmukhi, Kangra, and Old Dogra scripts (from Grierson 1916: 642).
Figure 7: Comparison of New Dogra (‘Dogri’) and Devanagari vowel and consonant letters (from Staal 1984: 33).
Figure 8: Comparison of New Dogra (‘Dogri’) and Devanagari vowel signs (from Staal 1984: 34).
Figure 9: The first page of a Dogri translation of the Līlavatī by Bhāskarācārya, a Sanskrit treatise on mathematics, printed in the New Dogra script in 1872 (p. 1). Figure 10 contains the second page of this book.
Figure 10: Another page from the Līlavatī in New Dogra (p. 2).
Figure 11: Page from the Līlavatī showing the use of printed New Dogra digits (p. 3).
Figure 12: Usage of fractions for denoting currency in New Dogra (from Staal 1984: 75). The text appears to be typeset using metal fonts. The fraction signs and currency marks may be represented using characters from the ‘Common Indic Number Forms’ block (U+A830) in Unicode.
Figure 13: Two specimens of a trilingual stamp for expedited mail from Jammu State in Old Dogra, Persian, and Devanagari scripts. The Dogra text reads <U+1180A, U+1180A, U+11825> <U+11811, U+11824, U+1182F, U+11824, U+1182E> kakal jarurī. The Persian reads خط ضروری ḵẖat zarūrī; the Devanagari reads आवश्यक पत्र āvaśyaka patra.
Figure 14: Four bilingual regular postage stamps from Jammu State with text in the Old Dogra and Persian scripts. The set consists of half ‘anna’ stamps and a one ‘anna’ stamp. In all stamps, the Dogra text at the top reads <U+11811, U+11822, U+11804> jamu ‘Jammu’ and <U+1180A, U+11810, U+11822, U+1182E, U+11824> kachamīr (= kaśmīr) ‘Kashmir’. In the half ‘anna’ stamps, the first line of the center is Persian ۱۹۲۳ نم انه 1923 [samvat] nim ānah ‘half ānnā’; the second line of the center is Dogra <U+11800, U+1181B, U+11800, U+1181D> ada ana ‘half ānnā’. In the one ‘anna’ stamp, the first line of the center text is Persian یك انه yak ānah ‘one ānnā’; the second line of the center is Dogra <U+11806, U+1180A, U+11800, U+1181D> ekana ‘one ānnā’. In all four stamps, the third line of the center is the date 1923 [samvat] written in Devanagari-like digits १९२३. The Persian text at the bottom of the stamp reads قلمرو سرکار جمون و کشمیر qalamrau sarkār jammūn o kaśmīr ‘dominions of the ruler of Jammu and Kashmir’.
Detail of top left (Devanagari)
Detail of top right (New Dogra)
Detail of center (New Dogra, Devanagari, and Urdu)
Figure 15: The mast of a two rupee non-judicial stamp paper from Kashmir State.
Figure 16: The mast of a five rupee non-judicial stamp paper from Kashmir State, containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.
Figure 17: A one anna stamp from Jammu and Kashmir State with text in New Dogra, Devanagari, and Arabic (Urdu).
Figure 18: Telegraph stamps from Jammu and Kashmir State with text in Dogra, Devanagari, Urdu, and English. The Dogra denominations show usage of Common Indic fractions and the rupee mark.
postakarda kalamrau jammu kasmir va tibbataha “Postcard of the realm of Jammu, Kashmir, and Tibet”
isa tarapha siva pate ke kucha na likho “Do not write anything on this side except for the address”
jamu, sinagar “Jammu”, “Srinagar”
bhaisakh (4), 4 vaisakh “Vaisakh 4”, “4 Vaisakh”
Figure 19: A state postcard from Jammu and Kashmir with a postmark of 19 April 1891. The inscription at top is in New Dogra and the content is in Persian. The postmarks are in English and Dogra.
One rupee banknote
Twenty rupee banknote
Figure 20: Bank notes from Jammu and Kashmir State containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.
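Vowel letters (glyphs identified by code point; Chamba forms are those of the ‘Takri’ block)
Letter   Chamba    Dogra
A        U+11680   U+11800
AA       U+11681   U+11801
I        U+11682   U+11802
II       U+11683   (none)
U        U+11684   U+11804
UU       U+11685   (none)
E        U+11686   U+11806
AI       U+11687   U+11807
O        U+11688   U+11808
AU       U+11689   (none)
Vowel signs
Sign     Chamba    Dogra
A        no dependent form
AA       U+116AD   U+1182C
I        U+116AE   U+1182D
II       U+116AF   U+1182E
U        U+116B0   U+1182F
UU       U+116B1   U+11830
E        U+116B2   U+11832
AI       U+116B3   U+11833
O        U+116B4   U+11834
AU       U+116B5   U+11835
Table 2: Comparison of Chamba and Dogra vowel letters and signs.
Consonant letters (glyphs identified by code point; Chamba forms are those of the ‘Takri’ block)
Letter   Chamba    Dogra
KA       U+1168A   U+1180A
KHA      U+1168B   U+1180B
GA       U+1168C   U+1180C
GHA      U+1168D   U+1180D
NGA      U+1168E   U+1180E
CA       U+1168F   U+1180F
CHA      U+11690   U+11810
JA       U+11691   U+11811
JHA      U+11692   U+11812
NYA      U+11693   U+11813
TTA      U+11694   U+11814
TTHA     U+11695   U+11815
DDA      U+11696   U+11816
DDHA     U+11697   U+11817
NNA      U+11698   U+11818
TA       U+11699   U+11819
THA      U+1169A   U+1181A
DA       U+1169B   U+1181B
DHA      U+1169C   U+1181C
NA       U+1169D   U+1181D
PA       U+1169E   U+1181E
PHA      U+1169F   U+1181F
BA       U+116A0   U+11820
BHA      U+116A1   U+11821
MA       U+116A2   U+11822
YA       U+116A3   U+11823
RA       U+116A4   U+11824
LA       U+116A5   U+11825
VA       U+116A6   U+11826
SHA      U+116A7   U+11827
SSA      (none)    U+11828
SA       U+116A8   U+11829
HA       U+116A9   U+1182A
RRA      U+116AA   U+1182B
Table 3: Comparison of Chamba and Dogra consonant letters.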
ISO/IEC JTC 1/SC 2/WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646
Form number: N4502-F (Original 1994-10-14; Revised 1995-01, 1995-04, 1996-04, 1996-08, 1999-03, 2001-05, 2001-09, 2003-11, 2005-01, 2005-09, 2005-10, 2007-03, 2008-05, 2009-11, 2011-03, 2012-01)
Please fill all the sections A, B and C below. Please read Principles and Procedures Document (P & P) from http://std.dkuug.dk/JTC1/SC2/WG2/docs/principles.html for guidelines and details before filling this form. Please ensure you are using the latest Form from http://std.dkuug.dk/JTC1/SC2/WG2/docs/summaryform.html. See also http://std.dkuug.dk/JTC1/SC2/WG2/docs/roadmaps.html for latest Roadmaps.
A. Administrative
1. Title: Proposal to encode the Dogra script in Unicode
2. Requester's name: Anshuman Pandey, Script Encoding Initiative
3. Requester type (Member body/Liaison/Individual contribution): Liaison contribution
4. Submission date: 4 November 2015
5. Requester's reference (if applicable):
6. Choose one of the following: This is a complete proposal: Yes (or) More information will be provided later:
B. Technical – General
1. Choose one of the following:
a. This proposal is for a new script (set of characters): Yes. Proposed name of script: Dogra
b. The proposal is for addition of character(s) to an existing block: Name of the existing block:
2. Number of characters in proposal: 59
3. Proposed category (select one from below; see section 2.2 of P&P document): A-Contemporary; B.1-Specialized (small collection); B.2-Specialized (large collection); C-Major extinct; D-Attested extinct; E-Minor extinct: X; F-Archaic Hieroglyphic or Ideographic; G-Obscure or questionable usage symbols
4. Is a repertoire including character names provided? Yes
a. If YES, are the names in accordance with the “character naming guidelines” in Annex L of P&P document?
b. Are the character shapes attached in a legible form suitable for review?
5. Fonts related:
a. Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the standard? Anshuman Pandey
b. Identify the party granting a license for use of the font by the editors (include address, e-mail, ftp-site, etc.): Anshuman Pandey
6. References:
a. Are references (to other character sets, dictionaries, descriptive texts etc.) provided? Yes
b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? Yes
7. Special encoding issues: Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration etc. (if yes please enclose information)? Yes
8. Additional Information: Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org for such information on other scripts. Also see Unicode Character Database (http://www.unicode.org/reports/tr44/) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard.
C. Technical – Justification
1. Has this proposal for addition of character(s) been submitted before? No. If YES, explain:
2. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? Yes. If YES, with whom? Christopher Shackle (SOAS, London). If YES, available relevant documents:
3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? No. Reference: Historical script, currently not used.
4. The context of use for the proposed characters (type of use; common or rare): Common. Reference:
5. Are the proposed characters in current use by the user community? No. If YES, where? Reference:
6. After giving due considerations to the principles in the P&P document, must the proposed characters be entirely in the BMP? N/A. If YES, is a rationale provided? If YES, reference:
7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? Yes
8. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? No. If YES, is a rationale for its inclusion provided? If YES, reference:
9. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? No. If YES, is a rationale for its inclusion provided? If YES, reference:
10. Can any of the proposed character(s) be considered to be similar (in appearance or function) to, or could be confused with, an existing character? No. If YES, is a rationale for its inclusion provided? If YES, reference:
11. Does the proposal include use of combining characters and/or use of composite sequences? Yes. If YES, is a rationale for such use provided? Yes. If YES, reference: See text of proposal. Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided? If YES, reference:
12. Does the proposal contain characters with any special properties such as control function or similar semantics? Yes. If YES, describe in detail (include attachment if necessary): Virama; see text of proposal.
13. Does the proposal contain any Ideographic compatibility characters? No. If YES, are the equivalent corresponding unified ideographic characters identified? If YES, reference:
There is evidence of usage of different forms of the same letter in the available sources. For instance, the form shown for U+11810 DOGRA LETTER CHA and a second form are both used for the same letter.
3.8 Anusvara
The U+11836 DOGRA SIGN ANUSVARA is used for marking nasalization.
3.9 Visarga
The U+11837 DOGRA SIGN VISARGA is used for indicating post-vocalic aspiration in words of Sanskrit origin. It is shown below in the word <U+1181D NA, U+11822 MA, U+11837 VISARGA> namaḥ “homage to”.
3.10 Nukta
The U+11839 DOGRA SIGN NUKTA is used for representing sounds that are not native to Dogri and related languages.
3.11 Virama
The U+11838 DOGRA SIGN VIRAMA has two functions, similar to the corresponding character of Devanagari. It is used as a halanta for marking the absence of the inherent vowel of a consonant letter. It is also a control character that is used for producing conjuncts.
3.12 Consonant Conjuncts
Consonant clusters are rendered in three ways: 1) with visible halanta beneath the full form of each bare consonant; 2) with a half-form of the initial consonant followed by the full form of the following letter; and 3) as ligatures. There are no formal conventions for the written expression of conjuncts. The U+11838 DOGRA SIGN VIRAMA is used for the encoded representation of conjuncts.
3.12.1 Types of conjuncts
Some representations of Dogra conjuncts, as found in the available sources, are illustrated below:
• Visible halanta: In both Old and New Dogra, the most common method of representing consonant clusters is by placing a halanta beneath each bare consonant.
• Half-forms: Another method is to use half-forms of letters, if the graphical structure provides such an opportunity.
The above shows the conjunct śra in the word śrī (a Sanskrit honorific). The half-form is produced by dropping the right vertical bar of U+11827 DOGRA LETTER SHA. The conjunct śra is used consistently in New Dogra for representing the cluster śra; the form <U+11827 SHA, U+11838 VIRAMA, U+11824 RA> with visible halanta beneath a full form of SHA does not occur in the available sources.
• Ligatures: There are two types of ligature. In the first, the shapes of individual letters are visible. In the excerpt below, the boxed text contains the conjunct sṭa from the word sṭāṃp, a transcription of the English “stamp”.
The second type is an atomic (akhaṇḍa) ligature, whose form may be considered a distinctive letter. Such a ligature occurs in the excerpt below:
The boxed text above contains the atomic ligature kṣa, which occurs in the word akṣaroṃ “in letters” (oblique form of akṣara “letter”). The conjunct kṣa is used consistently in New Dogra for representing the cluster kṣa. Its shape may be influenced by Devanagari kṣa.
• Forms of RA: Special conjunct forms of U+11824 DOGRA LETTER RA, e.g. repha or vattu, do not generally occur. Cluster-initial RA is represented as a full form with halanta, as in rṇa in the word saṃpūrṇa below:
and non-initial RA is treated similarly, as in pra in the word prabhu below:
However, there is one instance in the available sources in which cluster-initial RA is represented as a repha. The occurrence is curious because the word caturbhuja “square”, in which repha occurs, appears three times in proximity but is spelled differently in all three instances:
In the boxed text at the top, the RA in caturbhuja is written with halanta: <U+1180F, U+11819, U+1182F, U+11824, U+11838, U+11821, U+1182F, U+11811> caturbhuja. In the text at bottom left, RA occurs with its inherent vowel intact, without halanta: <U+1180F, U+11819, U+1182F, U+11824, U+11821, U+1182F, U+11811> caturabhuja. At bottom right, RA is represented as repha: caturbhuja.
The usage of repha in Dogra is to be considered anomalous. There is no present need to support this form of RA in encoded representation.
3.12.2 Encoded representation of conjuncts
The Devanagari model for the representation of conjuncts is appropriate for Dogra. In encoded text, conjuncts are produced by placing the control character U+11838 DOGRA SIGN VIRAMA after each non-final consonant in a cluster, i.e. <(C, VIRAMA), C>.
Representations of conjuncts rendered as atomic ligatures are:
kṣa <U+1180A KA, U+11838 VIRAMA, U+11828 SSA>
sṭa <U+11829 SA, U+11838 VIRAMA, U+11814 TTA>
Examples of conjuncts rendered using half-forms are:
śra <U+11827 SHA, U+11838 VIRAMA, U+11824 RA>
Conjuncts displayed using visible halanta are:
pra <U+1181E PA, U+11838 VIRAMA, U+11824 RA>
rṇa <U+11824 RA, U+11838 VIRAMA, U+11818 NNA>
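The encoding model for conjuncts can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the code points are the ones proposed in this document's code chart, and the names `LETTERS`, `encode_conjunct`, and `show` are hypothetical helpers, not part of any library.

```python
# Proposed code points from this proposal's code chart (an assumption of
# this sketch; they are not taken from a published Unicode version).
VIRAMA = 0x11838  # DOGRA SIGN VIRAMA

# Only the consonants used in the examples of this section.
LETTERS = {
    "KA": 0x1180A, "TTA": 0x11814, "NNA": 0x11818, "PA": 0x1181E,
    "RA": 0x11824, "SHA": 0x11827, "SSA": 0x11828, "SA": 0x11829,
}


def encode_conjunct(*consonants):
    """Return the code points for a cluster: VIRAMA after each non-final consonant."""
    if not consonants:
        raise ValueError("a cluster needs at least one consonant")
    sequence = []
    for name in consonants[:-1]:
        sequence += [LETTERS[name], VIRAMA]
    sequence.append(LETTERS[consonants[-1]])
    return sequence


def show(sequence):
    """Format a code point sequence in U+XXXX notation."""
    return " ".join(f"U+{cp:04X}" for cp in sequence)


if __name__ == "__main__":
    for cluster in (("KA", "SSA"), ("SA", "TTA"), ("SHA", "RA"),
                    ("PA", "RA"), ("RA", "NNA")):
        print("+".join(cluster), "->", show(encode_conjunct(*cluster)))
```

Whether a given sequence is displayed as a ligature, a half-form, or with visible halanta is decided by the font, so the encoded sequence is the same in all three cases.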
3.12.3 Modifying conjunct formation
As is the case in Devanagari and other Indic scripts, a ligature may be broken using the generic control character U+200C ZERO WIDTH NON-JOINER (ZWNJ):
kṣa (ligature) <U+1180A KA, U+11838 VIRAMA, U+11828 SSA>
kṣa (visible halanta) <U+1180A KA, U+11838 VIRAMA, U+200C ZWNJ, U+11828 SSA>
The rendering of a conjunct as a ligature or with a visible virama is a matter of font design.
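As a rough illustration of the ZWNJ mechanism, the following Python sketch inserts U+200C after every virama that is directly followed by a consonant. The code points are the proposed values from this document, and `break_ligatures` is a hypothetical helper name.

```python
ZWNJ = 0x200C    # ZERO WIDTH NON-JOINER
VIRAMA = 0x11838  # DOGRA SIGN VIRAMA (proposed code point)
CONSONANTS = range(0x1180A, 0x1182C)  # DOGRA LETTER KA .. RRA (proposed)


def break_ligatures(code_points):
    """Request visible halanta by adding ZWNJ between VIRAMA and a following consonant."""
    result = []
    for index, cp in enumerate(code_points):
        result.append(cp)
        following = code_points[index + 1] if index + 1 < len(code_points) else None
        if cp == VIRAMA and following in CONSONANTS:
            result.append(ZWNJ)
    return result


if __name__ == "__main__":
    ksa = [0x1180A, 0x11838, 0x11828]
    print([f"U+{cp:04X}" for cp in break_ligatures(ksa)])
```

Applying the function twice leaves the sequence unchanged, because a virama that is already followed by ZWNJ is no longer directly followed by a consonant.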
3.13 Punctuation
Abbreviation sign: The U+1183A DOGRA ABBREVIATION SIGN is used for indicating truncation of text. In the excerpt below, the highlighted text <U+11825, U+1182E, U+1183A> <U+11826, U+1183A> <U+11821, U+1182C, U+1183A> (lī va bhā 23) is an abbreviation for līla vatī bhāga 23.
Danda-s: The daṇḍā and double daṇḍā are commonly used in Dogra documents for marking the ends of sentences and paragraphs. The excerpt below shows usage of the daṇḍā:
and the following shows use of the double daṇḍā:
These daṇḍā-s are not proposed for encoding as separate characters in the ‘Dogra’ block. They resemble U+0964 DEVANAGARI DANDA and U+0965 DEVANAGARI DOUBLE DANDA and may be represented using these characters. The Devanagari daṇḍā-s are to be specified as extensions for Dogra.
Other signs: Other marks of punctuation, such as dashes, are also used:
A hyphen is used in some cases for indicating breaks in a word at the end of a line (see section 3.16).
3.14 Digits
Digits are commonly used in Dogra documents (see figure 11). The style of digits varies across written and printed sources. Some digits in Old Dogra sources resemble those of Takri (see figure 4), while those in New Dogra sources are similar to Devanagari. The excerpt below shows digits 1–9 and the number 10 as represented in the Līlavatī, written in New Dogra:
The majority of the digits shown here resemble Devanagari forms; the exception is ‘four’. The form of Dogra ‘four’ does not resemble U+116C4 TAKRI DIGIT FOUR, which is nearly identical to the Devanagari digit four. Information about its origins is not known at this time, but it may be related to U+111D4 SHARADA DIGIT FOUR. The forms of ‘eight’ and ‘nine’ are well-known variant forms of the regular Devanagari digits.
Additional research into Dogra digits is required before they may be considered for encoding. The available sources suggest that there may have been a cultural understanding that Dogra possessed unique digits. For instance, on the bank notes shown in figure 20, the digits written in Devanagari are rather similar, but are nonetheless displayed twice in order to align the display of the currency value with each script used for the inscriptions.
At present, script-specific digits are not proposed for inclusion in the Dogra repertoire. Devanagari digits should be used normatively and should be extended for usage with Dogra. If future research yields distinctive digits for Dogra, there is sufficient space in the block to accommodate their encoding.
3.15 Number forms
Fraction signs and currency marks are commonly used in Dogra sources (see figure 12). These may be represented using the fraction and currency signs already encoded in the ‘Common Indic Number Forms’ block (U+A830). These characters are specified as extensions for Dogra. The ‘Common Indic Number Forms’ are described briefly in The Unicode Standard (chapter 22.3, ‘Numerals’, pp. 770–771) and in more detail in the original encoding proposal (L2/07-354).
3.16 Linebreaking
Linebreaking occurs after an orthographic syllable. Breaks are generally not indicated in informal sources, but are marked in literary materials using hyphens:
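One way to find candidate break positions is to split text into orthographic syllables with a regular expression. The sketch below is a simplified assumption, not a rule stated in this proposal: it treats a syllable as a consonant cluster (or an independent vowel) followed by optional vowel signs and anusvara or visarga, and it uses the code points proposed in this document. The name `orthographic_syllables` is hypothetical.

```python
import re

# Character classes built from the proposed Dogra code points.
CONSONANT = "[\U0001180A-\U0001182B]"          # KA .. RRA
INDEPENDENT_VOWEL = "[\U00011800-\U00011809]"  # A .. AU
VOWEL_SIGN = "[\U0001182C-\U00011835]"         # VOWEL SIGN AA .. AU
MODIFIER = "[\U00011836\U00011837]"            # ANUSVARA, VISARGA
VIRAMA = "\U00011838"
NUKTA = "\U00011839"
ZWNJ = "\u200C"

# (C [NUKTA] VIRAMA [ZWNJ])* C [NUKTA] [VIRAMA]: a cluster, optionally
# ending in an explicit halanta.
CLUSTER = f"(?:{CONSONANT}{NUKTA}?{VIRAMA}{ZWNJ}?)*{CONSONANT}{NUKTA}?{VIRAMA}?"
SYLLABLE = f"(?:{CLUSTER}|{INDEPENDENT_VOWEL}){VOWEL_SIGN}*{MODIFIER}*"
# Any character that is not part of a syllable becomes its own token.
TOKEN = re.compile(f"{SYLLABLE}|.", re.DOTALL)


def orthographic_syllables(text):
    """Split text into orthographic syllables; a line may break after each one."""
    return [match.group(0) for match in TOKEN.finditer(text)]


if __name__ == "__main__":
    # saṃpūrṇa: SA ANUSVARA, PA VOWEL SIGN UU, RA VIRAMA NNA
    word = "".join(chr(cp) for cp in
                   (0x11829, 0x11836, 0x1181E, 0x11830, 0x11824, 0x11838, 0x11818))
    for syllable in orthographic_syllables(word):
        print(" ".join(f"U+{ord(ch):04X}" for ch in syllable))
```

A production implementation would normally rely on the line breaking and syllabic category data given in section 4 rather than on a hand-written pattern.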
3.17 Collation
The primary collating order for Dogra is as follows (characters are identified by their proposed names, without the DOGRA or DOGRA LETTER prefix):
A < AA < I < II < U < UU < E < AI < O < AU <
KA < KHA < GA < GHA < NGA < CA < CHA < JA <
JHA < NYA < TTA < TTHA < DDA < DDHA < NNA < TA <
THA < DA < DHA < NA < PA < PHA < BA < BHA <
MA < YA < RA < LA < VA < SHA < SSA < SA < HA <
RRA < VOWEL SIGN AA < VOWEL SIGN I < VOWEL SIGN II <
VOWEL SIGN U < VOWEL SIGN UU < VOWEL SIGN VOCALIC R < VOWEL SIGN E <
VOWEL SIGN AI < VOWEL SIGN O < VOWEL SIGN AU
The following characters have secondary weights: U+11836 SIGN ANUSVARA, U+11837 SIGN VISARGA, U+11838 SIGN VIRAMA, U+11839 SIGN NUKTA.
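The two-level ordering described above can be approximated with a simple sort key. The sketch below is a simplification and not an implementation of the Unicode Collation Algorithm: it assumes the proposed code points, ignores characters outside the Dogra repertoire, and orders the four secondary-weight signs among themselves by code point, which the proposal does not specify. The name `sort_key` is hypothetical.

```python
# Primary order follows the listing above, which coincides with the
# proposed code point order from LETTER A through VOWEL SIGN AU.
PRIMARY = {cp: rank for rank, cp in enumerate(range(0x11800, 0x11836), start=1)}
# ANUSVARA, VISARGA, VIRAMA, NUKTA only matter at the secondary level.
SECONDARY = {cp: rank for rank, cp in enumerate(range(0x11836, 0x1183A), start=1)}


def sort_key(text):
    """Return a (primary, secondary) key; other characters are ignored."""
    primary, secondary = [], []
    for ch in text:
        cp = ord(ch)
        if cp in PRIMARY:
            primary.append(PRIMARY[cp])
            secondary.append(0)  # common secondary weight for primary characters
        elif cp in SECONDARY:
            secondary.append(SECONDARY[cp])
    return (tuple(primary), tuple(secondary))


if __name__ == "__main__":
    ka, kha, anusvara = chr(0x1180A), chr(0x1180B), chr(0x11836)
    words = [kha, ka + anusvara, ka]
    for word in sorted(words, key=sort_key):
        print(" ".join(f"U+{ord(ch):04X}" for ch in word))
```

With this key, a string that differs from another only by an anusvara, visarga, virama, or nukta sorts next to it instead of being moved to a different part of the list.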
4 Character data
4.1 Character properties
The properties for Dogra in the Unicode Character Database format are:
11800;DOGRA LETTER A;Lo;0;L;;;;;N;;;;;
11801;DOGRA LETTER AA;Lo;0;L;;;;;N;;;;;
11802;DOGRA LETTER I;Lo;0;L;;;;;N;;;;;
11804;DOGRA LETTER U;Lo;0;L;;;;;N;;;;;
11806;DOGRA LETTER E;Lo;0;L;;;;;N;;;;;
11807;DOGRA LETTER AI;Lo;0;L;;;;;N;;;;;
11808;DOGRA LETTER O;Lo;0;L;;;;;N;;;;;
1180A;DOGRA LETTER KA;Lo;0;L;;;;;N;;;;;
1180B;DOGRA LETTER KHA;Lo;0;L;;;;;N;;;;;
1180C;DOGRA LETTER GA;Lo;0;L;;;;;N;;;;;
1180D;DOGRA LETTER GHA;Lo;0;L;;;;;N;;;;;
1180E;DOGRA LETTER NGA;Lo;0;L;;;;;N;;;;;
1180F;DOGRA LETTER CA;Lo;0;L;;;;;N;;;;;
11810;DOGRA LETTER CHA;Lo;0;L;;;;;N;;;;;
11811;DOGRA LETTER JA;Lo;0;L;;;;;N;;;;;
11812;DOGRA LETTER JHA;Lo;0;L;;;;;N;;;;;
11813;DOGRA LETTER NYA;Lo;0;L;;;;;N;;;;;
11814;DOGRA LETTER TTA;Lo;0;L;;;;;N;;;;;
11815;DOGRA LETTER TTHA;Lo;0;L;;;;;N;;;;;
11816;DOGRA LETTER DDA;Lo;0;L;;;;;N;;;;;
11817;DOGRA LETTER DDHA;Lo;0;L;;;;;N;;;;;
11818;DOGRA LETTER NNA;Lo;0;L;;;;;N;;;;;
11819;DOGRA LETTER TA;Lo;0;L;;;;;N;;;;;
1181A;DOGRA LETTER THA;Lo;0;L;;;;;N;;;;;
1181B;DOGRA LETTER DA;Lo;0;L;;;;;N;;;;;
1181C;DOGRA LETTER DHA;Lo;0;L;;;;;N;;;;;
1181D;DOGRA LETTER NA;Lo;0;L;;;;;N;;;;;
1181E;DOGRA LETTER PA;Lo;0;L;;;;;N;;;;;
1181F;DOGRA LETTER PHA;Lo;0;L;;;;;N;;;;;
11820;DOGRA LETTER BA;Lo;0;L;;;;;N;;;;;
11821;DOGRA LETTER BHA;Lo;0;L;;;;;N;;;;;
11822;DOGRA LETTER MA;Lo;0;L;;;;;N;;;;;
11823;DOGRA LETTER YA;Lo;0;L;;;;;N;;;;;
11824;DOGRA LETTER RA;Lo;0;L;;;;;N;;;;;
11825;DOGRA LETTER LA;Lo;0;L;;;;;N;;;;;
11826;DOGRA LETTER VA;Lo;0;L;;;;;N;;;;;
11827;DOGRA LETTER SHA;Lo;0;L;;;;;N;;;;;
11828;DOGRA LETTER SSA;Lo;0;L;;;;;N;;;;;
11829;DOGRA LETTER SA;Lo;0;L;;;;;N;;;;;
1182A;DOGRA LETTER HA;Lo;0;L;;;;;N;;;;;
1182B;DOGRA LETTER RRA;Lo;0;L;;;;;N;;;;;
1182C;DOGRA VOWEL SIGN AA;Mc;0;L;;;;;N;;;;;
1182D;DOGRA VOWEL SIGN I;Mc;0;L;;;;;N;;;;;
1182E;DOGRA VOWEL SIGN II;Mc;0;L;;;;;N;;;;;
1182F;DOGRA VOWEL SIGN U;Mn;0;NSM;;;;;N;;;;;
11830;DOGRA VOWEL SIGN UU;Mn;0;NSM;;;;;N;;;;;
11831;DOGRA VOWEL SIGN VOCALIC R;Mn;0;NSM;;;;;N;;;;;
11832;DOGRA VOWEL SIGN E;Mn;0;NSM;;;;;N;;;;;
11833;DOGRA VOWEL SIGN AI;Mn;0;NSM;;;;;N;;;;;
11834;DOGRA VOWEL SIGN O;Mn;0;NSM;;;;;N;;;;;
11835;DOGRA VOWEL SIGN AU;Mn;0;NSM;;;;;N;;;;;
11836;DOGRA SIGN ANUSVARA;Mn;0;NSM;;;;;N;;;;;
11837;DOGRA SIGN VISARGA;Mc;0;L;;;;;N;;;;;
11838;DOGRA SIGN VIRAMA;Mn;9;NSM;;;;;N;;;;;
11839;DOGRA SIGN NUKTA;Mn;7;NSM;;;;;N;;;;;
1183A;DOGRA ABBREVIATION SIGN;Po;0;L;;;;;N;;;;;
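Records of this kind are straightforward to read programmatically. The following Python sketch assumes the conventional semicolon-separated UnicodeData.txt layout with 15 fields per line and extracts only the first five; the sample lines restate entries from the list above, and `UcdRecord` and `parse_line` are hypothetical names.

```python
from typing import NamedTuple


class UcdRecord(NamedTuple):
    code_point: int
    name: str
    general_category: str
    combining_class: int
    bidi_class: str


def parse_line(line):
    """Parse one UnicodeData.txt-style record (15 semicolon-separated fields)."""
    fields = line.strip().split(";")
    if len(fields) != 15:
        raise ValueError(f"expected 15 fields, got {len(fields)}")
    return UcdRecord(
        code_point=int(fields[0], 16),
        name=fields[1],
        general_category=fields[2],
        combining_class=int(fields[3]),
        bidi_class=fields[4],
    )


# Sample records restated from the property list in this section.
SAMPLE = """\
1180A;DOGRA LETTER KA;Lo;0;L;;;;;N;;;;;
1182F;DOGRA VOWEL SIGN U;Mn;0;NSM;;;;;N;;;;;
11838;DOGRA SIGN VIRAMA;Mn;9;NSM;;;;;N;;;;;
11839;DOGRA SIGN NUKTA;Mn;7;NSM;;;;;N;;;;;
"""

if __name__ == "__main__":
    for raw in SAMPLE.splitlines():
        record = parse_line(raw)
        print(f"U+{record.code_point:04X}", record.general_category,
              record.combining_class, record.bidi_class, record.name)
```

The non-zero combining classes of the virama (9) and nukta (7) are the values that affect canonical reordering of these marks.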
4.2 Linebreaking
Linebreaking properties, given in the data format of LineBreak.txt:
11800..1182B;AL # Lo [44] DOGRA LETTER A..DOGRA LETTER RRA
1182C..1182E;CM # Mc [3] DOGRA VOWEL SIGN AA..DOGRA VOWEL SIGN II
1182F..11835;CM # Mn [7] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN AU
11836;CM # Mn DOGRA SIGN ANUSVARA
11837;CM # Mc DOGRA SIGN VISARGA
11838..11839;CM # Mn [2] DOGRA SIGN VIRAMA..DOGRA SIGN NUKTA
1183A;AL # Po DOGRA ABBREVIATION SIGN
4.3 Syllabic categories
Syllabic categories, given in the format of IndicSyllabicCategory.txt:
# Indic_Syllabic_Category=Bindu
11836 ; Bindu # Mn DOGRA SIGN ANUSVARA
# Indic_Syllabic_Category=Visarga
11837 ; Visarga # Mc DOGRA SIGN VISARGA
# Indic_Syllabic_Category=Nukta
11839 ; Nukta # Mn DOGRA SIGN NUKTA
# Indic_Syllabic_Category=Virama
11838 ; Virama # Mn DOGRA SIGN VIRAMA
# Indic_Syllabic_Category=Vowel_Independent
11800..11809 ; Vowel_Independent # Lo [10] DOGRA LETTER A..DOGRA LETTER AU
# Indic_Syllabic_Category=Vowel_Dependent
1182C..1182E ; Vowel_Dependent # Mc [3] DOGRA VOWEL SIGN AA..DOGRA VOWEL SIGN II
1182F..11835 ; Vowel_Dependent # Mn [7] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN AU
# Indic_Syllabic_Category=Consonant
1180A..1182B ; Consonant # Lo [34] DOGRA LETTER KA..DOGRA LETTER RRA
4.4 Positional categories
Positional data for characters, in the format of IndicPositionalCategory.txt:
# Indic_Positional_Category=Right
1182C ; Right # Mc DOGRA VOWEL SIGN AA
1182E ; Right # Mc DOGRA VOWEL SIGN II
11837 ; Right # Mc DOGRA SIGN VISARGA
# Indic_Positional_Category=Left
1182D ; Left # Mc DOGRA VOWEL SIGN I
# Indic_Positional_Category=Top
11836 ; Top # Mn DOGRA SIGN ANUSVARA
11832..11835 ; Top # Mn [4] DOGRA VOWEL SIGN E..DOGRA VOWEL SIGN AU
# Indic_Positional_Category=Bottom
1182F..11831 ; Bottom # Mn [3] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN VOCALIC R
11838 ; Bottom # Mn DOGRA SIGN VIRAMA
11839 ; Bottom # Mn DOGRA SIGN NUKTA
4.5 Script extensions
The following characters should be extended for usage with the Dogra script in ScriptExtensions.txt:
0964 # Po DEVANAGARI DANDA
0965 # Po DEVANAGARI DOUBLE DANDA
0966..096F # Nd [10] DEVANAGARI DIGIT ZERO..DEVANAGARI DIGIT NINE
A830..A835 # No [6] NORTH INDIC FRACTION ONE QUARTER..NORTH INDIC FRACTION THREE SIXTEENTHS
A836..A837 # So [2] NORTH INDIC QUARTER MARK..NORTH INDIC PLACEHOLDER MARK
A838 # Sc NORTH INDIC RUPEE MARK
A839 # So NORTH INDIC QUANTITY MARK
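The practical effect of these extensions is that Dogra text may legitimately contain characters from outside the proposed block. A minimal Python sketch of such a membership check follows; it assumes the proposed block range 11800–1183A and the extension ranges listed above, and `usable_with_dogra` is a hypothetical helper name.

```python
# Proposed Dogra repertoire (11800..1183A) and the characters listed above
# as script extensions. Python ranges exclude their end point.
DOGRA_REPERTOIRE = range(0x11800, 0x1183B)
DOGRA_EXTENSIONS = (
    range(0x0964, 0x0966),  # DEVANAGARI DANDA, DOUBLE DANDA
    range(0x0966, 0x0970),  # DEVANAGARI DIGIT ZERO .. NINE
    range(0xA830, 0xA83A),  # NORTH INDIC fractions and marks
)


def usable_with_dogra(ch):
    """Return True if the character is in the repertoire or an extension range."""
    cp = ord(ch)
    return cp in DOGRA_REPERTOIRE or any(cp in r for r in DOGRA_EXTENSIONS)


if __name__ == "__main__":
    for ch in ("\u0964", "\u096A", "\uA838", "\u0915", "a"):
        print(f"U+{ord(ch):04X}", usable_with_dogra(ch))
```

A check like this could be used, for example, to avoid flagging Devanagari digits or dandas as mixed-script content in Dogra text.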
4.6 ‘Confusable’ characters
Some Dogra characters that resemble characters encoded in other script blocks are:
DOGRA LETTER U : 11187 SHARADA LETTER U
DOGRA LETTER UU : 11188 SHARADA LETTER UU
DOGRA LETTER O : 11684 TAKRI LETTER U
DOGRA LETTER KA : 0A15 GURMUKHI LETTER KA
DOGRA LETTER CA : 11196 SHARADA LETTER CA
DOGRA LETTER TTHA : 1119C SHARADA LETTER TTHA
DOGRA LETTER TA : 11699 TAKRI LETTER TA
DOGRA LETTER DHA : 11680 TAKRI LETTER A
DOGRA LETTER PHA : 1169F TAKRI LETTER PHA
DOGRA LETTER BHA : 116A1 TAKRI LETTER BHA
DOGRA LETTER YA : 116A3 TAKRI LETTER YA
DOGRA LETTER SHA : 0936 DEVANAGARI LETTER SHA
DOGRA LETTER SSA : 111B0 SHARADA LETTER SSA
DOGRA VOWEL SIGN U : 0941 DEVANAGARI VOWEL SIGN U
DOGRA VOWEL SIGN UU : 0942 DEVANAGARI VOWEL SIGN UU
DOGRA VOWEL SIGN E : 116B2 TAKRI VOWEL SIGN E
DOGRA VOWEL SIGN AI : 116B3 TAKRI VOWEL SIGN AI
DOGRA VOWEL SIGN O : 116B4 TAKRI VOWEL SIGN O
DOGRA VOWEL SIGN AU : 116B5 TAKRI VOWEL SIGN AU
DOGRA SIGN ANUSVARA : 0902 DEVANAGARI SIGN ANUSVARA
DOGRA SIGN VIRAMA : 094D DEVANAGARI SIGN VIRAMA
DOGRA SIGN NUKTA : 093C DEVANAGARI SIGN NUKTA
DOGRA ABBREVIATION SIGN : 0970 DEVANAGARI ABBREVIATION SIGN
Printed using UniBooktrade(httpwwwunicodeorgunibook)
Printed 04-Nov-2015 1
1184FDogra11800
1180 1181 1182 1183 1184
a
71681
i
71683
u
71685
e
71687
o
71689
71690
71691
71692
71693
71694
71695
71696
71697
71698
71699
71700
71701
71702
71703
71704
71705
71706
71707
71708
71709
71710
71711
71712
71713
71714
71715
71716
71717
71718
71719
71720
71721
71722
71723
$71724
$ i
$71726
$ u
$ 71728
$ r
$ e
$ 71731
$ o
$ 71733
$ 71734
$71735
$ 71736
$ 71737
71738
11800
11801
11802
11803
11804
11805
11806
11807
11808
11809
1180A
1180B
1180C
1180D
1180E
1180F
11810
11811
11812
11813
11814
11815
11816
11817
11818
11819
1181A
1181B
1181C
1181D
1181E
1181F
11820
11821
11822
11823
11824
11825
11826
11827
11828
11829
1182A
1182B
1182C
1182D
1182E
1182F
11830
11831
11832
11833
11834
11835
11836
11837
11838
11839
1183A
0
1
2
3
4
5
6
7
8
9
A
B
C
D
E
F
Printed using UniBooktrade(httpwwwunicodeorgunibook)
Printed 04-Nov-2015 2
Dogra (11800–1183A)

Punctuation
1183A DOGRA ABBREVIATION SIGN

Independent vowels
11800 DOGRA LETTER A
11801 DOGRA LETTER AA
11802 DOGRA LETTER I
11803 DOGRA LETTER II
11804 DOGRA LETTER U
11805 DOGRA LETTER UU
11806 DOGRA LETTER E
11807 DOGRA LETTER AI
11808 DOGRA LETTER O
11809 DOGRA LETTER AU

Consonants
1180A DOGRA LETTER KA
1180B DOGRA LETTER KHA
1180C DOGRA LETTER GA
1180D DOGRA LETTER GHA
1180E DOGRA LETTER NGA
1180F DOGRA LETTER CA
11810 DOGRA LETTER CHA
11811 DOGRA LETTER JA
11812 DOGRA LETTER JHA
11813 DOGRA LETTER NYA
11814 DOGRA LETTER TTA
11815 DOGRA LETTER TTHA
11816 DOGRA LETTER DDA
11817 DOGRA LETTER DDHA
11818 DOGRA LETTER NNA
11819 DOGRA LETTER TA
1181A DOGRA LETTER THA
1181B DOGRA LETTER DA
1181C DOGRA LETTER DHA
1181D DOGRA LETTER NA
1181E DOGRA LETTER PA
1181F DOGRA LETTER PHA
11820 DOGRA LETTER BA
11821 DOGRA LETTER BHA
11822 DOGRA LETTER MA
11823 DOGRA LETTER YA
11824 DOGRA LETTER RA
11825 DOGRA LETTER LA
11826 DOGRA LETTER VA
11827 DOGRA LETTER SHA
11828 DOGRA LETTER SSA
11829 DOGRA LETTER SA
1182A DOGRA LETTER HA
1182B DOGRA LETTER RRA

Dependent vowel signs
1182C DOGRA VOWEL SIGN AA
1182D DOGRA VOWEL SIGN I
1182E DOGRA VOWEL SIGN II
1182F DOGRA VOWEL SIGN U
11830 DOGRA VOWEL SIGN UU
11831 DOGRA VOWEL SIGN VOCALIC R
11832 DOGRA VOWEL SIGN E
11833 DOGRA VOWEL SIGN AI
11834 DOGRA VOWEL SIGN O
11835 DOGRA VOWEL SIGN AU

Various signs
11836 DOGRA SIGN ANUSVARA
11837 DOGRA SIGN VISARGA
11838 DOGRA SIGN VIRAMA
11839 DOGRA SIGN NUKTA
Figure 1: Comparison of Old Dogra with Takri and other scripts (from Leitner 1882: Set 2).

Figure 2: Comparison of Old Dogra with Takri and other scripts (from Leitner 1882: Set 2).

Figure 3: Specimen of the Dogri language written in Old Dogra (from Grierson 1916: 760).

Figure 4: Another specimen of the Dogri language written in Old Dogra (from Grierson 1916: 772).

Figure 5: Chart of the Old Dogra script (from Grierson 1916: 641).

Figure 6: Comparison of Gurmukhi, Kangra, and Old Dogra scripts (from Grierson 1916: 642).

Figure 7: Comparison of New Dogra (‘Dogri’) and Devanagari vowel and consonant letters (from Staal 1984: 33).

Figure 8: Comparison of New Dogra (‘Dogri’) and Devanagari vowel signs (from Staal 1984: 34).
Figure 9: The first page of a Dogri translation of the Līlavatī by Bhāskarācārya, a Sanskrit treatise on mathematics, printed in the New Dogra script in 1872 (p. 1). Figure 10 contains the second page of this book.

Figure 10: Another page from the Līlavatī in New Dogra (p. 2).

Figure 11: Page from the Līlavatī showing the use of printed New Dogra digits (p. 3).

Figure 12: Usage of fractions for denoting currency in New Dogra (from Staal 1984: 75). The text appears to be typeset using metal fonts. The fraction signs and currency marks may be represented using characters from the ‘Common Indic Number Forms’ block (U+A830) in Unicode.
Figure 13: Two specimens of a trilingual stamp for expedited mail from Jammu State in Old Dogra, Persian, and Devanagari scripts. The Dogra text reads <U+1180A, U+1180A, U+11825> <U+11811, U+11824, U+1182F, U+11824, U+1182E> kakal jarurī. The Persian reads خط ضروری ḵẖat zarūrī; the Devanagari reads आवश्यक पत्र āvaśyaka patra.

Figure 14: Four bilingual regular postage stamps from Jammu State with text in the Old Dogra and Persian scripts. The set comprises half ‘anna’ stamps and a one ‘anna’ stamp. In all stamps the Dogra text at the top reads <U+11811, U+11822, U+11804> jamu ‘Jammu’ and <U+1180A, U+11810, U+11822, U+1182E, U+11824> kachamīr (= kaśmīr) ‘Kashmir’. On the half ‘anna’ stamps the first line of the center is Persian ۱۹۲۳ نم انه 1923 [samvat] nim ānah ‘half ānnā’; the second line of the center is Dogra <U+11800, U+1181B, U+11800, U+1181D> ada ana ‘half ānnā’. On the one ‘anna’ stamp the first line of the center text is Persian یك انه yak ānah ‘one ānnā’; the second line of the center is Dogra <U+11806, U+1180A, U+11800, U+1181D> ekana ‘one ānnā’. In all four stamps the third line of the center is the date 1923 [samvat], written in Devanagari-like digits १९२३. The Persian text at the bottom of the stamp reads قلمرو سرکار جمون و کشمیر qalamrau sarkār jammūn o kaśmīr ‘dominions of the ruler of Jammu and Kashmir’.
Detail of top left (Devanagari)
Detail of top right (New Dogra)
Detail of center (New Dogra, Devanagari, and Urdu)

Figure 15: The mast of a two rupee non-judicial stamp paper from Kashmir State.

Figure 16: The mast of a five rupee non-judicial stamp paper from Kashmir State, containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.

Figure 17: A one anna stamp from Jammu and Kashmir State with text in New Dogra, Devanagari, and Arabic (Urdu).

Figure 18: Telegraph stamps from Jammu and Kashmir State with text in Dogra, Devanagari, Urdu, and English. The Dogra denominations show usage of Common Indic fractions and the rupee mark.

postakarda kalamrau jammu kasmir va tibbataha: “Postcard of the realm of Jammu, Kashmir, and Tibet”
isa tarapha siva pate ke kucha na likho: “Do not write anything on this side except for the address”
jamu, sinagar: “Jammu”, “Srinagar”
bhaisakh (4 ), 4 vaisakh: “Vaisakh 4”, “4 Vaisakh”

Figure 19: A state postcard from Jammu and Kashmir with post mark of 19 April 1891. The inscription at top is in New Dogra and the content is in Persian. The postmarks are in English and Dogra.

One rupee banknote
Twenty rupee banknote

Figure 20: Bank notes from Jammu and Kashmir State containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.
Vowel   Chamba letter   Dogra letter   Chamba sign          Dogra sign
A       U+11680         U+11800        no dependent form
AA      U+11681         U+11801        U+116AD              U+1182C
I       U+11682         U+11802        U+116AE              U+1182D
II      U+11683         none           U+116AF              U+1182E
U       U+11684         U+11804        U+116B0              U+1182F
UU      U+11685         none           U+116B1              U+11830
E       U+11686         U+11806        U+116B2              U+11832
AI      U+11687         U+11807        U+116B3              U+11833
O       U+11688         U+11808        U+116B4              U+11834
AU      U+11689         none           U+116B5              U+11835

Table 2: Comparison of Chamba and Dogra vowel letters and signs.
Consonant   Chamba    Dogra
KA          U+1168A   U+1180A
KHA         U+1168B   U+1180B
GA          U+1168C   U+1180C
GHA         U+1168D   U+1180D
NGA         U+1168E   U+1180E
CA          U+1168F   U+1180F
CHA         U+11690   U+11810
JA          U+11691   U+11811
JHA         U+11692   U+11812
NYA         U+11693   U+11813
TTA         U+11694   U+11814
TTHA        U+11695   U+11815
DDA         U+11696   U+11816
DDHA        U+11697   U+11817
NNA         U+11698   U+11818
TA          U+11699   U+11819
THA         U+1169A   U+1181A
DA          U+1169B   U+1181B
DHA         U+1169C   U+1181C
NA          U+1169D   U+1181D
PA          U+1169E   U+1181E
PHA         U+1169F   U+1181F
BA          U+116A0   U+11820
BHA         U+116A1   U+11821
MA          U+116A2   U+11822
YA          U+116A3   U+11823
RA          U+116A4   U+11824
LA          U+116A5   U+11825
VA          U+116A6   U+11826
SHA         U+116A7   U+11827
SSA         none      U+11828
SA          U+116A8   U+11829
HA          U+116A9   U+1182A
RRA         U+116AA   U+1182B

Table 3: Comparison of Chamba and Dogra consonant letters.
ISO/IEC JTC 1/SC 2/WG 2
PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646 [1]

Please fill all the sections A, B and C below. Please read Principles and Procedures Document (P & P) from http://std.dkuug.dk/JTC1/SC2/WG2/docs/principles.html for guidelines and details before filling this form. Please ensure you are using the latest Form from http://std.dkuug.dk/JTC1/SC2/WG2/docs/summaryform.html. See also http://std.dkuug.dk/JTC1/SC2/WG2/docs/roadmaps.html for latest Roadmaps.
A. Administrative

1. Title: Proposal to encode the Dogra script in Unicode
2. Requester's name: Anshuman Pandey, Script Encoding Initiative
3. Requester type (Member body/Liaison/Individual contribution): Liaison contribution
4. Submission date: 4 November 2015
5. Requester's reference (if applicable):
6. Choose one of the following:
   This is a complete proposal: Yes
   (or) More information will be provided later:

B. Technical – General

1. Choose one of the following:
   a. This proposal is for a new script (set of characters): Yes
      Proposed name of script: Dogra
   b. The proposal is for addition of character(s) to an existing block:
      Name of the existing block:
2. Number of characters in proposal: 59
3. Proposed category (select one from below; see section 2.2 of P&P document):
   A-Contemporary
   B.1-Specialized (small collection)
   B.2-Specialized (large collection)
   C-Major extinct
   D-Attested extinct
   E-Minor extinct: X
   F-Archaic Hieroglyphic or Ideographic
   G-Obscure or questionable usage symbols
4. Is a repertoire including character names provided? Yes
   a. If YES, are the names in accordance with the “character naming guidelines” in Annex L of P&P document?
   b. Are the character shapes attached in a legible form suitable for review?
5. Fonts related:
   a. Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the standard? Anshuman Pandey
   b. Identify the party granting a license for use of the font by the editors (include address, e-mail, ftp-site, etc.): Anshuman Pandey
6. References:
   a. Are references (to other character sets, dictionaries, descriptive texts etc.) provided? Yes
   b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? Yes
7. Special encoding issues: Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration etc. (if yes please enclose information)? Yes
8. Additional Information: Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org/ for such information on other scripts. Also see Unicode Character Database (http://www.unicode.org/reports/tr44/) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard.

[1] Form number: N4502-F (Original 1994-10-14; Revised 1995-01, 1995-04, 1996-04, 1996-08, 1999-03, 2001-05, 2001-09, 2003-11, 2005-01, 2005-09, 2005-10, 2007-03, 2008-05, 2009-11, 2011-03, 2012-01)
C. Technical - Justification

1. Has this proposal for addition of character(s) been submitted before? No
   If YES, explain:
2. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? Yes
   If YES, with whom? Christopher Shackle (SOAS, London)
   If YES, available relevant documents:
3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? No
   Reference: Historical script, currently not used
4. The context of use for the proposed characters (type of use; common or rare): Common
   Reference:
5. Are the proposed characters in current use by the user community? No
   If YES, where? Reference:
6. After giving due considerations to the principles in the P&P document, must the proposed characters be entirely in the BMP? N/A
   If YES, is a rationale provided? If YES, reference:
7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? Yes
8. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? No
   If YES, is a rationale for its inclusion provided? If YES, reference:
9. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? No
   If YES, is a rationale for its inclusion provided? If YES, reference:
10. Can any of the proposed character(s) be considered to be similar (in appearance or function) to, or could be confused with, an existing character? No
    If YES, is a rationale for its inclusion provided? If YES, reference:
11. Does the proposal include use of combining characters and/or use of composite sequences? Yes
    If YES, is a rationale for such use provided? Yes
    If YES, reference: See text of proposal
    Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided? If YES, reference:
12. Does the proposal contain characters with any special properties such as control function or similar semantics? Yes
    If YES, describe in detail (include attachment if necessary): Virama; see text of proposal
13. Does the proposal contain any Ideographic compatibility characters? No
    If YES, are the equivalent corresponding unified ideographic characters identified? If YES, reference:
• Half-forms: Another method is to use half-forms of letters, if the graphical structure provides such an opportunity:

The above shows the conjunct śra in the word śrī (a Sanskrit honorific). The half-form is produced by dropping the right vertical bar of U+11827 DOGRA LETTER SHA. The conjunct śra is used consistently in New Dogra for representing the cluster śra; the form <U+11827, U+11838, U+11824>, with visible halanta beneath a full form of SHA, does not occur in the available sources.

• Ligatures: There are two types of ligature. In the first, the shapes of individual letters are visible. In the below, the boxed text contains the conjunct sṭa from the word sṭāṃp, a transcription of the English “stamp”:

The second type is an atomic (akhaṇḍa) ligature, whose form may be considered a distinctive letter. Such a ligature occurs in the excerpt below:

The boxed text above contains the atomic ligature kṣa, which occurs in the word akṣaroṃ “in letters” (oblique form of akṣara “letter”). The conjunct kṣa is used consistently in New Dogra for representing the cluster kṣa. Its shape may be influenced by Devanagari kṣa.

• Forms of ra: Special conjunct forms of U+11824 DOGRA LETTER RA, e.g. repha or vattu, do not generally occur. Cluster-initial ra is represented as a full form with halanta, as in rṇa in the word saṃpūrṇa below:

and non-initial ra is treated similarly, as in pra in the word prabhu below:
However, there is one instance in the available sources in which cluster-initial ra is represented as a repha. The occurrence is curious because the word caturbhuja “square”, in which repha occurs, appears three times in proximity, but is spelled differently in all three instances:

In the boxed text at the top, the ra in caturbhuja is written with halanta: <U+1180F, U+11819, U+1182F, U+11824, U+11838, U+11821, U+1182F, U+11811> caturbhuja. In the text at bottom left, ra occurs with its inherent vowel intact, without halanta: <U+1180F, U+11819, U+1182F, U+11824, U+11821, U+1182F, U+11811> caturabhuja. At bottom right, ra is represented as repha: caturbhuja.

The usage of repha in Dogra is to be considered anomalous. There is no present need to support this form of ra in encoded representation.
3.1.2.2 Encoded representation of conjuncts

The Devanagari model for the representation of conjuncts is appropriate for Dogra. In encoded text, conjuncts are produced by placing the control character U+11838 DOGRA SIGN VIRAMA after each non-final consonant in a cluster, i.e. <(C, U+11838), C>.
Representations of conjuncts rendered as atomic ligatures are:

kṣa <U+1180A, U+11838, U+11828>
sṭa <U+11829, U+11838, U+11814>

Examples of conjuncts rendered using half-forms are:

śra <U+11827, U+11838, U+11824>

Conjuncts displayed using visible halanta are:

pra <U+1181E, U+11838, U+11824>
rṇa <U+11824, U+11838, U+11818>
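
The conjunct model described in this section can be sketched in Python. This is only an illustration under stated assumptions: it uses the code points proposed in this document (which a given Unicode version may not match), and the helper names `conjunct` and `code_points` are hypothetical.

```python
# Code points as listed in this proposal's names list (an assumption of this sketch).
VIRAMA = "\U00011838"  # DOGRA SIGN VIRAMA
KA = "\U0001180A"      # DOGRA LETTER KA
SSA = "\U00011828"     # DOGRA LETTER SSA
SHA = "\U00011827"     # DOGRA LETTER SHA
RA = "\U00011824"      # DOGRA LETTER RA


def conjunct(*consonants: str) -> str:
    """Encode a cluster with a virama after every consonant except the last."""
    if not consonants:
        raise ValueError("a cluster needs at least one consonant")
    return VIRAMA.join(consonants)


def code_points(text: str) -> list[str]:
    """Return the text as a list of U+XXXX strings, for inspection."""
    return [f"U+{ord(ch):04X}" for ch in text]


ksa = conjunct(KA, SSA)  # rendered as an atomic ligature by a suitable font
sra = conjunct(SHA, RA)  # rendered with a half-form of SHA
print(code_points(ksa))
print(code_points(sra))
```

Whether a sequence is displayed as a ligature, with a half-form, or with visible halanta is decided by the font, not by the encoded text.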
3.1.2.3 Modifying conjunct formation

As is the case in Devanagari and other Indic scripts, a ligature may be broken using the generic control character U+200C ZERO WIDTH NON-JOINER:

kṣa <U+1180A, U+11838, U+11828>
kṣa <U+1180A, U+11838, U+200C, U+11828>

The rendering of a conjunct as a ligature or with visible virama is a matter of font design.
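
As a minimal sketch of the mechanism just described, the following Python function inserts U+200C after each virama that is followed by a consonant, requesting the visible-virama form. The function name `break_ligatures` is hypothetical, and the consonant range assumes the code points proposed in this document.

```python
VIRAMA = "\U00011838"           # DOGRA SIGN VIRAMA (as proposed here)
ZWNJ = "\u200C"                 # ZERO WIDTH NON-JOINER
FIRST_CONSONANT = "\U0001180A"  # DOGRA LETTER KA
LAST_CONSONANT = "\U0001182B"   # DOGRA LETTER RRA
KA = "\U0001180A"
SSA = "\U00011828"


def break_ligatures(text: str) -> str:
    """Insert ZWNJ after every virama that directly precedes a consonant."""
    out = []
    for i, ch in enumerate(text):
        out.append(ch)
        nxt = text[i + 1] if i + 1 < len(text) else ""
        # Single-character strings compare by code point.
        if ch == VIRAMA and FIRST_CONSONANT <= nxt <= LAST_CONSONANT:
            out.append(ZWNJ)
    return "".join(out)


ligated = KA + VIRAMA + SSA
explicit = break_ligatures(ligated)
print([f"U+{ord(c):04X}" for c in explicit])
```

A virama at the end of a word, or one already followed by U+200C, is left unchanged, so applying the function twice gives the same result as applying it once.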
3.1.3 Punctuation

Abbreviation sign: The U+1183A DOGRA ABBREVIATION SIGN is used for indicating truncation of text. In the excerpt below, the highlighted text <U+11825, U+1182E, U+1183A> <U+11826, U+1183A> <U+11821, U+1182C, U+1183A> (lī va bhā 23) is an abbreviation for līlavatī bhāga 23:

Daṇḍā-s: The daṇḍā and double daṇḍā are commonly used in Dogra documents for marking the ends of sentences and paragraphs. The below excerpt shows usage of the daṇḍā:

and the following shows use of the double daṇḍā:

These daṇḍā-s are not proposed for encoding as separate characters in the ‘Dogra’ block. They resemble U+0964 DEVANAGARI DANDA and U+0965 DEVANAGARI DOUBLE DANDA and may be represented using these characters. The Devanagari daṇḍā-s are to be specified as extensions for Dogra.

Other signs: Other marks of punctuation, such as dashes, are also used:
A hyphen is used in some cases for indicating breaks in a word at end of line (see section 3.1.6).

3.1.4 Digits

Digits are commonly used in Dogra documents (see figure 11). The style of digits varies across written and printed sources. Some digits in Old Dogra sources resemble those of Takri (see figure 4), while those in New Dogra sources are similar to Devanagari. The excerpt below shows digits 1–9 and the number 10 as represented in the Līlavatī, written in New Dogra:

The majority of the digits shown here resemble Devanagari forms; the exception is ‘four’. The form of Dogra ‘four’ does not resemble U+116C4 TAKRI DIGIT FOUR, which is nearly identical to the Devanagari digit four. Information about its origins is not known at this time, but it may be related to U+111D4 SHARADA DIGIT FOUR. The forms of ‘eight’ and ‘nine’ are well-known variant forms of the regular Devanagari digits.

Additional research into Dogra digits is required before they may be considered for encoding. The available sources suggest that there may have been a cultural understanding that Dogra possessed unique digits. For instance, on the bank notes shown in figure 20, the digits written in Devanagari are rather similar, but are nonetheless displayed twice in order to align the display of the currency value with each script used for the inscriptions.

At present, script-specific digits are not proposed for inclusion in the Dogra repertoire. Devanagari digits should be used normatively and should be extended for usage with Dogra. If future research yields distinctive digits for Dogra, there is sufficient space in the block to accommodate their encoding.

3.1.5 Number forms

Fraction signs and currency marks are commonly used in Dogra sources (see figure 12). These may be represented using the fraction and currency signs already encoded in the ‘Common Indic Number Forms’ block (U+A830). These characters are specified as extensions for Dogra. The ‘Common Indic Number Forms’ are described briefly in The Unicode Standard (chapter 22.3, ‘Numerals’, pp. 770–771) and in more detail in the original encoding proposal (L2/07-354).

3.1.6 Linebreaking

Linebreaking occurs after an orthographic syllable. Breaks are generally not indicated in informal sources, but are marked in literary materials using hyphens.
3.1.7 Collation

The primary collating order for Dogra, given by code point, is as follows:

11800 < 11801 < 11802 < 11803 < 11804 < 11805 < 11806 < 11807 < 11808 < 11809 <
1180A < 1180B < 1180C < 1180D < 1180E < 1180F < 11810 < 11811 <
11812 < 11813 < 11814 < 11815 < 11816 < 11817 < 11818 < 11819 <
1181A < 1181B < 1181C < 1181D < 1181E < 1181F < 11820 < 11821 <
11822 < 11823 < 11824 < 11825 < 11826 < 11827 < 11828 < 11829 < 1182A <
1182B < 1182C < 1182D < 1182E <
1182F < 11830 < 11831 < 11832 <
11833 < 11834 < 11835

The following characters have secondary weights: U+11836 DOGRA SIGN ANUSVARA, U+11837 DOGRA SIGN VISARGA, U+11838 DOGRA SIGN VIRAMA, U+11839 DOGRA SIGN NUKTA.
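
The two-level ordering above can be approximated with a simple sort key. The sketch below is a deliberate simplification and not an implementation of the Unicode Collation Algorithm: it assumes the code points proposed in this document, treats primary order as code point order within 11800–11835, compares the four signs with secondary weights only as a tie-breaker, and ignores all other characters. The name `sort_key` is hypothetical.

```python
PRIMARY = range(0x11800, 0x11836)    # letters and vowel signs, in code point order
SECONDARY = range(0x11836, 0x1183A)  # anusvara, visarga, virama, nukta

KA, KHA = "\U0001180A", "\U0001180B"
AA_SIGN, UU_SIGN = "\U0001182C", "\U00011830"
ANUSVARA, NUKTA = "\U00011836", "\U00011839"


def sort_key(word: str) -> tuple:
    """Compare primary characters first; use secondary ones only to break ties."""
    primary = tuple(ord(ch) for ch in word if ord(ch) in PRIMARY)
    secondary = tuple(ord(ch) for ch in word if ord(ch) in SECONDARY)
    return (primary, secondary)


words = [KHA, KA + UU_SIGN, KA + ANUSVARA, KA + NUKTA + AA_SIGN, KA]
ordered = sorted(words, key=sort_key)
print([[f"{ord(c):04X}" for c in w] for w in ordered])
```

The example contains a nukta-bearing syllable to show where this key differs from a plain code point sort: the nukta is ignored at the primary level, so KA + NUKTA + AA sorts before KA + UU.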
4 Character data

4.1 Character properties

The properties for Dogra in the Unicode Character Database format are:

11800;DOGRA LETTER A;Lo;0;L;;;;;N;;;;;
11801;DOGRA LETTER AA;Lo;0;L;;;;;N;;;;;
11802;DOGRA LETTER I;Lo;0;L;;;;;N;;;;;
11804;DOGRA LETTER U;Lo;0;L;;;;;N;;;;;
11806;DOGRA LETTER E;Lo;0;L;;;;;N;;;;;
11807;DOGRA LETTER AI;Lo;0;L;;;;;N;;;;;
11808;DOGRA LETTER O;Lo;0;L;;;;;N;;;;;
1180A;DOGRA LETTER KA;Lo;0;L;;;;;N;;;;;
1180B;DOGRA LETTER KHA;Lo;0;L;;;;;N;;;;;
1180C;DOGRA LETTER GA;Lo;0;L;;;;;N;;;;;
1180D;DOGRA LETTER GHA;Lo;0;L;;;;;N;;;;;
1180E;DOGRA LETTER NGA;Lo;0;L;;;;;N;;;;;
1180F;DOGRA LETTER CA;Lo;0;L;;;;;N;;;;;
11810;DOGRA LETTER CHA;Lo;0;L;;;;;N;;;;;
11811;DOGRA LETTER JA;Lo;0;L;;;;;N;;;;;
11812;DOGRA LETTER JHA;Lo;0;L;;;;;N;;;;;
11813;DOGRA LETTER NYA;Lo;0;L;;;;;N;;;;;
11814;DOGRA LETTER TTA;Lo;0;L;;;;;N;;;;;
11815;DOGRA LETTER TTHA;Lo;0;L;;;;;N;;;;;
11816;DOGRA LETTER DDA;Lo;0;L;;;;;N;;;;;
11817;DOGRA LETTER DDHA;Lo;0;L;;;;;N;;;;;
11818;DOGRA LETTER NNA;Lo;0;L;;;;;N;;;;;
11819;DOGRA LETTER TA;Lo;0;L;;;;;N;;;;;
1181A;DOGRA LETTER THA;Lo;0;L;;;;;N;;;;;
1181B;DOGRA LETTER DA;Lo;0;L;;;;;N;;;;;
1181C;DOGRA LETTER DHA;Lo;0;L;;;;;N;;;;;
1181D;DOGRA LETTER NA;Lo;0;L;;;;;N;;;;;
1181E;DOGRA LETTER PA;Lo;0;L;;;;;N;;;;;
1181F;DOGRA LETTER PHA;Lo;0;L;;;;;N;;;;;
11820;DOGRA LETTER BA;Lo;0;L;;;;;N;;;;;
11821;DOGRA LETTER BHA;Lo;0;L;;;;;N;;;;;
11822;DOGRA LETTER MA;Lo;0;L;;;;;N;;;;;
11823;DOGRA LETTER YA;Lo;0;L;;;;;N;;;;;
11824;DOGRA LETTER RA;Lo;0;L;;;;;N;;;;;
11825;DOGRA LETTER LA;Lo;0;L;;;;;N;;;;;
11826;DOGRA LETTER VA;Lo;0;L;;;;;N;;;;;
11827;DOGRA LETTER SHA;Lo;0;L;;;;;N;;;;;
11828;DOGRA LETTER SSA;Lo;0;L;;;;;N;;;;;
11829;DOGRA LETTER SA;Lo;0;L;;;;;N;;;;;
1182A;DOGRA LETTER HA;Lo;0;L;;;;;N;;;;;
1182B;DOGRA LETTER RRA;Lo;0;L;;;;;N;;;;;
1182C;DOGRA VOWEL SIGN AA;Mc;0;L;;;;;N;;;;;
1182D;DOGRA VOWEL SIGN I;Mc;0;L;;;;;N;;;;;
1182E;DOGRA VOWEL SIGN II;Mc;0;L;;;;;N;;;;;
1182F;DOGRA VOWEL SIGN U;Mn;0;NSM;;;;;N;;;;;
11830;DOGRA VOWEL SIGN UU;Mn;0;NSM;;;;;N;;;;;
11831;DOGRA VOWEL SIGN VOCALIC R;Mn;0;NSM;;;;;N;;;;;
11832;DOGRA VOWEL SIGN E;Mn;0;NSM;;;;;N;;;;;
11833;DOGRA VOWEL SIGN AI;Mn;0;NSM;;;;;N;;;;;
11834;DOGRA VOWEL SIGN O;Mn;0;NSM;;;;;N;;;;;
11835;DOGRA VOWEL SIGN AU;Mn;0;NSM;;;;;N;;;;;
11836;DOGRA SIGN ANUSVARA;Mn;0;NSM;;;;;N;;;;;
11837;DOGRA SIGN VISARGA;Mc;0;L;;;;;N;;;;;
11838;DOGRA SIGN VIRAMA;Mn;9;NSM;;;;;N;;;;;
11839;DOGRA SIGN NUKTA;Mn;7;NSM;;;;;N;;;;;
1183A;DOGRA ABBREVIATION SIGN;Po;0;L;;;;;N;;;;;
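
For readers who want to process these records, the following Python sketch parses a few of them, assuming the standard semicolon-separated, 15-field layout of UnicodeData.txt. The `CharProps` type and `parse_line` function are hypothetical names introduced for illustration, and only the fields that are non-empty in this proposal are extracted.

```python
from typing import NamedTuple

# A few records from the list above, in UnicodeData.txt layout (15 fields each).
SAMPLE = """\
11800;DOGRA LETTER A;Lo;0;L;;;;;N;;;;;
1182F;DOGRA VOWEL SIGN U;Mn;0;NSM;;;;;N;;;;;
11838;DOGRA SIGN VIRAMA;Mn;9;NSM;;;;;N;;;;;
11839;DOGRA SIGN NUKTA;Mn;7;NSM;;;;;N;;;;;
"""


class CharProps(NamedTuple):
    code_point: int
    name: str
    general_category: str
    combining_class: int
    bidi_class: str
    mirrored: bool


def parse_line(line: str) -> CharProps:
    """Parse one record; raise ValueError if the field count is unexpected."""
    fields = line.strip().split(";")
    if len(fields) != 15:
        raise ValueError(f"expected 15 fields, got {len(fields)}")
    return CharProps(
        code_point=int(fields[0], 16),
        name=fields[1],
        general_category=fields[2],
        combining_class=int(fields[3]),
        bidi_class=fields[4],
        mirrored=fields[9] == "Y",
    )


props = [parse_line(line) for line in SAMPLE.splitlines()]
for p in props:
    print(f"U+{p.code_point:04X} {p.name}: gc={p.general_category} ccc={p.combining_class}")
```

The non-zero combining classes (9 for the virama, 7 for the nukta) are the values that matter for canonical ordering of combining marks.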
4.2 Linebreaking

Linebreaking properties are given in the data format of LineBreak.txt:

11800..1182B;AL # Lo  [44] DOGRA LETTER A..DOGRA LETTER RRA
1182C..1182E;CM # Mc   [3] DOGRA VOWEL SIGN AA..DOGRA VOWEL SIGN II
1182F..11835;CM # Mn   [7] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN AU
11836;CM        # Mn       DOGRA SIGN ANUSVARA
11837;CM        # Mc       DOGRA SIGN VISARGA
11838..11839;CM # Mn   [2] DOGRA SIGN VIRAMA..DOGRA SIGN NUKTA
1183A;AL        # Po       DOGRA ABBREVIATION SIGN
4.3 Syllabic categories

Syllabic categories are given in the format of IndicSyllabicCategory.txt:

# Indic_Syllabic_Category=Bindu
11836        ; Bindu # Mn       DOGRA SIGN ANUSVARA

# Indic_Syllabic_Category=Visarga
11837        ; Visarga # Mc       DOGRA SIGN VISARGA

# Indic_Syllabic_Category=Nukta
11839        ; Nukta # Mn       DOGRA SIGN NUKTA

# Indic_Syllabic_Category=Virama
11838        ; Virama # Mn       DOGRA SIGN VIRAMA

# Indic_Syllabic_Category=Vowel_Independent
11800..11809 ; Vowel_Independent # Lo  [10] DOGRA LETTER A..DOGRA LETTER AU

# Indic_Syllabic_Category=Vowel_Dependent
1182C..1182E ; Vowel_Dependent # Mc   [3] DOGRA VOWEL SIGN AA..DOGRA VOWEL SIGN II
1182F..11835 ; Vowel_Dependent # Mn   [7] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN AU

# Indic_Syllabic_Category=Consonant
1180A..1182B ; Consonant # Lo  [34] DOGRA LETTER KA..DOGRA LETTER RRA
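
The categories above, together with the statement in section 3.1.6 that line breaks occur after an orthographic syllable, suggest a simple way to find candidate break points. The Python sketch below is an assumption-laden simplification, not a normative syllable grammar: it uses the code points proposed in this document, a hand-written regular expression, and a hypothetical function name `syllables`.

```python
import re

# Character classes built from the syllabic categories (proposal code points).
CONSONANT = "[\U0001180A-\U0001182B]"
INDEPENDENT_VOWEL = "[\U00011800-\U00011809]"
VOWEL_SIGN = "[\U0001182C-\U00011835]"
BINDU_VISARGA = "[\U00011836\U00011837]"
VIRAMA = "\U00011838"
NUKTA = "\U00011839"
ZWNJ = "\u200C"

# A consonant cluster joined by viramas (or an independent vowel), followed by
# optional vowel signs, anusvara/visarga, and an optional final virama.
# Any other character forms a unit of its own.
CLUSTER = re.compile(
    f"(?:{CONSONANT}{NUKTA}?(?:{VIRAMA}{ZWNJ}?{CONSONANT}{NUKTA}?)*|{INDEPENDENT_VOWEL})"
    f"{VOWEL_SIGN}*{BINDU_VISARGA}*{VIRAMA}?"
    "|.",
    re.DOTALL,
)


def syllables(text: str) -> list[str]:
    """Split text into orthographic syllables; a break may follow each one."""
    return [m.group(0) for m in CLUSTER.finditer(text)]


# saṃpūrṇa: SA, ANUSVARA, PA, VOWEL SIGN UU, RA, VIRAMA, NNA
sampurna = "\U00011829\U00011836\U0001181E\U00011830\U00011824\U00011838\U00011818"
# prabhu: PA, VIRAMA, RA, BHA, VOWEL SIGN U
prabhu = "\U0001181E\U00011838\U00011824\U00011821\U0001182F"
print([len(s) for s in syllables(sampurna)])
print([len(s) for s in syllables(prabhu)])
```

Clusters written with visible halanta, such as rṇa and pra, are still single orthographic syllables in encoded text, so no break opportunity falls inside them.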
4.4 Positional categories

Positional data for characters are given in the format of IndicPositionalCategory.txt:

# Indic_Positional_Category=Right
1182C        ; Right # Mc       DOGRA VOWEL SIGN AA
1182E        ; Right # Mc       DOGRA VOWEL SIGN II
11837        ; Right # Mc       DOGRA SIGN VISARGA

# Indic_Positional_Category=Left
1182D        ; Left # Mc       DOGRA VOWEL SIGN I

# Indic_Positional_Category=Top
11836        ; Top # Mn       DOGRA SIGN ANUSVARA
11832..11835 ; Top # Mn   [4] DOGRA VOWEL SIGN E..DOGRA VOWEL SIGN AU

# Indic_Positional_Category=Bottom
1182F..11831 ; Bottom # Mn   [3] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN VOCALIC R
11838        ; Bottom # Mn       DOGRA SIGN VIRAMA
11839        ; Bottom # Mn       DOGRA SIGN NUKTA
4.5 Script extensions

The following characters should be extended for usage with the Dogra script in ScriptExtensions.txt:

0964       # Po       DEVANAGARI DANDA
0965       # Po       DEVANAGARI DOUBLE DANDA
0966..096F # Nd  [10] DEVANAGARI DIGIT ZERO..DEVANAGARI DIGIT NINE
A830..A835 # No   [6] NORTH INDIC FRACTION ONE QUARTER..NORTH INDIC FRACTION THREE SIXTEENTHS
A836..A837 # So   [2] NORTH INDIC QUARTER MARK..NORTH INDIC PLACEHOLDER MARK
A838       # Sc       NORTH INDIC RUPEE MARK
A839       # So       NORTH INDIC QUANTITY MARK
4.6 ‘Confusable’ characters

Some Dogra characters that resemble characters encoded in other script blocks are:

DOGRA LETTER U            11187 SHARADA LETTER U
DOGRA LETTER UU           11188 SHARADA LETTER UU
DOGRA LETTER O            11684 TAKRI LETTER U
DOGRA LETTER KA           0A15 GURMUKHI LETTER KA
DOGRA LETTER CA           11196 SHARADA LETTER CA
DOGRA LETTER TTHA         1119C SHARADA LETTER TTHA
DOGRA LETTER TA           11699 TAKRI LETTER TA
DOGRA LETTER DHA          11680 TAKRI LETTER A
DOGRA LETTER PHA          1169F TAKRI LETTER PHA
DOGRA LETTER BHA          116A1 TAKRI LETTER BHA
DOGRA LETTER YA           116A3 TAKRI LETTER YA
DOGRA LETTER SHA          0936 DEVANAGARI LETTER SHA
DOGRA LETTER SSA          111B0 SHARADA LETTER SSA
DOGRA VOWEL SIGN U        0941 DEVANAGARI VOWEL SIGN U
DOGRA VOWEL SIGN UU       0942 DEVANAGARI VOWEL SIGN UU
DOGRA VOWEL SIGN E        116B2 TAKRI VOWEL SIGN E
DOGRA VOWEL SIGN AI       116B3 TAKRI VOWEL SIGN AI
DOGRA VOWEL SIGN O        116B4 TAKRI VOWEL SIGN O
DOGRA VOWEL SIGN AU       116B5 TAKRI VOWEL SIGN AU
DOGRA SIGN ANUSVARA       0902 DEVANAGARI SIGN ANUSVARA
DOGRA SIGN VIRAMA         094D DEVANAGARI SIGN VIRAMA
DOGRA SIGN NUKTA          093C DEVANAGARI SIGN NUKTA
DOGRA ABBREVIATION SIGN   0970 DEVANAGARI ABBREVIATION SIGN
5 References

Bhāskarācārya. 1872. Līlavatī. Translation into Dogri. Jammu: Vidya Vilas Press.

Grierson, George A. 1916. The Linguistic Survey of India. Vol. IX. Indo-Aryan Family. Central Group. Part I. Specimens of Western Hindī and Pañjābī. Calcutta: Office of the Superintendent of Government Printing, India.

Leitner, Gottlieb William. 1882. “A Collection of Specimens of Commercial and Other Alphabets and Handwritings as also of Multiplication Tables Current in Various Parts of the Panjab, Sind and the North West Provinces”. Published as Appendix V of History of Indigenous Education in the Punjab. Lahore: Anjuman-i-Punjab Press.

Pandey, Anshuman. 2007. “Proposal to Encode North Indic Number Forms in ISO/IEC 10646” (L2/07-354). http://www.unicode.org/L2/L2007/07354-north-indic.pdf

Pandey, Anshuman. 2009. “Proposal to Encode the Takri Script in ISO/IEC 10646” (L2/09-424). http://www.unicode.org/L2/L2009/09424-takri.pdf

Pandey, Anshuman. 2015. “Preliminary Proposal to Encode the Dogra Script in Unicode” (L2/15-213). http://www.unicode.org/L2/L2015/15213-dogra.pdf

Staal, Fritz. 1984. Stamps of Jammu & Kashmir. New York: The Collectors Club.
von der Lin, Carol A. “Collection Kashmir”. “Philatelic Glossary: Dōgrī-Takrī Scripts”. http://www.kashmirstamps.com/DogriGloss.html
6 Acknowledgments

I would like to thank Christopher Shackle (School of Oriental and African Studies, London) for reviewing this proposal.

This project was made possible in part through a Google Research Award, granted to Deborah Anderson for the Script Encoding Initiative, and a grant from the United States National Endowment for the Humanities (PR-50205-15), which funds the Universal Scripts Project (part of the Script Encoding Initiative at the University of California, Berkeley). Any views, findings, conclusions or recommendations expressed in this publication do not necessarily reflect those of Google or the National Endowment for the Humanities.
15
Printed using UniBooktrade(httpwwwunicodeorgunibook)
Printed 04-Nov-2015 1
1184FDogra11800
1180 1181 1182 1183 1184
a
71681
i
71683
u
71685
e
71687
o
71689
71690
71691
71692
71693
71694
71695
71696
71697
71698
71699
71700
71701
71702
71703
71704
71705
71706
71707
71708
71709
71710
71711
71712
71713
71714
71715
71716
71717
71718
71719
71720
71721
71722
71723
$71724
$ i
$71726
$ u
$ 71728
$ r
$ e
$ 71731
$ o
$ 71733
$ 71734
$71735
$ 71736
$ 71737
71738
11800
11801
11802
11803
11804
11805
11806
11807
11808
11809
1180A
1180B
1180C
1180D
1180E
1180F
11810
11811
11812
11813
11814
11815
11816
11817
11818
11819
1181A
1181B
1181C
1181D
1181E
1181F
11820
11821
11822
11823
11824
11825
11826
11827
11828
11829
1182A
1182B
1182C
1182D
1182E
1182F
11830
11831
11832
11833
11834
11835
11836
11837
11838
11839
1183A
0
1
2
3
4
5
6
7
8
9
A
B
C
D
E
F
Printed using UniBooktrade(httpwwwunicodeorgunibook)
Printed 04-Nov-2015 2
1183ADogra11800
Punctuation1183A 71738 DOGRA ABBREVIATION SIGN
Independent vowels11800 a DOGRA LETTER A11801 71681 DOGRA LETTER AA11802 i DOGRA LETTER I11803 71683 DOGRA LETTER II11804 u DOGRA LETTER U11805 71685 DOGRA LETTER UU11806 e DOGRA LETTER E11807 71687 DOGRA LETTER AI11808 o DOGRA LETTER O11809 71689 DOGRA LETTER AU
Consonants1180A 71690 DOGRA LETTER KA1180B 71691 DOGRA LETTER KHA1180C 71692 DOGRA LETTER GA1180D 71693 DOGRA LETTER GHA1180E 71694 DOGRA LETTER NGA1180F 71695 DOGRA LETTER CA11810 71696 DOGRA LETTER CHA11811 71697 DOGRA LETTER JA11812 71698 DOGRA LETTER JHA11813 71699 DOGRA LETTER NYA11814 71700 DOGRA LETTER TTA11815 71701 DOGRA LETTER TTHA11816 71702 DOGRA LETTER DDA11817 71703 DOGRA LETTER DDHA11818 71704 DOGRA LETTER NNA11819 71705 DOGRA LETTER TA1181A 71706 DOGRA LETTER THA1181B 71707 DOGRA LETTER DA1181C 71708 DOGRA LETTER DHA1181D 71709 DOGRA LETTER NA1181E 71710 DOGRA LETTER PA1181F 71711 DOGRA LETTER PHA11820 71712 DOGRA LETTER BA11821 71713 DOGRA LETTER BHA11822 71714 DOGRA LETTER MA11823 71715 DOGRA LETTER YA11824 71716 DOGRA LETTER RA11825 71717 DOGRA LETTER LA11826 71718 DOGRA LETTER VA11827 71719 DOGRA LETTER SHA11828 71720 DOGRA LETTER SSA11829 71721 DOGRA LETTER SA1182A 71722 DOGRA LETTER HA1182B 71723 DOGRA LETTER RRA
Dependent vowel signs1182C $71724 DOGRA VOWEL SIGN AA1182D $ i DOGRA VOWEL SIGN I1182E $71726 DOGRA VOWEL SIGN II1182F $ u DOGRA VOWEL SIGN U11830 $ 71728 DOGRA VOWEL SIGN UU11831 $ r DOGRA VOWEL SIGN VOCALIC R11832 $ e DOGRA VOWEL SIGN E11833 $ 71731 DOGRA VOWEL SIGN AI11834 $ o DOGRA VOWEL SIGN O11835 $ 71733 DOGRA VOWEL SIGN AU
Various signs11836 $ 71734 DOGRA SIGN ANUSVARA11837 $71735 DOGRA SIGN VISARGA11838 $ 71736 DOGRA SIGN VIRAMA11839 $ 71737 DOGRA SIGN NUKTA
Proposal to encode the Dogra script in Unicode Anshuman Pandey
Figure 1 Comparison of Old Dogra with Takri and other scripts (from Leitner 1882 Set 2)
18
Proposal to encode the Dogra script in Unicode Anshuman Pandey
Figure 2 Comparison of Old Dogra with Takri and other scripts (from Leitner 1882 Set 2)
19
Proposal to encode the Dogra script in Unicode Anshuman Pandey
Figure 3 Specimen of the Dogri language written in Old Dogra (from Grierson 1916 760)
20
Proposal to encode the Dogra script in Unicode Anshuman Pandey
Figure 4 Another specimen of the Dogri language written in Old Dogra (fromGrierson 1916 772)
21
Proposal to encode the Dogra script in Unicode Anshuman Pandey
Figure 5 Chart of the Old Dogra script (from Grierson 1916 641)
22
Proposal to encode the Dogra script in Unicode Anshuman Pandey
Figure 6 Comparison of Gurmukhi Kangra and Old Dogra scripts (from Grierson 1916 642)
23
Proposal to encode the Dogra script in Unicode Anshuman Pandey
Figure 7 Comparison of New Dogra (lsquoDogrirsquo) and Devanagari vowel and consonant letters (fromStaal 1984 33)
Figure 8 Comparison of New Dogra (lsquoDogrirsquo) and Devanagari vowel signs (from Staal 1984 34)
24
Proposal to encode the Dogra script in Unicode Anshuman Pandey
Figure 9 The first page of a Dogri translation of the Līlavatī by Bhāskarācārya a Sanskrit treastiseon mathematics printed in the New Dogra script in 1872 (p 1) Figure 10 contains the second pageof this book
25
Proposal to encode the Dogra script in Unicode Anshuman Pandey
Figure 10 Another page from the Līlavatī in New Dogra (p 2)
26
Proposal to encode the Dogra script in Unicode Anshuman Pandey
Figure 11 Page from the Līlavatī showing the use of printed New Dogra digits (p 3)
27
Proposal to encode the Dogra script in Unicode Anshuman Pandey
Figure 12 Usage of fractions for denoting currency in New Dogra (from Staal 1984 75) The textappears to be typeset using metal fonts The fraction signs are currency marks may be representedusing characters from the lsquoCommon Indic Number Formsrsquo block (U+A830) in Unicode
28
Proposal to encode the Dogra script in Unicode Anshuman Pandey
Figure 13 Two specimens of a trilingual stamp for expedited mail from Jammu State in Old DograPersian and Devanagari scripts The Dogra text reads 716907169071717 7169771716717277171671726 kakal jarurī The Persian readsضروری خط ḵẖat zarūrī the Devanagari reads āvaśyaka patraआव यक प
Figure 14 Four bilingual regular postage stamps from Jammu State with text in the Old Dogra andPersian scripts Stamps and are half lsquoannarsquo stamps and is a one lsquoannarsquo stamp In all stampsthe Dogra text at the top reads 716977171471684 jamu lsquoJammursquo and 7169071696717147172671716 kachamīr (= kaśmīr) lsquoKashmirrsquoIn stamps and the first line of the center is Persian انه نم ۱۹۲۳ 1923 [samvat] nim ānah lsquohalfānnārsquo the second line of the center is Dogra71680717077168071709 ada ana lsquohalf ānnārsquo In the first line of thecenter text is Persian انه یك yak ānah lsquoone ānnārsquo the second line of the center is Dogra71686716907168071709 ekana lsquoone ānnārsquo In all four stamps the third line of the center is the date 1923 [samvat] written inDevanagari-like digits १९२३ The Persian text at the bottom of the stamp reads جمون سرکار قلمروکشمیر و qalamrau sarkār jammūn o kaśmīr lsquodominions of the ruler of Jammu and Kashmirrsquo
Detail of top left (Devanagari)
Detail of top right (New Dogra)
Detail of center (New Dogra, Devanagari, and Urdu)
Figure 15: The mast of a two rupee non-judicial stamp paper from Kashmir State.
Figure 16: The mast of a five rupee non-judicial stamp paper from Kashmir State containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.
Figure 17: A one anna stamp from Jammu and Kashmir State with text in New Dogra, Devanagari, and Arabic (Urdu).
Figure 18: Telegraph stamps from Jammu and Kashmir State with text in Dogra, Devanagari, Urdu, and English. The Dogra denominations show usage of Common Indic fractions and the rupee mark.
postakarda kalamrau jammu kasmir va tibbataha: “Postcard of the realm of Jammu, Kashmir, and Tibet”
isa tarapha siva pate ke kucha na likho: “Do not write anything on this side except for the address”
jamu, sinagar: “Jammu”, “Srinagar”
bhaisakh (4), 4 vaisakh: “Vaisakh 4”, “4 Vaisakh”
Figure 19: A state postcard from Jammu and Kashmir with post mark of 19 April 1891. The inscription at top is in New Dogra and the content is in Persian. The postmarks are in English and Dogra.
One rupee banknote
Twenty rupee banknote
Figure 20: Bank notes from Jammu and Kashmir State containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.
Vowel letters
Letter  Chamba   Dogra
a       U+11680  U+11800
ā       U+11681  U+11801
i       U+11682  U+11802
ī       U+11683  —
u       U+11684  U+11804
ū       U+11685  —
e       U+11686  U+11806
ai      U+11687  U+11807
o       U+11688  U+11808
au      U+11689  —
Vowel signs
Sign    Chamba   Dogra
a       no dependent form
ā       U+116AD  U+1182C
i       U+116AE  U+1182D
ī       U+116AF  U+1182E
u       U+116B0  U+1182F
ū       U+116B1  U+11830
e       U+116B2  U+11832
ai      U+116B3  U+11833
o       U+116B4  U+11834
au      U+116B5  U+11835
Table 2: Comparison of Chamba and Dogra vowel letters and signs
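Letter  Chamba   Dogra
ka      U+1168A  U+1180A
kha     U+1168B  U+1180B
ga      U+1168C  U+1180C
gha     U+1168D  U+1180D
ṅa      U+1168E  U+1180E
ca      U+1168F  U+1180F
cha     U+11690  U+11810
ja      U+11691  U+11811
jha     U+11692  U+11812
ña      U+11693  U+11813
ṭa      U+11694  U+11814
ṭha     U+11695  U+11815
ḍa      U+11696  U+11816
ḍha     U+11697  U+11817
ṇa      U+11698  U+11818
ta      U+11699  U+11819
tha     U+1169A  U+1181A
da      U+1169B  U+1181B
dha     U+1169C  U+1181C
na      U+1169D  U+1181D
pa      U+1169E  U+1181E
pha     U+1169F  U+1181F
ba      U+116A0  U+11820
bha     U+116A1  U+11821
ma      U+116A2  U+11822
ya      U+116A3  U+11823
ra      U+116A4  U+11824
la      U+116A5  U+11825
va      U+116A6  U+11826
śa      U+116A7  U+11827
ṣa      —        U+11828
sa      U+116A8  U+11829
ha      U+116A9  U+1182A
ṛa      U+116AA  U+1182B
Table 3: Comparison of Chamba and Dogra consonant letters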
ISO/IEC JTC 1/SC 2/WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646
Form number: N4502-F (Original 1994-10-14; Revised 1995-01, 1995-04, 1996-04, 1996-08, 1999-03, 2001-05, 2001-09, 2003-11, 2005-01, 2005-09, 2005-10, 2007-03, 2008-05, 2009-11, 2011-03, 2012-01)
Please fill all the sections A, B and C below. Please read Principles and Procedures Document (P & P) from http://std.dkuug.dk/JTC1/SC2/WG2/docs/principles.html for guidelines and details before filling this form. Please ensure you are using the latest Form from http://std.dkuug.dk/JTC1/SC2/WG2/docs/summaryform.html. See also http://std.dkuug.dk/JTC1/SC2/WG2/docs/roadmaps.html for latest Roadmaps.
A. Administrative
1. Title: Proposal to encode the Dogra script in Unicode
2. Requester's name: Anshuman Pandey (Script Encoding Initiative)
3. Requester type (Member body/Liaison/Individual contribution): Liaison contribution
4. Submission date: 4 November 2015
5. Requester's reference (if applicable):
6. Choose one of the following: This is a complete proposal: Yes; (or) More information will be provided later:
B. Technical – General
1. Choose one of the following:
a. This proposal is for a new script (set of characters): Yes. Proposed name of script: Dogra
b. The proposal is for addition of character(s) to an existing block: Name of the existing block:
2. Number of characters in proposal: 59
3. Proposed category (select one from below; see section 2.2 of P&P document): A-Contemporary; B.1-Specialized (small collection); B.2-Specialized (large collection); C-Major extinct; D-Attested extinct; E-Minor extinct: X; F-Archaic Hieroglyphic or Ideographic; G-Obscure or questionable usage symbols
4. Is a repertoire including character names provided? Yes
a. If YES, are the names in accordance with the “character naming guidelines” in Annex L of P&P document?
b. Are the character shapes attached in a legible form suitable for review?
5. Fonts related:
a. Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the standard? Anshuman Pandey
b. Identify the party granting a license for use of the font by the editors (include address, e-mail, ftp-site, etc.): Anshuman Pandey
6. References:
a. Are references (to other character sets, dictionaries, descriptive texts etc.) provided? Yes
b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? Yes
7. Special encoding issues: Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration etc. (if yes please enclose information)? Yes
8. Additional Information: Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org for such information on other scripts. Also see Unicode Character Database (http://www.unicode.org/reports/tr44/) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard.
C. Technical – Justification
1. Has this proposal for addition of character(s) been submitted before? No. If YES, explain:
2. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? Yes. If YES, with whom? Christopher Shackle (SOAS, London). If YES, available relevant documents:
3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? No. Reference: Historical script; currently not used.
4. The context of use for the proposed characters (type of use; common or rare): Common. Reference:
5. Are the proposed characters in current use by the user community? No. If YES, where? Reference:
6. After giving due considerations to the principles in the P&P document must the proposed characters be entirely in the BMP? N/A. If YES, is a rationale provided? If YES, reference:
7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? Yes
8. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? No. If YES, is a rationale for its inclusion provided? If YES, reference:
9. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? No. If YES, is a rationale for its inclusion provided? If YES, reference:
10. Can any of the proposed character(s) be considered to be similar (in appearance or function) to, or could be confused with, an existing character? No. If YES, is a rationale for its inclusion provided? If YES, reference:
11. Does the proposal include use of combining characters and/or use of composite sequences? Yes. If YES, is a rationale for such use provided? Yes. If YES, reference: See text of proposal. Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided? If YES, reference:
12. Does the proposal contain characters with any special properties such as control function or similar semantics? Yes. If YES, describe in detail (include attachment if necessary): Virama; see text of proposal.
13. Does the proposal contain any Ideographic compatibility characters? No. If YES, are the equivalent corresponding unified ideographic characters identified? If YES, reference:
However, there is one instance in the available sources in which cluster-initial ra is represented as a repha. The occurrence is curious because the word caturbhuja “square”, in which repha occurs, appears three times in proximity but is spelled differently in all three instances.
In the boxed text at the top, the ra in caturbhuja is written with halanta: <U+1180F, U+11819, U+1182F, U+11824, U+11838, U+11821, U+1182F, U+11811> caturbhuja. In the text at bottom left, ra occurs with its inherent vowel intact, without halanta: <U+1180F, U+11819, U+1182F, U+11824, U+11821, U+1182F, U+11811> caturabhuja. At bottom right, ra is represented as repha: caturbhuja.
The usage of repha in Dogra is to be considered anomalous. There is no present need to support this form of ra in encoded representation.
3.12.2 Encoded representation of conjuncts
The Devanagari model for the representation of conjuncts is appropriate for Dogra. In encoded text, conjuncts are produced by placing the control character U+11838 DOGRA SIGN VIRAMA after each non-final consonant in a cluster, i.e. <(C, U+11838), C>.
Representations of conjuncts rendered as atomic ligatures are:
kṣa <U+1180A KA, U+11838 VIRAMA, U+11828 SSA>
sṭa <U+11829 SA, U+11838 VIRAMA, U+11814 TTA>
Examples of conjuncts rendered using half-forms are:
śra <U+11827 SHA, U+11838 VIRAMA, U+11824 RA>
Conjuncts displayed using visible halanta are:
pra <U+1181E PA, U+11838 VIRAMA, U+11824 RA>
rṇa <U+11824 RA, U+11838 VIRAMA, U+11818 NNA>
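The virama model for conjuncts can be sketched in a few lines of Python. This is only an illustration and not part of the proposal: the helper names `conjunct` and `show` are hypothetical, and the code points are the ones proposed in this document, which may differ from the values in a published version of the standard.

```python
# Code points as proposed in this document (assumption: they may change
# before publication in the Unicode Standard).
VIRAMA = 0x11838
KA, SSA, SA, TTA = 0x1180A, 0x11828, 0x11829, 0x11814

def conjunct(*consonants):
    """Return the encoded sequence <C, VIRAMA, ..., C> for a consonant cluster."""
    if not consonants:
        raise ValueError("a cluster needs at least one consonant")
    seq = []
    for c in consonants[:-1]:
        seq.extend([c, VIRAMA])   # the virama follows every non-final consonant
    seq.append(consonants[-1])    # the final consonant carries the vowel
    return seq

def show(seq):
    """Format a code point sequence in U+XXXX notation."""
    return " ".join(f"U+{cp:04X}" for cp in seq)

print(show(conjunct(KA, SSA)))   # kṣa
print(show(conjunct(SA, TTA)))   # sṭa
```

Whether such a sequence is displayed as a ligature, with a half-form, or with a visible halanta is decided by the font, not by the encoded text.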
3.12.3 Modifying conjunct formation
As is the case in Devanagari and other Indic scripts, a ligature may be broken using the generic control character U+200C ZERO WIDTH NON-JOINER (ZWNJ):
kṣa (ligature) <U+1180A KA, U+11838 VIRAMA, U+11828 SSA>
kṣa (visible halanta) <U+1180A KA, U+11838 VIRAMA, U+200C ZWNJ, U+11828 SSA>
The rendering of a conjunct as a ligature or with visible halanta is a matter of font design.
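As a rough sketch of how text could be prepared so that ligatures are broken, the following Python function inserts U+200C after each virama that is followed by another character. The function name `break_ligatures` is hypothetical, and the virama code point is the one proposed in this document.

```python
VIRAMA = 0x11838   # DOGRA SIGN VIRAMA as proposed in this document (assumption)
ZWNJ = 0x200C      # ZERO WIDTH NON-JOINER

def break_ligatures(seq):
    """Insert ZWNJ after every virama that is followed by another character."""
    out = []
    for i, cp in enumerate(seq):
        out.append(cp)
        has_next = i + 1 < len(seq)
        # Skip positions that already carry a ZWNJ so the function is idempotent.
        if cp == VIRAMA and has_next and seq[i + 1] != ZWNJ:
            out.append(ZWNJ)
    return out

ksa = [0x1180A, VIRAMA, 0x11828]
print([f"U+{cp:04X}" for cp in break_ligatures(ksa)])
```

A word-final virama is left untouched, since there is no following consonant with which it could form a ligature.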
3.13 Punctuation
Abbreviation sign: The U+1183A DOGRA ABBREVIATION SIGN is used for indicating truncation of text. In the excerpt below, the highlighted text <U+11825, U+1182E, U+1183A> <U+11826, U+1183A> <U+11821, U+1182C, U+1183A> (lī va bhā 23) is an abbreviation for līla vatī bhāga 23.
Danda-s: The daṇḍā and double daṇḍā are commonly used in Dogra documents for marking the ends of sentences and paragraphs. The excerpt below shows usage of the daṇḍā, and the following shows use of the double daṇḍā.
These daṇḍā-s are not proposed for encoding as separate characters in the ‘Dogra’ block. They resemble U+0964 DEVANAGARI DANDA and U+0965 DEVANAGARI DOUBLE DANDA and may be represented using these characters. The Devanagari daṇḍā-s are to be specified as extensions for Dogra.
Other signs: Other marks of punctuation, such as dashes, are also used.
A hyphen is used in some cases for indicating breaks in a word at end of line (see section 3.16).
3.14 Digits
Digits are commonly used in Dogra documents (see figure 11). The style of digits varies across written and printed sources. Some digits in Old Dogra sources resemble those of Takri (see figure 4), while those in New Dogra sources are similar to Devanagari. The excerpt below shows digits 1–9 and the number 10 as represented in the Līlavatī, written in New Dogra.
The majority of the digits shown here resemble Devanagari forms; the exception is ‘four’. The form of Dogra ‘four’ does not resemble U+116C4 TAKRI DIGIT FOUR, which is nearly identical to the Devanagari digit four. Information about its origins is not known at this time, but it may be related to U+111D4 SHARADA DIGIT FOUR. The forms of ‘eight’ and ‘nine’ are well-known variant forms of the regular Devanagari digits.
Additional research into Dogra digits is required before they may be considered for encoding. The available sources suggest that there may have been a cultural understanding that Dogra possessed unique digits. For instance, on the bank notes shown in figure 20, the digits written in Devanagari are rather similar, but are nonetheless displayed twice in order to align the display of the currency value with each script used for the inscriptions.
At present, script-specific digits are not proposed for inclusion in the Dogra repertoire. Devanagari digits should be used normatively and should be extended for usage with Dogra. If future research yields distinctive digits for Dogra, there is sufficient space in the block to accommodate their encoding.
3.15 Number forms
Fraction signs and currency marks are commonly used in Dogra sources (see figure 12). These may be represented using the fraction and currency signs already encoded in the ‘Common Indic Number Forms’ block (U+A830). These characters are specified as extensions for Dogra. The ‘Common Indic Number Forms’ are described briefly in The Unicode Standard (chapter 22.3, ‘Numerals’, pp. 770–771) and in more detail in the original encoding proposal (L2/07-354).
3.16 Linebreaking
Linebreaking occurs after an orthographic syllable. Breaks are generally not indicated in informal sources, but are marked in literary materials using hyphens.
3.17 Collation
The primary collating order for Dogra is as follows:
U+11800 < U+11801 < U+11802 < U+11803 < U+11804 < U+11805 < U+11806 < U+11807 < U+11808 < U+11809 <
U+1180A < U+1180B < U+1180C < U+1180D < U+1180E < U+1180F < U+11810 < U+11811 <
U+11812 < U+11813 < U+11814 < U+11815 < U+11816 < U+11817 < U+11818 < U+11819 <
U+1181A < U+1181B < U+1181C < U+1181D < U+1181E < U+1181F < U+11820 < U+11821 <
U+11822 < U+11823 < U+11824 < U+11825 < U+11826 < U+11827 < U+11828 < U+11829 < U+1182A <
U+1182B < U+1182C < U+1182D < U+1182E <
U+1182F < U+11830 < U+11831 < U+11832 <
U+11833 < U+11834 < U+11835
The following characters have secondary weights: U+11836, U+11837, U+11838, U+11839.
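A two-level sort key along these lines might look as follows in Python. This is a simplified sketch, not a conformant collation implementation: the function name `sort_key` is hypothetical, the code points are those proposed in this document, primary weights are assumed to follow code point order (as the list above does), and characters outside the block are simply ignored.

```python
PRIMARY = range(0x11800, 0x11836)    # letters and vowel signs, in code point order
SECONDARY = range(0x11836, 0x1183A)  # the four signs with secondary weights

def sort_key(text):
    """Return a (primary, secondary) key for a string of proposed Dogra characters."""
    primary, secondary = [], []
    for ch in text:
        cp = ord(ch)
        if cp in PRIMARY:
            primary.append(cp - PRIMARY.start + 1)
            secondary.append(0)
        elif cp in SECONDARY and secondary:
            # Fold the sign into the secondary weight of the preceding character.
            secondary[-1] = secondary[-1] * 5 + (cp - SECONDARY.start + 1)
        # Anything else is ignored in this sketch.
    return tuple(primary), tuple(secondary)

KA, KHA, AA, ANUSVARA = (chr(c) for c in (0x1180A, 0x1180B, 0x1182C, 0x11836))
words = [KHA, KA + AA, KA + ANUSVARA, KA]
for word in sorted(words, key=sort_key):
    print(" ".join(f"U+{ord(c):04X}" for c in word))
```

Because the secondary level is compared only when the primary levels are equal, a sign such as the anusvara distinguishes two otherwise identical strings without disturbing the order of the letters.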
4 Character data
4.1 Character properties
The properties for Dogra in the Unicode Character Database format are:
11800;DOGRA LETTER A;Lo;0;L;;;;;N;;;;;
11801;DOGRA LETTER AA;Lo;0;L;;;;;N;;;;;
11802;DOGRA LETTER I;Lo;0;L;;;;;N;;;;;
11804;DOGRA LETTER U;Lo;0;L;;;;;N;;;;;
11806;DOGRA LETTER E;Lo;0;L;;;;;N;;;;;
11807;DOGRA LETTER AI;Lo;0;L;;;;;N;;;;;
11808;DOGRA LETTER O;Lo;0;L;;;;;N;;;;;
1180A;DOGRA LETTER KA;Lo;0;L;;;;;N;;;;;
1180B;DOGRA LETTER KHA;Lo;0;L;;;;;N;;;;;
1180C;DOGRA LETTER GA;Lo;0;L;;;;;N;;;;;
1180D;DOGRA LETTER GHA;Lo;0;L;;;;;N;;;;;
1180E;DOGRA LETTER NGA;Lo;0;L;;;;;N;;;;;
1180F;DOGRA LETTER CA;Lo;0;L;;;;;N;;;;;
11810;DOGRA LETTER CHA;Lo;0;L;;;;;N;;;;;
11811;DOGRA LETTER JA;Lo;0;L;;;;;N;;;;;
11812;DOGRA LETTER JHA;Lo;0;L;;;;;N;;;;;
11813;DOGRA LETTER NYA;Lo;0;L;;;;;N;;;;;
11814;DOGRA LETTER TTA;Lo;0;L;;;;;N;;;;;
11815;DOGRA LETTER TTHA;Lo;0;L;;;;;N;;;;;
11816;DOGRA LETTER DDA;Lo;0;L;;;;;N;;;;;
11817;DOGRA LETTER DDHA;Lo;0;L;;;;;N;;;;;
11818;DOGRA LETTER NNA;Lo;0;L;;;;;N;;;;;
11819;DOGRA LETTER TA;Lo;0;L;;;;;N;;;;;
1181A;DOGRA LETTER THA;Lo;0;L;;;;;N;;;;;
1181B;DOGRA LETTER DA;Lo;0;L;;;;;N;;;;;
1181C;DOGRA LETTER DHA;Lo;0;L;;;;;N;;;;;
1181D;DOGRA LETTER NA;Lo;0;L;;;;;N;;;;;
1181E;DOGRA LETTER PA;Lo;0;L;;;;;N;;;;;
1181F;DOGRA LETTER PHA;Lo;0;L;;;;;N;;;;;
11820;DOGRA LETTER BA;Lo;0;L;;;;;N;;;;;
11821;DOGRA LETTER BHA;Lo;0;L;;;;;N;;;;;
11822;DOGRA LETTER MA;Lo;0;L;;;;;N;;;;;
11823;DOGRA LETTER YA;Lo;0;L;;;;;N;;;;;
11824;DOGRA LETTER RA;Lo;0;L;;;;;N;;;;;
11825;DOGRA LETTER LA;Lo;0;L;;;;;N;;;;;
11826;DOGRA LETTER VA;Lo;0;L;;;;;N;;;;;
11827;DOGRA LETTER SHA;Lo;0;L;;;;;N;;;;;
11828;DOGRA LETTER SSA;Lo;0;L;;;;;N;;;;;
11829;DOGRA LETTER SA;Lo;0;L;;;;;N;;;;;
1182A;DOGRA LETTER HA;Lo;0;L;;;;;N;;;;;
1182B;DOGRA LETTER RRA;Lo;0;L;;;;;N;;;;;
1182C;DOGRA VOWEL SIGN AA;Mc;0;L;;;;;N;;;;;
1182D;DOGRA VOWEL SIGN I;Mc;0;L;;;;;N;;;;;
1182E;DOGRA VOWEL SIGN II;Mc;0;L;;;;;N;;;;;
1182F;DOGRA VOWEL SIGN U;Mn;0;NSM;;;;;N;;;;;
11830;DOGRA VOWEL SIGN UU;Mn;0;NSM;;;;;N;;;;;
11831;DOGRA VOWEL SIGN VOCALIC R;Mn;0;NSM;;;;;N;;;;;
11832;DOGRA VOWEL SIGN E;Mn;0;NSM;;;;;N;;;;;
11833;DOGRA VOWEL SIGN AI;Mn;0;NSM;;;;;N;;;;;
11834;DOGRA VOWEL SIGN O;Mn;0;NSM;;;;;N;;;;;
11835;DOGRA VOWEL SIGN AU;Mn;0;NSM;;;;;N;;;;;
11836;DOGRA SIGN ANUSVARA;Mn;0;NSM;;;;;N;;;;;
11837;DOGRA SIGN VISARGA;Mc;0;L;;;;;N;;;;;
11838;DOGRA SIGN VIRAMA;Mn;9;NSM;;;;;N;;;;;
11839;DOGRA SIGN NUKTA;Mn;7;NSM;;;;;N;;;;;
1183A;DOGRA ABBREVIATION SIGN;Po;0;L;;;;;N;;;;;
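Entries in the Unicode Character Database format are semicolon-separated records, so they are easy to read programmatically. The sketch below parses three of the proposed entries; the `CharProps` type and `parse_line` function are hypothetical names introduced for illustration, and only the first five fields are kept.

```python
from typing import NamedTuple

class CharProps(NamedTuple):
    code_point: int
    name: str
    general_category: str
    combining_class: int
    bidi_class: str

def parse_line(line):
    """Parse one UnicodeData-style record, keeping the first five fields."""
    fields = line.strip().split(";")
    return CharProps(
        code_point=int(fields[0], 16),
        name=fields[1],
        general_category=fields[2],
        combining_class=int(fields[3]),
        bidi_class=fields[4],
    )

# Three records taken from the proposed property list.
sample = """\
11824;DOGRA LETTER RA;Lo;0;L;;;;;N;;;;;
1182F;DOGRA VOWEL SIGN U;Mn;0;NSM;;;;;N;;;;;
11838;DOGRA SIGN VIRAMA;Mn;9;NSM;;;;;N;;;;;
"""
props = {p.code_point: p for p in map(parse_line, sample.splitlines())}
print(props[0x11838])
```

The canonical combining class of 9 on the virama is the value shared by viramas in other Indic scripts, which is what lets generic shaping and normalization code treat it consistently.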
4.2 Linebreaking
Linebreaking properties, given in the data format of LineBreak.txt:
11800..1182B;AL # Lo [44] DOGRA LETTER A..DOGRA LETTER RRA
1182C..1182E;CM # Mc [3] DOGRA VOWEL SIGN AA..DOGRA VOWEL SIGN II
1182F..11835;CM # Mn [7] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN AU
11836;CM # Mn DOGRA SIGN ANUSVARA
11837;CM # Mc DOGRA SIGN VISARGA
11838..11839;CM # Mn [2] DOGRA SIGN VIRAMA..DOGRA SIGN NUKTA
1183A;AL # Po DOGRA ABBREVIATION SIGN
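Range-based data of this kind can be loaded into a small lookup table. The following Python sketch assumes the usual LineBreak.txt layout (`first..last;class # comment`); the function names `parse_linebreak` and `line_break_class` are hypothetical, and `XX` is used here as the fallback for code points not covered by the sample.

```python
def parse_linebreak(lines):
    """Parse 'XXXX..YYYY;CLASS # comment' lines into (first, last, class) tuples."""
    ranges = []
    for line in lines:
        data = line.split("#", 1)[0].strip()   # drop the trailing comment
        if not data:
            continue
        span, lb_class = (part.strip() for part in data.split(";"))
        first, _, last = span.partition("..")
        ranges.append((int(first, 16), int(last or first, 16), lb_class))
    return ranges

def line_break_class(cp, ranges, default="XX"):
    """Look up the line break class of a code point in the parsed ranges."""
    for first, last, lb_class in ranges:
        if first <= cp <= last:
            return lb_class
    return default

sample = [
    "11800..1182B;AL # Lo [44] DOGRA LETTER A..DOGRA LETTER RRA",
    "1182C..1182E;CM # Mc [3] DOGRA VOWEL SIGN AA..DOGRA VOWEL SIGN II",
    "11838..11839;CM # Mn [2] DOGRA SIGN VIRAMA..DOGRA SIGN NUKTA",
    "1183A;AL # Po DOGRA ABBREVIATION SIGN",
]
ranges = parse_linebreak(sample)
print(line_break_class(0x11824, ranges), line_break_class(0x11838, ranges))
```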
4.3 Syllabic categories
Syllabic categories, given in the format of IndicSyllabicCategory.txt:
# Indic_Syllabic_Category=Bindu
11836 ; Bindu # Mn DOGRA SIGN ANUSVARA
# Indic_Syllabic_Category=Visarga
11837 ; Visarga # Mc DOGRA SIGN VISARGA
# Indic_Syllabic_Category=Nukta
11839 ; Nukta # Mn DOGRA SIGN NUKTA
# Indic_Syllabic_Category=Virama
11838 ; Virama # Mn DOGRA SIGN VIRAMA
# Indic_Syllabic_Category=Vowel_Independent
11800..11809 ; Vowel_Independent # Lo [10] DOGRA LETTER A..DOGRA LETTER AU
# Indic_Syllabic_Category=Vowel_Dependent
1182C..1182E ; Vowel_Dependent # Mc [3] DOGRA VOWEL SIGN AA..DOGRA VOWEL SIGN II
1182F..11835 ; Vowel_Dependent # Mn [7] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN AU
# Indic_Syllabic_Category=Consonant
1180A..1182B ; Consonant # Lo [34] DOGRA LETTER KA..DOGRA LETTER RRA
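One use of these categories is to find orthographic syllable boundaries, which is where line breaks fall in Dogra. The Python sketch below is a deliberately simplified illustration rather than a shaping-engine algorithm: the names `syllabic_category` and `split_syllables` are hypothetical, the code points are those proposed in this document, the independent vowel range is assumed to run through U+11809 (LETTER AU), and ZWNJ handling is left out.

```python
CATEGORIES = [
    (0x11800, 0x11809, "Vowel_Independent"),  # assumed to include LETTER AU
    (0x1180A, 0x1182B, "Consonant"),
    (0x1182C, 0x11835, "Vowel_Dependent"),
    (0x11836, 0x11836, "Bindu"),
    (0x11837, 0x11837, "Visarga"),
    (0x11838, 0x11838, "Virama"),
    (0x11839, 0x11839, "Nukta"),
]

def syllabic_category(cp):
    for first, last, name in CATEGORIES:
        if first <= cp <= last:
            return name
    return "Other"

def split_syllables(cps):
    """Group code points into rough orthographic syllables."""
    syllables = []
    for cp in cps:
        cat = syllabic_category(cp)
        starts_new = cat in ("Consonant", "Vowel_Independent", "Other")
        # A consonant directly after a virama continues the current cluster.
        after_virama = bool(syllables) and syllabic_category(syllables[-1][-1]) == "Virama"
        if not syllables or (starts_new and not (after_virama and cat == "Consonant")):
            syllables.append([cp])
        else:
            syllables[-1].append(cp)
    return syllables

# caturbhuja: CA, TA + U, RA + VIRAMA + BHA + U, JA
word = [0x1180F, 0x11819, 0x1182F, 0x11824, 0x11838, 0x11821, 0x1182F, 0x11811]
for syllable in split_syllables(word):
    print(" ".join(f"U+{cp:04X}" for cp in syllable))
```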
4.4 Positional categories
Positional data for characters, in the format of IndicPositionalCategory.txt:
# Indic_Positional_Category=Right
1182C ; Right # Mc DOGRA VOWEL SIGN AA
1182E ; Right # Mc DOGRA VOWEL SIGN II
11837 ; Right # Mc DOGRA SIGN VISARGA
# Indic_Positional_Category=Left
1182D ; Left # Mc DOGRA VOWEL SIGN I
# Indic_Positional_Category=Top
11836 ; Top # Mn DOGRA SIGN ANUSVARA
11832..11835 ; Top # Mn [4] DOGRA VOWEL SIGN E..DOGRA VOWEL SIGN AU
# Indic_Positional_Category=Bottom
1182F..11831 ; Bottom # Mn [3] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN VOCALIC R
11838 ; Bottom # Mn DOGRA SIGN VIRAMA
11839 ; Bottom # Mn DOGRA SIGN NUKTA
4.5 Script extensions
The following characters should be extended for usage with the Dogra script in ScriptExtensions.txt:
0964 # Po DEVANAGARI DANDA
0965 # Po DEVANAGARI DOUBLE DANDA
0966..096F # Nd [10] DEVANAGARI DIGIT ZERO..DEVANAGARI DIGIT NINE
A830..A835 # No [6] NORTH INDIC FRACTION ONE QUARTER..NORTH INDIC FRACTION THREE SIXTEENTHS
A836..A837 # So [2] NORTH INDIC QUARTER MARK..NORTH INDIC PLACEHOLDER MARK
A838 # Sc NORTH INDIC RUPEE MARK
A839 # So NORTH INDIC QUANTITY MARK
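In practice, script extensions mean that a check such as "does this text consist only of Dogra material?" has to accept the borrowed characters as well as the block itself. A minimal Python sketch, assuming the proposed assigned range 11800..1183A and a hypothetical helper name `usable_in_dogra_text`:

```python
DOGRA_ASSIGNED = (0x11800, 0x1183A)   # assigned range as proposed in this document
EXTENSIONS = [
    (0x0964, 0x0965),   # Devanagari danda and double danda
    (0x0966, 0x096F),   # Devanagari digits
    (0xA830, 0xA839),   # Common Indic Number Forms
]

def usable_in_dogra_text(cp):
    """True if cp is a proposed Dogra character or one extended for use with Dogra."""
    ranges = [DOGRA_ASSIGNED] + EXTENSIONS
    return any(first <= cp <= last for first, last in ranges)

for cp in (0x11824, 0x0964, 0x0969, 0xA838, 0x0915):
    print(f"U+{cp:04X}", usable_in_dogra_text(cp))
```

Other Devanagari characters, such as U+0915 DEVANAGARI LETTER KA, are not covered by the extension and are rejected by this check.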
4.6 ‘Confusable’ characters
Some Dogra characters that resemble characters encoded in other script blocks are:
DOGRA LETTER U : 11187 SHARADA LETTER U
DOGRA LETTER UU : 11188 SHARADA LETTER UU
DOGRA LETTER O : 11684 TAKRI LETTER U
DOGRA LETTER KA : 0A15 GURMUKHI LETTER KA
DOGRA LETTER CA : 11196 SHARADA LETTER CA
DOGRA LETTER TTHA : 1119C SHARADA LETTER TTHA
DOGRA LETTER TA : 11699 TAKRI LETTER TA
DOGRA LETTER DHA : 11680 TAKRI LETTER A
DOGRA LETTER PHA : 1169F TAKRI LETTER PHA
DOGRA LETTER BHA : 116A1 TAKRI LETTER BHA
DOGRA LETTER YA : 116A3 TAKRI LETTER YA
DOGRA LETTER SHA : 0936 DEVANAGARI LETTER SHA
DOGRA LETTER SSA : 111B0 SHARADA LETTER SSA
DOGRA VOWEL SIGN U : 0941 DEVANAGARI VOWEL SIGN U
DOGRA VOWEL SIGN UU : 0942 DEVANAGARI VOWEL SIGN UU
DOGRA VOWEL SIGN E : 116B2 TAKRI VOWEL SIGN E
DOGRA VOWEL SIGN AI : 116B3 TAKRI VOWEL SIGN AI
DOGRA VOWEL SIGN O : 116B4 TAKRI VOWEL SIGN O
DOGRA VOWEL SIGN AU : 116B5 TAKRI VOWEL SIGN AU
DOGRA SIGN ANUSVARA : 0902 DEVANAGARI SIGN ANUSVARA
DOGRA SIGN VIRAMA : 094D DEVANAGARI SIGN VIRAMA
DOGRA SIGN NUKTA : 093C DEVANAGARI SIGN NUKTA
DOGRA ABBREVIATION SIGN : 0970 DEVANAGARI ABBREVIATION SIGN
5 References
Bhāskarācārya. 1872. Līlavatī. Translation into Dogri. Jammu: Vidya Vilas Press.
Grierson, George A. 1916. The Linguistic Survey of India. Vol. IX. Indo-Aryan Family. Central Group. Part I. Specimens of Western Hindī and Pañjābī. Calcutta: Office of the Superintendent of Government Printing, India.
Leitner, Gottlieb William. 1882. “A Collection of Specimens of Commercial and Other Alphabets and Handwritings as also of Multiplication Tables Current in Various Parts of the Panjab, Sind and the North West Provinces”. Published as Appendix V of History of Indigenous Education in the Punjab. Lahore: Anjuman-i-Punjab Press.
Pandey, Anshuman. 2007. “Proposal to Encode North Indic Number Forms in ISO/IEC 10646” (L2/07-354). http://www.unicode.org/L2/L2007/07354-north-indic.pdf
———. 2009. “Proposal to Encode the Takri Script in ISO/IEC 10646” (L2/09-424). http://www.unicode.org/L2/L2009/09424-takri.pdf
———. 2015. “Preliminary Proposal to Encode the Dogra Script in Unicode” (L2/15-213). http://www.unicode.org/L2/L2015/15213-dogra.pdf
Staal, Fritz. 1984. Stamps of Jammu & Kashmir. New York: The Collectors Club.
von der Lin, Carol A. “Collection Kashmir”. “Philatelic Glossary: Dōgrī-Takrī Scripts”. httpwwwkashmirstampscomDogriGlosshtml
6 Acknowledgments
I would like to thank Christopher Shackle (School of Oriental and African Studies, London) for reviewing this proposal.
This project was made possible in part through a Google Research Award, granted to Deborah Anderson for the Script Encoding Initiative, and a grant from the United States National Endowment for the Humanities (PR-50205-15), which funds the Universal Scripts Project (part of the Script Encoding Initiative at the University of California, Berkeley). Any views, findings, conclusions or recommendations expressed in this publication do not necessarily reflect those of Google or the National Endowment for the Humanities.
[Code chart: Dogra, range 11800–1184F; columns 1180–1184, rows 0–F; code points 11800–1183A assigned]
Dogra (11800–1183A)
Punctuation
1183A DOGRA ABBREVIATION SIGN
Independent vowels
11800 DOGRA LETTER A
11801 DOGRA LETTER AA
11802 DOGRA LETTER I
11803 DOGRA LETTER II
11804 DOGRA LETTER U
11805 DOGRA LETTER UU
11806 DOGRA LETTER E
11807 DOGRA LETTER AI
11808 DOGRA LETTER O
11809 DOGRA LETTER AU
Consonants
1180A DOGRA LETTER KA
1180B DOGRA LETTER KHA
1180C DOGRA LETTER GA
1180D DOGRA LETTER GHA
1180E DOGRA LETTER NGA
1180F DOGRA LETTER CA
11810 DOGRA LETTER CHA
11811 DOGRA LETTER JA
11812 DOGRA LETTER JHA
11813 DOGRA LETTER NYA
11814 DOGRA LETTER TTA
11815 DOGRA LETTER TTHA
11816 DOGRA LETTER DDA
11817 DOGRA LETTER DDHA
11818 DOGRA LETTER NNA
11819 DOGRA LETTER TA
1181A DOGRA LETTER THA
1181B DOGRA LETTER DA
1181C DOGRA LETTER DHA
1181D DOGRA LETTER NA
1181E DOGRA LETTER PA
1181F DOGRA LETTER PHA
11820 DOGRA LETTER BA
11821 DOGRA LETTER BHA
11822 DOGRA LETTER MA
11823 DOGRA LETTER YA
11824 DOGRA LETTER RA
11825 DOGRA LETTER LA
11826 DOGRA LETTER VA
11827 DOGRA LETTER SHA
11828 DOGRA LETTER SSA
11829 DOGRA LETTER SA
1182A DOGRA LETTER HA
1182B DOGRA LETTER RRA
Dependent vowel signs
1182C DOGRA VOWEL SIGN AA
1182D DOGRA VOWEL SIGN I
1182E DOGRA VOWEL SIGN II
1182F DOGRA VOWEL SIGN U
11830 DOGRA VOWEL SIGN UU
11831 DOGRA VOWEL SIGN VOCALIC R
11832 DOGRA VOWEL SIGN E
11833 DOGRA VOWEL SIGN AI
11834 DOGRA VOWEL SIGN O
11835 DOGRA VOWEL SIGN AU
Various signs
11836 DOGRA SIGN ANUSVARA
11837 DOGRA SIGN VISARGA
11838 DOGRA SIGN VIRAMA
11839 DOGRA SIGN NUKTA
Figure 1: Comparison of Old Dogra with Takri and other scripts (from Leitner 1882: Set 2).
Figure 2: Comparison of Old Dogra with Takri and other scripts (from Leitner 1882: Set 2).
Figure 3: Specimen of the Dogri language written in Old Dogra (from Grierson 1916: 760).
Figure 4: Another specimen of the Dogri language written in Old Dogra (from Grierson 1916: 772).
Figure 5: Chart of the Old Dogra script (from Grierson 1916: 641).
Figure 6: Comparison of Gurmukhi, Kangra, and Old Dogra scripts (from Grierson 1916: 642).
Figure 7: Comparison of New Dogra (‘Dogri’) and Devanagari vowel and consonant letters (from Staal 1984: 33).
Figure 8: Comparison of New Dogra (‘Dogri’) and Devanagari vowel signs (from Staal 1984: 34).
7 Should the proposed characters be kept together in a contiguous range (rather than being scattered) Yes 8 Can any of the proposed characters be considered a presentation form of an existing character or character sequence No If YES is a rationale for its inclusion provided If YES reference
9 Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters No If YES is a rationale for its inclusion provided If YES reference
10 Can any of the proposed character(s) be considered to be similar (in appearance or function) to or could be confused with an existing character No
If YES is a rationale for its inclusion provided If YES reference
11 Does the proposal include use of combining characters andor use of composite sequences Yes If YES is a rationale for such use provided Yes If YES reference See text of proposal Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided If YES reference
12 Does the proposal contain characters with any special properties such as control function or similar semantics Yes If YES describe in detail (include attachment if necessary) Virama
see text of proposal 13 Does the proposal contain any Ideographic compatibility characters No If YES are the equivalent corresponding unified ideographic characters identified If YES reference
3.12.3 Modifying conjunct formation

As is the case in Devanagari and other Indic scripts, a ligature may be broken using the generic control character U+200C ZERO WIDTH NON-JOINER (ZWNJ):

kṣa (conjunct): <1180A KA, 11838 VIRAMA, 11828 SSA>
kṣa (ligature broken): <1180A KA, 11838 VIRAMA, 200C ZWNJ, 11828 SSA>

The rendering of a conjunct as a ligature or with a visible VIRAMA is a matter of font design.
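
The two sequences for kṣa can be sketched in code. The following minimal Python example is illustrative only: the constant and function names are hypothetical, and the Dogra code points are the values proposed in this document, which may not match the values in a published version of the standard.

```python
# Code points as proposed in this document (assumption: they may differ
# from the values eventually published in the Unicode Standard).
DOGRA_LETTER_KA = 0x1180A
DOGRA_LETTER_SSA = 0x11828
DOGRA_SIGN_VIRAMA = 0x11838
ZWNJ = 0x200C  # U+200C ZERO WIDTH NON-JOINER


def encode(code_points):
    """Build a string from a list of code points."""
    return "".join(chr(cp) for cp in code_points)


# kṣa as a conjunct: a font may render this sequence as a ligature.
ksa_conjunct = encode([DOGRA_LETTER_KA, DOGRA_SIGN_VIRAMA, DOGRA_LETTER_SSA])

# kṣa with ZWNJ after the virama: asks for the non-ligated form.
ksa_broken = encode([DOGRA_LETTER_KA, DOGRA_SIGN_VIRAMA, ZWNJ, DOGRA_LETTER_SSA])


def same_text_ignoring_zwnj(a, b):
    """Compare two strings while ignoring ZWNJ, for example when searching."""
    return a.replace(chr(ZWNJ), "") == b.replace(chr(ZWNJ), "")
```

Because ZWNJ only affects rendering, a search or comparison routine would typically ignore it, as `same_text_ignoring_zwnj` does here.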
3.13 Punctuation

Abbreviation sign. U+1183A DOGRA ABBREVIATION SIGN is used for indicating truncation of text. In the excerpt below, the highlighted text <11825 1182E 1183A> <11826 1183A> <11821 1182C 1183A> (lī va bhā 23) is an abbreviation for līlavatī bhāga 23.

Danda-s. The daṇḍā and double daṇḍā are commonly used in Dogra documents for marking the ends of sentences and paragraphs. The excerpt below shows usage of the daṇḍā, and the following shows use of the double daṇḍā.

These daṇḍā-s are not proposed for encoding as separate characters in the ‘Dogra’ block. They resemble U+0964 DEVANAGARI DANDA and U+0965 DEVANAGARI DOUBLE DANDA and may be represented using these characters. The Devanagari daṇḍā-s are to be specified as extensions for Dogra.

Other signs. Other marks of punctuation, such as dashes, are also used.
A hyphen is used in some cases for indicating breaks in a word at the end of a line (see section 3.16).

3.14 Digits

Digits are commonly used in Dogra documents (see figure 11). The style of digits varies across written and printed sources. Some digits in Old Dogra sources resemble those of Takri (see figure 4), while those in New Dogra sources are similar to Devanagari. The excerpt below shows digits 1–9 and the number 10 as represented in the Līlavatī, written in New Dogra.

The majority of the digits shown here resemble Devanagari forms; the exception is ‘four’. The form of Dogra ‘four’ does not resemble U+116C4 TAKRI DIGIT FOUR, which is nearly identical to the Devanagari digit four. Information about its origins is not known at this time, but it may be related to U+111D4 SHARADA DIGIT FOUR. The forms of ‘eight’ and ‘nine’ are well-known variant forms of the regular Devanagari digits.

Additional research into Dogra digits is required before they may be considered for encoding. The available sources suggest that there may have been a cultural understanding that Dogra possessed unique digits. For instance, on the bank notes shown in figure 20, the digits written in Devanagari are rather similar, but are nonetheless displayed twice in order to align the display of the currency value with each script used for the inscriptions.

At present, script-specific digits are not proposed for inclusion in the Dogra repertoire. Devanagari digits should be used normatively and should be extended for usage with Dogra. If future research yields distinctive digits for Dogra, there is sufficient space in the block to accommodate their encoding.
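
Since Devanagari digits are to be used with Dogra, numbers in Dogra text can be handled with ordinary decimal-digit processing. The sketch below is a minimal illustration in Python; the function names are hypothetical. It relies on the fact that Python's built-in `int()` accepts any Unicode decimal digits, including U+0966 to U+096F.

```python
DEVANAGARI_DIGIT_ZERO = 0x0966  # U+0966 DEVANAGARI DIGIT ZERO


def parse_number(text):
    """Parse a string of decimal digits; int() accepts Devanagari digits."""
    return int(text)


def format_number(value):
    """Format a non-negative integer using Devanagari digits."""
    if value < 0:
        raise ValueError("only non-negative integers are handled in this sketch")
    return "".join(chr(DEVANAGARI_DIGIT_ZERO + int(d)) for d in str(value))


# The samvat year 1923, written with Devanagari digits.
year_text = "\u0967\u096f\u0968\u0969"
year_value = parse_number(year_text)
```

If distinct Dogra digits were encoded later, only the zero constant in `format_number` would need to change, which is one reason to keep the digit base in a single named value.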
3.15 Number forms

Fraction signs and currency marks are commonly used in Dogra sources (see figure 12). These may be represented using the fraction and currency signs already encoded in the ‘Common Indic Number Forms’ block (U+A830). These characters are specified as extensions for Dogra. The ‘Common Indic Number Forms’ are described briefly in The Unicode Standard (chapter 22.3, ‘Numerals’, pp. 770–771) and in more detail in the original encoding proposal (L2/07-354).

3.16 Linebreaking

Linebreaking occurs after an orthographic syllable. Breaks are generally not indicated in informal sources, but are marked in literary materials using hyphens.
3.17 Collation

The primary collating order for Dogra is as follows:

11800 < 11801 < 11802 < 11803 < 11804 < 11805 < 11806 < 11807 < 11808 < 11809 <
1180A < 1180B < 1180C < 1180D < 1180E < 1180F < 11810 < 11811 <
11812 < 11813 < 11814 < 11815 < 11816 < 11817 < 11818 < 11819 <
1181A < 1181B < 1181C < 1181D < 1181E < 1181F < 11820 < 11821 <
11822 < 11823 < 11824 < 11825 < 11826 < 11827 < 11828 < 11829 < 1182A <
1182B < 1182C < 1182D < 1182E <
1182F < 11830 < 11831 < 11832 <
11833 < 11834 < 11835

The following characters have secondary weights: 11836, 11837, 11838, 11839.
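
The collation rule above can be sketched as a two-level sort key. This is a simplified illustration in Python, not a conformant implementation of the Unicode Collation Algorithm: the names are hypothetical, the code points are those proposed in this document, and characters outside the listed ranges are simply ignored.

```python
# Ranges as proposed in this document (assumption: values may differ
# in a published version of the standard).
PRIMARY = range(0x11800, 0x11836)    # letters and vowel signs, in code point order
SECONDARY = range(0x11836, 0x1183A)  # anusvara, visarga, virama, nukta


def sort_key(text):
    """Return a (primary, secondary) key for comparing Dogra strings.

    Primary weights follow the listed order. Characters with secondary
    weights contribute nothing at the primary level and only break ties.
    """
    primary = []
    secondary = []
    for ch in text:
        cp = ord(ch)
        if cp in PRIMARY:
            primary.append(cp - PRIMARY.start + 1)
            secondary.append(0)
        elif cp in SECONDARY:
            secondary.append(cp - SECONDARY.start + 1)
        # Any other character is ignored in this sketch.
    return (tuple(primary), tuple(secondary))


A = chr(0x11800)
KA = chr(0x1180A)
KHA = chr(0x1180B)
VOWEL_SIGN_AA = chr(0x1182C)
ANUSVARA = chr(0x11836)

words = [KHA, KA + VOWEL_SIGN_AA, KA + ANUSVARA, KA, A]
ordered = sorted(words, key=sort_key)
```

With this key, a bare KA sorts before KA with anusvara, and both sort before KA followed by a vowel sign, because the anusvara is only consulted when the primary weights are equal.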
4 Character data

4.1 Character properties

The properties for Dogra in the Unicode Character Database format are:

11800;DOGRA LETTER A;Lo;0;L;;;;;N;;;;;
11801;DOGRA LETTER AA;Lo;0;L;;;;;N;;;;;
11802;DOGRA LETTER I;Lo;0;L;;;;;N;;;;;
11804;DOGRA LETTER U;Lo;0;L;;;;;N;;;;;
11806;DOGRA LETTER E;Lo;0;L;;;;;N;;;;;
11807;DOGRA LETTER AI;Lo;0;L;;;;;N;;;;;
11808;DOGRA LETTER O;Lo;0;L;;;;;N;;;;;
1180A;DOGRA LETTER KA;Lo;0;L;;;;;N;;;;;
1180B;DOGRA LETTER KHA;Lo;0;L;;;;;N;;;;;
1180C;DOGRA LETTER GA;Lo;0;L;;;;;N;;;;;
1180D;DOGRA LETTER GHA;Lo;0;L;;;;;N;;;;;
1180E;DOGRA LETTER NGA;Lo;0;L;;;;;N;;;;;
1180F;DOGRA LETTER CA;Lo;0;L;;;;;N;;;;;
11810;DOGRA LETTER CHA;Lo;0;L;;;;;N;;;;;
11811;DOGRA LETTER JA;Lo;0;L;;;;;N;;;;;
11812;DOGRA LETTER JHA;Lo;0;L;;;;;N;;;;;
11813;DOGRA LETTER NYA;Lo;0;L;;;;;N;;;;;
11814;DOGRA LETTER TTA;Lo;0;L;;;;;N;;;;;
11815;DOGRA LETTER TTHA;Lo;0;L;;;;;N;;;;;
11816;DOGRA LETTER DDA;Lo;0;L;;;;;N;;;;;
11817;DOGRA LETTER DDHA;Lo;0;L;;;;;N;;;;;
11818;DOGRA LETTER NNA;Lo;0;L;;;;;N;;;;;
11819;DOGRA LETTER TA;Lo;0;L;;;;;N;;;;;
1181A;DOGRA LETTER THA;Lo;0;L;;;;;N;;;;;
1181B;DOGRA LETTER DA;Lo;0;L;;;;;N;;;;;
1181C;DOGRA LETTER DHA;Lo;0;L;;;;;N;;;;;
1181D;DOGRA LETTER NA;Lo;0;L;;;;;N;;;;;
1181E;DOGRA LETTER PA;Lo;0;L;;;;;N;;;;;
1181F;DOGRA LETTER PHA;Lo;0;L;;;;;N;;;;;
11820;DOGRA LETTER BA;Lo;0;L;;;;;N;;;;;
11821;DOGRA LETTER BHA;Lo;0;L;;;;;N;;;;;
11822;DOGRA LETTER MA;Lo;0;L;;;;;N;;;;;
11823;DOGRA LETTER YA;Lo;0;L;;;;;N;;;;;
11824;DOGRA LETTER RA;Lo;0;L;;;;;N;;;;;
11825;DOGRA LETTER LA;Lo;0;L;;;;;N;;;;;
11826;DOGRA LETTER VA;Lo;0;L;;;;;N;;;;;
11827;DOGRA LETTER SHA;Lo;0;L;;;;;N;;;;;
11828;DOGRA LETTER SSA;Lo;0;L;;;;;N;;;;;
11829;DOGRA LETTER SA;Lo;0;L;;;;;N;;;;;
1182A;DOGRA LETTER HA;Lo;0;L;;;;;N;;;;;
1182B;DOGRA LETTER RRA;Lo;0;L;;;;;N;;;;;
1182C;DOGRA VOWEL SIGN AA;Mc;0;L;;;;;N;;;;;
1182D;DOGRA VOWEL SIGN I;Mc;0;L;;;;;N;;;;;
1182E;DOGRA VOWEL SIGN II;Mc;0;L;;;;;N;;;;;
1182F;DOGRA VOWEL SIGN U;Mn;0;NSM;;;;;N;;;;;
11830;DOGRA VOWEL SIGN UU;Mn;0;NSM;;;;;N;;;;;
11831;DOGRA VOWEL SIGN VOCALIC R;Mn;0;NSM;;;;;N;;;;;
11832;DOGRA VOWEL SIGN E;Mn;0;NSM;;;;;N;;;;;
11833;DOGRA VOWEL SIGN AI;Mn;0;NSM;;;;;N;;;;;
11834;DOGRA VOWEL SIGN O;Mn;0;NSM;;;;;N;;;;;
11835;DOGRA VOWEL SIGN AU;Mn;0;NSM;;;;;N;;;;;
11836;DOGRA SIGN ANUSVARA;Mn;0;NSM;;;;;N;;;;;
11837;DOGRA SIGN VISARGA;Mc;0;L;;;;;N;;;;;
11838;DOGRA SIGN VIRAMA;Mn;9;NSM;;;;;N;;;;;
11839;DOGRA SIGN NUKTA;Mn;7;NSM;;;;;N;;;;;
1183A;DOGRA ABBREVIATION SIGN;Po;0;L;;;;;N;;;;;
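
Property records of this kind can be read mechanically. The sketch below assumes the standard UnicodeData.txt layout of 15 semicolon-separated fields and extracts the ones used in this proposal (general category, canonical combining class, bidirectional class, mirrored flag). The `CharRecord` type and `parse_record` function are hypothetical names introduced for illustration.

```python
from typing import NamedTuple


class CharRecord(NamedTuple):
    code_point: int
    name: str
    general_category: str
    combining_class: int
    bidi_class: str
    mirrored: bool


def parse_record(line):
    """Parse one line in the 15-field, semicolon-separated UnicodeData layout."""
    fields = line.strip().split(";")
    if len(fields) != 15:
        raise ValueError(f"expected 15 fields, found {len(fields)}")
    return CharRecord(
        code_point=int(fields[0], 16),
        name=fields[1],
        general_category=fields[2],
        combining_class=int(fields[3]),
        bidi_class=fields[4],
        mirrored=(fields[9] == "Y"),
    )


# Two sample records, with values as proposed in this document.
virama = parse_record("11838;DOGRA SIGN VIRAMA;Mn;9;NSM;;;;;N;;;;;")
letter_ka = parse_record("1180A;DOGRA LETTER KA;Lo;0;L;;;;;N;;;;;")
```

The virama and nukta are the only records with a non-zero combining class (9 and 7), which is what keeps them in a fixed relative order under canonical normalization.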
4.2 Linebreaking

Linebreaking properties, given in the data format of LineBreak.txt:

11800..1182B;AL # Lo [44] DOGRA LETTER A..DOGRA LETTER RRA
1182C..1182E;CM # Mc [3] DOGRA VOWEL SIGN AA..DOGRA VOWEL SIGN II
1182F..11835;CM # Mn [7] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN AU
11836;CM # Mn DOGRA SIGN ANUSVARA
11837;CM # Mc DOGRA SIGN VISARGA
11838..11839;CM # Mn [2] DOGRA SIGN VIRAMA..DOGRA SIGN NUKTA
1183A;AL # Po DOGRA ABBREVIATION SIGN
4.3 Syllabic categories

Syllabic categories, given in the format of IndicSyllabicCategory.txt:

# Indic_Syllabic_Category=Bindu
11836 ; Bindu # Mn DOGRA SIGN ANUSVARA

# Indic_Syllabic_Category=Visarga
11837 ; Visarga # Mc DOGRA SIGN VISARGA

# Indic_Syllabic_Category=Nukta
11839 ; Nukta # Mn DOGRA SIGN NUKTA

# Indic_Syllabic_Category=Virama
11838 ; Virama # Mn DOGRA SIGN VIRAMA

# Indic_Syllabic_Category=Vowel_Independent
11800..11808 ; Vowel_Independent # Lo [3] DOGRA LETTER A..DOGRA LETTER AU

# Indic_Syllabic_Category=Vowel_Dependent
1182C..1182E ; Vowel_Dependent # Mc [3] DOGRA VOWEL SIGN AA..DOGRA VOWEL SIGN II
1182F..11835 ; Vowel_Dependent # Mn [7] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN AU

# Indic_Syllabic_Category=Consonant
1180A..1182B ; Consonant # Lo [34] DOGRA LETTER KA..DOGRA LETTER RRA
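
The syllabic categories can be used to find orthographic syllables, after which line breaks may occur. The following Python sketch is a simplified illustration under stated assumptions: the function names are hypothetical, the ranges are copied from the data in this section, and the grouping rule (marks attach to the preceding character; a consonant after a virama continues the conjunct) is a deliberately minimal model rather than a full shaping or line-breaking algorithm.

```python
# (first, last, category) ranges as listed in this proposal.
CATEGORY_RANGES = [
    (0x11800, 0x11808, "Vowel_Independent"),
    (0x1180A, 0x1182B, "Consonant"),
    (0x1182C, 0x11835, "Vowel_Dependent"),
    (0x11836, 0x11836, "Bindu"),
    (0x11837, 0x11837, "Visarga"),
    (0x11838, 0x11838, "Virama"),
    (0x11839, 0x11839, "Nukta"),
]

# Categories that attach to the preceding character.
MARKS = {"Vowel_Dependent", "Bindu", "Visarga", "Virama", "Nukta"}


def syllabic_category(ch):
    """Look up the syllabic category of a character, or 'Other'."""
    cp = ord(ch)
    for first, last, category in CATEGORY_RANGES:
        if first <= cp <= last:
            return category
    return "Other"


def split_syllables(text):
    """Split text into orthographic syllables (simplified model).

    Limitation: a mark that follows a non-Dogra character is attached
    to that character rather than reported as an error.
    """
    syllables = []
    previous = None
    for ch in text:
        category = syllabic_category(ch)
        continues = category in MARKS or (
            category == "Consonant" and previous == "Virama"
        )
        if syllables and continues:
            syllables[-1] += ch
        else:
            syllables.append(ch)
        previous = category
    return syllables


# kṣa (KA, VIRAMA, SSA) followed by mī (MA, VOWEL SIGN II).
sample = "".join(chr(cp) for cp in (0x1180A, 0x11838, 0x11828, 0x11822, 0x1182E))
parts = split_syllables(sample)
```

A break opportunity then exists between consecutive entries of `parts`, but never inside one.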
4.4 Positional categories

Positional data for characters, in the format of IndicPositionalCategory.txt:

# Indic_Positional_Category=Right
1182C ; Right # Mc DOGRA VOWEL SIGN AA
1182E ; Right # Mc DOGRA VOWEL SIGN II
11837 ; Right # Mc DOGRA SIGN VISARGA

# Indic_Positional_Category=Left
1182D ; Left # Mc DOGRA VOWEL SIGN I

# Indic_Positional_Category=Top
11836 ; Top # Mn DOGRA SIGN ANUSVARA
11832..11835 ; Top # Mn [4] DOGRA VOWEL SIGN E..DOGRA VOWEL SIGN AU

# Indic_Positional_Category=Bottom
1182F..11831 ; Bottom # Mn [3] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN VOCALIC R
11838 ; Bottom # Mn DOGRA SIGN VIRAMA
11839 ; Bottom # Mn DOGRA SIGN NUKTA
4.5 Script extensions

The following characters should be extended for usage with the Dogra script in ScriptExtensions.txt:

0964 # Po DEVANAGARI DANDA
0965 # Po DEVANAGARI DOUBLE DANDA
0966..096F # Nd [10] DEVANAGARI DIGIT ZERO..DEVANAGARI DIGIT NINE
A830..A835 # No [6] NORTH INDIC FRACTION ONE QUARTER..NORTH INDIC FRACTION THREE SIXTEENTHS
A836..A837 # So [2] NORTH INDIC QUARTER MARK..NORTH INDIC PLACEHOLDER MARK
A838 # Sc NORTH INDIC RUPEE MARK
A839 # So NORTH INDIC QUANTITY MARK
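
One practical use of these extensions is checking whether a character is expected in Dogra text. The sketch below is a minimal Python illustration; the names are hypothetical, the Dogra range is the one proposed in this document (including positions that are left unassigned), and the extension ranges are copied from the list above.

```python
# Dogra block positions proposed in this document (an assumption relative
# to any published version of the standard).
DOGRA_RANGE = (0x11800, 0x1183A)

# Characters from other blocks that are extended for use with Dogra.
EXTENSION_RANGES = [
    (0x0964, 0x0965),  # Devanagari danda and double danda
    (0x0966, 0x096F),  # Devanagari digits
    (0xA830, 0xA835),  # North Indic fractions
    (0xA836, 0xA837),  # North Indic quarter and placeholder marks
    (0xA838, 0xA838),  # North Indic rupee mark
    (0xA839, 0xA839),  # North Indic quantity mark
]


def usable_with_dogra(ch):
    """Return True if ch is a Dogra character or a listed extension."""
    cp = ord(ch)
    if DOGRA_RANGE[0] <= cp <= DOGRA_RANGE[1]:
        return True
    return any(first <= cp <= last for first, last in EXTENSION_RANGES)


def unexpected_characters(text):
    """List non-space characters in text that are not expected in Dogra."""
    return [ch for ch in text if not ch.isspace() and not usable_with_dogra(ch)]
```

A Devanagari letter such as U+0915 is reported as unexpected, while the Devanagari danda and digits are accepted.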
4.6 ‘Confusable’ characters

Some Dogra characters that resemble characters encoded in other script blocks are:

DOGRA LETTER U             11187 SHARADA LETTER U
DOGRA LETTER UU            11188 SHARADA LETTER UU
DOGRA LETTER O             11684 TAKRI LETTER U
DOGRA LETTER KA            0A15 GURMUKHI LETTER KA
DOGRA LETTER CA            11196 SHARADA LETTER CA
DOGRA LETTER TTHA          1119C SHARADA LETTER TTHA
DOGRA LETTER TA            11699 TAKRI LETTER TA
DOGRA LETTER DHA           11680 TAKRI LETTER A
DOGRA LETTER PHA           1169F TAKRI LETTER PHA
DOGRA LETTER BHA           116A1 TAKRI LETTER BHA
DOGRA LETTER YA            116A3 TAKRI LETTER YA
DOGRA LETTER SHA           0936 DEVANAGARI LETTER SHA
DOGRA LETTER SSA           111B0 SHARADA LETTER SSA
DOGRA VOWEL SIGN U         0941 DEVANAGARI VOWEL SIGN U
DOGRA VOWEL SIGN UU        0942 DEVANAGARI VOWEL SIGN UU
DOGRA VOWEL SIGN E         116B2 TAKRI VOWEL SIGN E
DOGRA VOWEL SIGN AI        116B3 TAKRI VOWEL SIGN AI
DOGRA VOWEL SIGN O         116B4 TAKRI VOWEL SIGN O
DOGRA VOWEL SIGN AU        116B5 TAKRI VOWEL SIGN AU
DOGRA SIGN ANUSVARA        0902 DEVANAGARI SIGN ANUSVARA
DOGRA SIGN VIRAMA          094D DEVANAGARI SIGN VIRAMA
DOGRA SIGN NUKTA           093C DEVANAGARI SIGN NUKTA
DOGRA ABBREVIATION SIGN    0970 DEVANAGARI ABBREVIATION SIGN
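
A list of this kind can be used to flag look-alike characters from other scripts that have been typed into Dogra text by mistake. The following Python sketch is illustrative only: the names are hypothetical, only a subset of the pairs is included, and the Dogra code points are those proposed in this document.

```python
# Subset of the pairs listed above: look-alike code point -> Dogra code point.
# Dogra values are as proposed in this document (assumption).
LOOKALIKE_TO_DOGRA = {
    0x11187: 0x11804,  # SHARADA LETTER U          ~ DOGRA LETTER U
    0x0A15: 0x1180A,   # GURMUKHI LETTER KA        ~ DOGRA LETTER KA
    0x0936: 0x11827,   # DEVANAGARI LETTER SHA     ~ DOGRA LETTER SHA
    0x0902: 0x11836,   # DEVANAGARI SIGN ANUSVARA  ~ DOGRA SIGN ANUSVARA
    0x094D: 0x11838,   # DEVANAGARI SIGN VIRAMA    ~ DOGRA SIGN VIRAMA
    0x093C: 0x11839,   # DEVANAGARI SIGN NUKTA     ~ DOGRA SIGN NUKTA
    0x0970: 0x1183A,   # DEVANAGARI ABBREVIATION SIGN ~ DOGRA ABBREVIATION SIGN
}


def find_lookalikes(text):
    """Return (index, found code point, suggested Dogra code point) triples."""
    hits = []
    for index, ch in enumerate(text):
        suggestion = LOOKALIKE_TO_DOGRA.get(ord(ch))
        if suggestion is not None:
            hits.append((index, ord(ch), suggestion))
    return hits


# KA, then a Devanagari virama typed by mistake, then SSA.
suspect = chr(0x1180A) + "\u094d" + chr(0x11828)
report = find_lookalikes(suspect)
```

The function only reports candidates; whether a replacement is appropriate depends on the surrounding text, since the same code points are legitimate in Devanagari, Gurmukhi, or Sharada content.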
5 References

Bhāskarācārya. 1872. Līlavatī. Translation into Dogri. Jammu: Vidya Vilas Press.

Grierson, George A. 1916. The Linguistic Survey of India. Vol. IX. Indo-Aryan Family. Central Group. Part I. Specimens of Western Hindī and Pañjābī. Calcutta: Office of the Superintendent of Government Printing, India.

Leitner, Gottlieb William. 1882. “A Collection of Specimens of Commercial and Other Alphabets and Handwritings as also of Multiplication Tables Current in Various Parts of the Panjab, Sind and the North West Provinces”. Published as Appendix V of History of Indigenous Education in the Punjab. Lahore: Anjuman-i-Punjab Press.

Pandey, Anshuman. 2007. “Proposal to Encode North Indic Number Forms in ISO/IEC 10646” (L2/07-354). http://www.unicode.org/L2/L2007/07354-north-indic.pdf

Pandey, Anshuman. 2009. “Proposal to Encode the Takri Script in ISO/IEC 10646” (L2/09-424). http://www.unicode.org/L2/L2009/09424-takri.pdf

Pandey, Anshuman. 2015. “Preliminary Proposal to Encode the Dogra Script in Unicode” (L2/15-213). http://www.unicode.org/L2/L2015/15213-dogra.pdf

Staal, Fritz. 1984. Stamps of Jammu & Kashmir. New York: The Collectors Club.
von der Lin, Carol A. “Collection Kashmir”. “Philatelic Glossary: Dōgrī-Takrī Scripts”. http://www.kashmirstamps.com/DogriGloss.html
6 Acknowledgments

I would like to thank Christopher Shackle (School of Oriental and African Studies, London) for reviewing this proposal.

This project was made possible in part through a Google Research Award, granted to Deborah Anderson for the Script Encoding Initiative, and a grant from the United States National Endowment for the Humanities (PR-50205-15), which funds the Universal Scripts Project (part of the Script Encoding Initiative at the University of California, Berkeley). Any views, findings, conclusions, or recommendations expressed in this publication do not necessarily reflect those of Google or the National Endowment for the Humanities.
Printed using UniBook™ (http://www.unicode.org/unibook/). Printed 04-Nov-2015.

[Code chart: Dogra, range 11800–1184F; columns 1180–1184, rows 0–F; code points 11800–1183A assigned]
Dogra: 11800–1183A

Punctuation
1183A  DOGRA ABBREVIATION SIGN

Independent vowels
11800  DOGRA LETTER A
11801  DOGRA LETTER AA
11802  DOGRA LETTER I
11803  DOGRA LETTER II
11804  DOGRA LETTER U
11805  DOGRA LETTER UU
11806  DOGRA LETTER E
11807  DOGRA LETTER AI
11808  DOGRA LETTER O
11809  DOGRA LETTER AU

Consonants
1180A  DOGRA LETTER KA
1180B  DOGRA LETTER KHA
1180C  DOGRA LETTER GA
1180D  DOGRA LETTER GHA
1180E  DOGRA LETTER NGA
1180F  DOGRA LETTER CA
11810  DOGRA LETTER CHA
11811  DOGRA LETTER JA
11812  DOGRA LETTER JHA
11813  DOGRA LETTER NYA
11814  DOGRA LETTER TTA
11815  DOGRA LETTER TTHA
11816  DOGRA LETTER DDA
11817  DOGRA LETTER DDHA
11818  DOGRA LETTER NNA
11819  DOGRA LETTER TA
1181A  DOGRA LETTER THA
1181B  DOGRA LETTER DA
1181C  DOGRA LETTER DHA
1181D  DOGRA LETTER NA
1181E  DOGRA LETTER PA
1181F  DOGRA LETTER PHA
11820  DOGRA LETTER BA
11821  DOGRA LETTER BHA
11822  DOGRA LETTER MA
11823  DOGRA LETTER YA
11824  DOGRA LETTER RA
11825  DOGRA LETTER LA
11826  DOGRA LETTER VA
11827  DOGRA LETTER SHA
11828  DOGRA LETTER SSA
11829  DOGRA LETTER SA
1182A  DOGRA LETTER HA
1182B  DOGRA LETTER RRA

Dependent vowel signs
1182C  DOGRA VOWEL SIGN AA
1182D  DOGRA VOWEL SIGN I
1182E  DOGRA VOWEL SIGN II
1182F  DOGRA VOWEL SIGN U
11830  DOGRA VOWEL SIGN UU
11831  DOGRA VOWEL SIGN VOCALIC R
11832  DOGRA VOWEL SIGN E
11833  DOGRA VOWEL SIGN AI
11834  DOGRA VOWEL SIGN O
11835  DOGRA VOWEL SIGN AU

Various signs
11836  DOGRA SIGN ANUSVARA
11837  DOGRA SIGN VISARGA
11838  DOGRA SIGN VIRAMA
11839  DOGRA SIGN NUKTA
Figure 1: Comparison of Old Dogra with Takri and other scripts (from Leitner 1882: Set 2).

Figure 2: Comparison of Old Dogra with Takri and other scripts (from Leitner 1882: Set 2).

Figure 3: Specimen of the Dogri language written in Old Dogra (from Grierson 1916: 760).

Figure 4: Another specimen of the Dogri language written in Old Dogra (from Grierson 1916: 772).

Figure 5: Chart of the Old Dogra script (from Grierson 1916: 641).

Figure 6: Comparison of Gurmukhi, Kangra, and Old Dogra scripts (from Grierson 1916: 642).

Figure 7: Comparison of New Dogra (‘Dogri’) and Devanagari vowel and consonant letters (from Staal 1984: 33).

Figure 8: Comparison of New Dogra (‘Dogri’) and Devanagari vowel signs (from Staal 1984: 34).

Figure 9: The first page of a Dogri translation of the Līlavatī by Bhāskarācārya, a Sanskrit treatise on mathematics, printed in the New Dogra script in 1872 (p. 1). Figure 10 contains the second page of this book.

Figure 10: Another page from the Līlavatī in New Dogra (p. 2).

Figure 11: Page from the Līlavatī showing the use of printed New Dogra digits (p. 3).

Figure 12: Usage of fractions for denoting currency in New Dogra (from Staal 1984: 75). The text appears to be typeset using metal fonts. The fraction signs and currency marks may be represented using characters from the ‘Common Indic Number Forms’ block (U+A830) in Unicode.
Figure 13: Two specimens of a trilingual stamp for expedited mail from Jammu State in Old Dogra, Persian, and Devanagari scripts. The Dogra text reads <1180A 1180A 11825> <11811 11824 1182F 11824 1182E> kakal jarurī. The Persian reads خط ضروری ḵẖat zarūrī; the Devanagari reads आवश्यक पत्र āvaśyaka patra.

Figure 14: Four bilingual regular postage stamps from Jammu State with text in the Old Dogra and Persian scripts. Stamps [ ] and [ ] are half ‘anna’ stamps and [ ] is a one ‘anna’ stamp. In all stamps, the Dogra text at the top reads <11811 11822 11804> jamu ‘Jammu’ and <1180A 11810 11822 1182E 11824> kachamīr (= kaśmīr) ‘Kashmir’. In stamps [ ] and [ ], the first line of the center is Persian ۱۹۲۳ نم انه 1923 [samvat] nim ānah ‘half ānnā’; the second line of the center is Dogra <11800 1181B 11800 1181D> ada ana ‘half ānnā’. In [ ], the first line of the center text is Persian یك انه yak ānah ‘one ānnā’; the second line of the center is Dogra <11806 1180A 11800 1181D> ekana ‘one ānnā’. In all four stamps, the third line of the center is the date 1923 [samvat] written in Devanagari-like digits: १९२३. The Persian text at the bottom of the stamp reads قلمرو سرکار جمون و کشمیر qalamrau sarkār jammūn o kaśmīr ‘dominions of the ruler of Jammu and Kashmir’.
Detail of top left (Devanagari)

Detail of top right (New Dogra)

Detail of center (New Dogra, Devanagari, and Urdu)

Figure 15: The mast of a two rupee non-judicial stamp paper from Kashmir State.

Figure 16: The mast of a five rupee non-judicial stamp paper from Kashmir State containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.

Figure 17: A one anna stamp from Jammu and Kashmir State with text in New Dogra, Devanagari, and Arabic (Urdu).

Figure 18: Telegraph stamps from Jammu and Kashmir State with text in Dogra, Devanagari, Urdu, and English. The Dogra denominations show usage of Common Indic fractions and the rupee mark.
postakarda kalamrau jammu kasmir va tibbataha
“Postcard of the realm of Jammu, Kashmir, and Tibet”

isa tarapha siva pate ke kucha na likho
“Do not write anything on this side except for the address”

jamu; sinagar
“Jammu”; “Srinagar”

bhaisakh (4); 4 vaisakh
“Vaisakh 4”; “4 Vaisakh”

Figure 19: A state postcard from Jammu and Kashmir with post mark of 19 April 1891. The inscription at top is in New Dogra and the content is in Persian. The postmarks are in English and Dogra.

One rupee banknote

Twenty rupee banknote

Figure 20: Bank notes from Jammu and Kashmir State containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.
Vowel letters
Chamba   Dogra
11680    11800
11681    11801
11682    11802
11683    (none)
11684    11804
11685    (none)
11686    11806
11687    11807
11688    11808
11689    (none)

Vowel signs
Chamba   Dogra
no dependent form
116AD    1182C
116AE    1182D
116AF    1182E
116B0    1182F
116B1    11830
116B2    11832
116B3    11833
116B4    11834
116B5    11835

Table 2: Comparison of Chamba and Dogra vowel letters and signs.
Chamba   Dogra
1168A    1180A
1168B    1180B
1168C    1180C
1168D    1180D
1168E    1180E
1168F    1180F
11690    11810
11691    11811
11692    11812
11693    11813
11694    11814
11695    11815
11696    11816
11697    11817
11698    11818
11699    11819
1169A    1181A
1169B    1181B
1169C    1181C
1169D    1181D
1169E    1181E
1169F    1181F
116A0    11820
116A1    11821
116A2    11822
116A3    11823
116A4    11824
116A5    11825
116A6    11826
116A7    11827
(none)   11828
116A8    11829
116A9    1182A
116AA    1182B

Table 3: Comparison of Chamba and Dogra consonant letters.
See also HTUhttpstddkuugdkJTC1SC2WG2docsroadmapshtml UTH for latest Roadmaps
A Administrative
1 Title Proposal to encode the Dogra script in Unicode 2 Requesters name Anshuman Pandey Script Encoding Initiative 3 Requester type (Member bodyLiaisonIndividual contribution) Liaison contribution 4 Submission date 4 November 2015 5 Requesters reference (if applicable) 6 Choose one of the following This is a complete proposal Yes (or) More information will be provided later
B Technical ndash General 1 Choose one of the following a This proposal is for a new script (set of characters) Yes Proposed name of script Dogra b The proposal is for addition of character(s) to an existing block Name of the existing block
2 Number of characters in proposal 59
3 Proposed category (select one from below - see section 22 of PampP document) A-Contemporary B1-Specialized (small collection) B2-Specialized (large collection) C-Major extinct D-Attested extinct E-Minor extinct X F-Archaic Hieroglyphic or Ideographic G-Obscure or questionable usage symbols
4 Is a repertoire including character names provided Yes a If YES are the names in accordance with the ldquocharacter naming guidelinesrdquo in Annex L of PampP document b Are the character shapes attached in a legible form suitable for review
5 Fonts related a Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the
standard
Anshuman Pandey b Identify the party granting a license for use of the font by the editors (include address e-mail ftp-site etc) Anshuman Pandey
6 References a Are references (to other character sets dictionaries descriptive texts etc) provided Yes b Are published examples of use (such as samples from newspapers magazines or other sources) of proposed characters attached Yes
7 Special encoding issues Does the proposal address other aspects of character data processing (if applicable) such as input presentation sorting searching indexing transliteration etc (if yes please enclose information) Yes
8 Additional Information Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script Examples of such properties are Casing information Numeric information Currency information Display behaviour information such as line breaks widths etc Combining behaviour Spacing behaviour Directional behaviour Default Collation behaviour relevance in Mark Up contexts Compatibility equivalence and other Unicode normalization related information See the Unicode standard at HTUhttpwwwunicodeorgUTH for such information on other scripts Also see Unicode Character Database ( Hhttpwwwunicodeorgreportstr44 ) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard
TP
1PT Form number N4502-F (Original 1994-10-14 Revised 1995-01 1995-04 1996-04 1996-08 1999-03 2001-05 2001-09 2003-11
2005-01 2005-09 2005-10 2007-03 2008-05 2009-11 2011-03 2012-01)
C Technical - Justification
1 Has this proposal for addition of character(s) been submitted before No If YES explain
2 Has contact been made to members of the user community (for example National Body user groups of the script or characters other experts etc) Yes If YES with whom Christopher Shackle (SOAS London) If YES available relevant documents
3 Information on the user community for the proposed characters (for example size demographics information technology use or publishing use) is included No Reference Historical script currently not used
4 The context of use for the proposed characters (type of use common or rare) Common Reference
5 Are the proposed characters in current use by the user community No If YES where Reference
6 After giving due considerations to the principles in the PampP document must the proposed characters be entirely in the BMP NA If YES is a rationale provided If YES reference
7 Should the proposed characters be kept together in a contiguous range (rather than being scattered) Yes 8 Can any of the proposed characters be considered a presentation form of an existing character or character sequence No If YES is a rationale for its inclusion provided If YES reference
9 Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters No If YES is a rationale for its inclusion provided If YES reference
10 Can any of the proposed character(s) be considered to be similar (in appearance or function) to or could be confused with an existing character No
If YES is a rationale for its inclusion provided If YES reference
11 Does the proposal include use of combining characters andor use of composite sequences Yes If YES is a rationale for such use provided Yes If YES reference See text of proposal Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided If YES reference
12 Does the proposal contain characters with any special properties such as control function or similar semantics Yes If YES describe in detail (include attachment if necessary) Virama
see text of proposal 13 Does the proposal contain any Ideographic compatibility characters No If YES are the equivalent corresponding unified ideographic characters identified If YES reference
Proposal to encode the Dogra script in Unicode Anshuman Pandey
A hyphen is used in some cases for indicating breaks in a word at end of line (see section 316)
314 Digits
Digits are commonly used in Dogra documents (see figure 11) The style of digits varies across written andprinted sources Some digits in Old Dogra sources resemble those of Takri (see figure 4) while those innew Dogra sources are similar to Devanagar The excerpt below shows digits 1ndash9 and the number 10 asrepresented in the Līlavatī written in New Dogra
The majority of the digits shown here resemble Devanagari forms however the exception is lsquofourrsquo Theform of Dogra lsquofourrsquo does not resemble 71364 +116C4 which is nearly identical to theDevanagari digit four Information about its origins are not known at this time but it may be related to 70100+111D4 The forms of lsquoeightrsquo and lsquoninersquo are well-known variant forms of the regular
Devanagari digits
Additional research into Dogra digits is required before they may be considered for encoding The availablesources suggest that there may have been a cultural understanding that Dogra possessed unique digits Forinstance on the bank notes shown in figure 20 the digits written in Devanagari are rather similar but arenonetheless displayed twice in order to align the display of the currency value with each script used for theinscriptions
At present script-specific digits are not proposed for inclusion in the Dogra repertoire Devanagari digitsshould be used normatively and should be extended for usage with Dogra If future research yields distinctivedigits for Dogra there is sufficient space in the block to acccommodate their encoding
315 Number forms
Fraction signs and currency marks are commonly used in Dogra sources (see figure 12) These may berepresented using the fraction and currency signs already encoded in the lsquoCommon Indic Number Formsrsquoblock (U+A830) These characters are specified as extensions for Dogra The lsquoCommon Indic NumberFormsrsquo are described briefly in The Unicode Standard (chapter 223 lsquoNumeralsrsquo pp 770ndash771) and in moredetail in the original encoding proposal (L207-354)
316 Linebreaking
Linebreaking occurs after an orthographic syllable Breaks are generally not indicated in informal sourcesbut are marked in literary materials using hyphens
3.17 Collation
The primary collating order for Dogra is as follows:
11800 < 11801 < 11802 < 11803 < 11804 < 11805 < 11806 < 11807 < 11808 < 11809 <
1180A < 1180B < 1180C < 1180D < 1180E < 1180F < 11810 < 11811 <
11812 < 11813 < 11814 < 11815 < 11816 < 11817 < 11818 < 11819 <
1181A < 1181B < 1181C < 1181D < 1181E < 1181F < 11820 < 11821 <
11822 < 11823 < 11824 < 11825 < 11826 < 11827 < 11828 < 11829 < 1182A <
1182B < 1182C < 1182D < 1182E <
1182F < 11830 < 11831 < 11832 <
11833 < 11834 < 11835
The following characters have secondary weights: 11836, 11837, 11838, 11839.
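The ordering above can be approximated with a two-level sort key. The sketch below is a simplification and not the Unicode Collation Algorithm: it assumes that the primary order coincides with code point order from U+11800 to U+11835, that the four signs U+11836..U+11839 are ignored at the primary level, and that all other characters are skipped. The names `PRIMARY`, `SECONDARY` and `sort_key` are hypothetical.

```python
# Assumption: primary weights follow code point order U+11800..U+11835.
PRIMARY = {cp: rank for rank, cp in enumerate(range(0x11800, 0x11836), start=1)}
# Assumption: the four signs carry only secondary weights (2, 3, 4, 5).
SECONDARY = {cp: rank for rank, cp in enumerate(range(0x11836, 0x1183A), start=2)}
COMMON_SECONDARY = 1  # secondary weight given to every primary character


def sort_key(word):
    """Return a (primary, secondary) key for comparing Dogra strings."""
    primary, secondary = [], []
    for ch in word:
        cp = ord(ch)
        if cp in PRIMARY:
            primary.append(PRIMARY[cp])
            secondary.append(COMMON_SECONDARY)
        elif cp in SECONDARY:
            secondary.append(SECONDARY[cp])
        # Characters outside the proposed repertoire are ignored here.
    return tuple(primary), tuple(secondary)


A, KA, KHA, ANUSVARA = (chr(cp) for cp in (0x11800, 0x1180A, 0x1180B, 0x11836))
words = [KA + KHA, KA + ANUSVARA, KA, A]
ordered = sorted(words, key=sort_key)
```

With this key, a word that differs from another only by an anusvara, visarga, virama or nukta sorts immediately after it, and before any word that continues with a further letter.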
4 Character data
4.1 Character properties
The properties for Dogra in the Unicode Character Database format are:
11800;DOGRA LETTER A;Lo;0;L;;;;;N;;;;;
11801;DOGRA LETTER AA;Lo;0;L;;;;;N;;;;;
11802;DOGRA LETTER I;Lo;0;L;;;;;N;;;;;
11804;DOGRA LETTER U;Lo;0;L;;;;;N;;;;;
11806;DOGRA LETTER E;Lo;0;L;;;;;N;;;;;
11807;DOGRA LETTER AI;Lo;0;L;;;;;N;;;;;
11808;DOGRA LETTER O;Lo;0;L;;;;;N;;;;;
1180A;DOGRA LETTER KA;Lo;0;L;;;;;N;;;;;
1180B;DOGRA LETTER KHA;Lo;0;L;;;;;N;;;;;
1180C;DOGRA LETTER GA;Lo;0;L;;;;;N;;;;;
1180D;DOGRA LETTER GHA;Lo;0;L;;;;;N;;;;;
1180E;DOGRA LETTER NGA;Lo;0;L;;;;;N;;;;;
1180F;DOGRA LETTER CA;Lo;0;L;;;;;N;;;;;
11810;DOGRA LETTER CHA;Lo;0;L;;;;;N;;;;;
11811;DOGRA LETTER JA;Lo;0;L;;;;;N;;;;;
11812;DOGRA LETTER JHA;Lo;0;L;;;;;N;;;;;
11813;DOGRA LETTER NYA;Lo;0;L;;;;;N;;;;;
11814;DOGRA LETTER TTA;Lo;0;L;;;;;N;;;;;
11815;DOGRA LETTER TTHA;Lo;0;L;;;;;N;;;;;
11816;DOGRA LETTER DDA;Lo;0;L;;;;;N;;;;;
11817;DOGRA LETTER DDHA;Lo;0;L;;;;;N;;;;;
11818;DOGRA LETTER NNA;Lo;0;L;;;;;N;;;;;
11819;DOGRA LETTER TA;Lo;0;L;;;;;N;;;;;
1181A;DOGRA LETTER THA;Lo;0;L;;;;;N;;;;;
1181B;DOGRA LETTER DA;Lo;0;L;;;;;N;;;;;
1181C;DOGRA LETTER DHA;Lo;0;L;;;;;N;;;;;
1181D;DOGRA LETTER NA;Lo;0;L;;;;;N;;;;;
1181E;DOGRA LETTER PA;Lo;0;L;;;;;N;;;;;
1181F;DOGRA LETTER PHA;Lo;0;L;;;;;N;;;;;
11820;DOGRA LETTER BA;Lo;0;L;;;;;N;;;;;
11821;DOGRA LETTER BHA;Lo;0;L;;;;;N;;;;;
11822;DOGRA LETTER MA;Lo;0;L;;;;;N;;;;;
11823;DOGRA LETTER YA;Lo;0;L;;;;;N;;;;;
11824;DOGRA LETTER RA;Lo;0;L;;;;;N;;;;;
11825;DOGRA LETTER LA;Lo;0;L;;;;;N;;;;;
11826;DOGRA LETTER VA;Lo;0;L;;;;;N;;;;;
11827;DOGRA LETTER SHA;Lo;0;L;;;;;N;;;;;
11828;DOGRA LETTER SSA;Lo;0;L;;;;;N;;;;;
11829;DOGRA LETTER SA;Lo;0;L;;;;;N;;;;;
1182A;DOGRA LETTER HA;Lo;0;L;;;;;N;;;;;
1182B;DOGRA LETTER RRA;Lo;0;L;;;;;N;;;;;
1182C;DOGRA VOWEL SIGN AA;Mc;0;L;;;;;N;;;;;
1182D;DOGRA VOWEL SIGN I;Mc;0;L;;;;;N;;;;;
1182E;DOGRA VOWEL SIGN II;Mc;0;L;;;;;N;;;;;
1182F;DOGRA VOWEL SIGN U;Mn;0;NSM;;;;;N;;;;;
11830;DOGRA VOWEL SIGN UU;Mn;0;NSM;;;;;N;;;;;
11831;DOGRA VOWEL SIGN VOCALIC R;Mn;0;NSM;;;;;N;;;;;
11832;DOGRA VOWEL SIGN E;Mn;0;NSM;;;;;N;;;;;
11833;DOGRA VOWEL SIGN AI;Mn;0;NSM;;;;;N;;;;;
11834;DOGRA VOWEL SIGN O;Mn;0;NSM;;;;;N;;;;;
11835;DOGRA VOWEL SIGN AU;Mn;0;NSM;;;;;N;;;;;
11836;DOGRA SIGN ANUSVARA;Mn;0;NSM;;;;;N;;;;;
11837;DOGRA SIGN VISARGA;Mc;0;L;;;;;N;;;;;
11838;DOGRA SIGN VIRAMA;Mn;9;NSM;;;;;N;;;;;
11839;DOGRA SIGN NUKTA;Mn;7;NSM;;;;;N;;;;;
1183A;DOGRA ABBREVIATION SIGN;Po;0;L;;;;;N;;;;;
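For readers who want to process these records, the following Python sketch parses a few of them. It assumes the standard semicolon-separated layout of UnicodeData.txt, in which the first five fields are code point, name, general category, canonical combining class and bidirectional class. The `UcdRecord` type and the `parse_record` function are hypothetical names introduced for illustration.

```python
from typing import NamedTuple


class UcdRecord(NamedTuple):
    code_point: int
    name: str
    general_category: str
    combining_class: int
    bidi_class: str


def parse_record(line):
    """Parse the first five fields of a UnicodeData.txt-style line."""
    fields = line.strip().split(";")
    return UcdRecord(
        code_point=int(fields[0], 16),
        name=fields[1],
        general_category=fields[2],
        combining_class=int(fields[3]),
        bidi_class=fields[4],
    )


# A few records transcribed from the proposed data.
SAMPLE = """\
1180A;DOGRA LETTER KA;Lo;0;L;;;;;N;;;;;
1182F;DOGRA VOWEL SIGN U;Mn;0;NSM;;;;;N;;;;;
11838;DOGRA SIGN VIRAMA;Mn;9;NSM;;;;;N;;;;;
11839;DOGRA SIGN NUKTA;Mn;7;NSM;;;;;N;;;;;
"""

records = [parse_record(line) for line in SAMPLE.splitlines()]
# Marks with a non-zero combining class take part in canonical reordering.
reordering_marks = [r.name for r in records if r.combining_class != 0]
```

In this data only the virama (class 9) and the nukta (class 7) have a non-zero combining class, so a nukta is ordered before a virama under canonical reordering.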
4.2 Linebreaking
Linebreaking properties are given in the data format of LineBreak.txt:
11800..1182B;AL # Lo [44] DOGRA LETTER A..DOGRA LETTER RRA
1182C..1182E;CM # Mc [3] DOGRA VOWEL SIGN AA..DOGRA VOWEL SIGN II
1182F..11835;CM # Mn [7] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN AU
11836;CM # Mn DOGRA SIGN ANUSVARA
11837;CM # Mc DOGRA SIGN VISARGA
11838..11839;CM # Mn [2] DOGRA SIGN VIRAMA..DOGRA SIGN NUKTA
1183A;AL # Po DOGRA ABBREVIATION SIGN
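A small Python sketch of how such range data might be read and queried is given below. It assumes the usual LineBreak.txt layout, `first..last;CLASS # comment`, with a single code point when no `..` is present. The functions `parse_line_break` and `line_break_class` are hypothetical helpers, and the fallback value `XX` (the class for unknown characters) is used here only as a default for code points not listed.

```python
def parse_line_break(line):
    """Parse one LineBreak.txt-style line into (first, last, class)."""
    data = line.split("#", 1)[0].strip()  # drop the trailing comment
    if not data:
        return None
    code_range, lb_class = (part.strip() for part in data.split(";"))
    first, _, last = code_range.partition("..")
    start = int(first, 16)
    end = int(last, 16) if last else start
    return start, end, lb_class


# Lines transcribed from the proposed data.
DATA = """\
11800..1182B;AL # Lo [44] DOGRA LETTER A..DOGRA LETTER RRA
1182C..1182E;CM # Mc [3] DOGRA VOWEL SIGN AA..DOGRA VOWEL SIGN II
1182F..11835;CM # Mn [7] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN AU
11836;CM # Mn DOGRA SIGN ANUSVARA
11837;CM # Mc DOGRA SIGN VISARGA
11838..11839;CM # Mn [2] DOGRA SIGN VIRAMA..DOGRA SIGN NUKTA
1183A;AL # Po DOGRA ABBREVIATION SIGN
"""

RANGES = [entry for entry in map(parse_line_break, DATA.splitlines()) if entry]


def line_break_class(ch, default="XX"):
    """Look up the line breaking class of a character in RANGES."""
    cp = ord(ch)
    for start, end, lb_class in RANGES:
        if start <= cp <= end:
            return lb_class
    return default
```

For example, `line_break_class(chr(0x1180A))` yields the class of DOGRA LETTER KA and `line_break_class(chr(0x11838))` that of the virama.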
4.3 Syllabic categories
Syllabic categories are given in the format of IndicSyllabicCategory.txt:
# Indic_Syllabic_Category=Bindu
11836 ; Bindu # Mn DOGRA SIGN ANUSVARA
# Indic_Syllabic_Category=Visarga
11837 ; Visarga # Mc DOGRA SIGN VISARGA
# Indic_Syllabic_Category=Nukta
11839 ; Nukta # Mn DOGRA SIGN NUKTA
# Indic_Syllabic_Category=Virama
11838 ; Virama # Mn DOGRA SIGN VIRAMA
# Indic_Syllabic_Category=Vowel_Independent
11800..11808 ; Vowel_Independent # Lo [3] DOGRA LETTER A..DOGRA LETTER AU
# Indic_Syllabic_Category=Vowel_Dependent
1182C..1182E ; Vowel_Dependent # Mc [3] DOGRA VOWEL SIGN AA..DOGRA VOWEL SIGN II
1182F..11835 ; Vowel_Dependent # Mn [7] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN AU
# Indic_Syllabic_Category=Consonant
1180A..1182B ; Consonant # Lo [34] DOGRA LETTER KA..DOGRA LETTER RRA
4.4 Positional categories
Positional data for characters are given in the format of IndicPositionalCategory.txt:
# Indic_Positional_Category=Right
1182C ; Right # Mc DOGRA VOWEL SIGN AA
1182E ; Right # Mc DOGRA VOWEL SIGN II
11837 ; Right # Mc DOGRA SIGN VISARGA
# Indic_Positional_Category=Left
1182D ; Left # Mc DOGRA VOWEL SIGN I
# Indic_Positional_Category=Top
11836 ; Top # Mn DOGRA SIGN ANUSVARA
11832..11835 ; Top # Mn [4] DOGRA VOWEL SIGN E..DOGRA VOWEL SIGN AU
# Indic_Positional_Category=Bottom
1182F..11831 ; Bottom # Mn [3] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN VOCALIC R
11838 ; Bottom # Mn DOGRA SIGN VIRAMA
11839 ; Bottom # Mn DOGRA SIGN NUKTA
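The practical consequence of these categories is that a sign classified as Left is stored after its consonant but displayed before it. The toy Python sketch below illustrates that reordering for a single orthographic syllable, using the positional values listed above. In practice the reordering is carried out by the shaping engine and font, so the `visual_order` function is only a hypothetical illustration.

```python
# Positional categories as listed above (code points as proposed here).
POSITION = {
    0x1182C: "Right", 0x1182D: "Left", 0x1182E: "Right",
    0x1182F: "Bottom", 0x11830: "Bottom", 0x11831: "Bottom",
    0x11832: "Top", 0x11833: "Top", 0x11834: "Top", 0x11835: "Top",
    0x11836: "Top", 0x11837: "Right", 0x11838: "Bottom", 0x11839: "Bottom",
}


def visual_order(syllable):
    """Move left-positioned signs to the front of one syllable.

    The input is in logical (stored) order; the result approximates the
    left-to-right order in which the glyphs appear.
    """
    left = [ch for ch in syllable if POSITION.get(ord(ch)) == "Left"]
    rest = [ch for ch in syllable if POSITION.get(ord(ch)) != "Left"]
    return "".join(left + rest)


KA, SIGN_AA, SIGN_I = chr(0x1180A), chr(0x1182C), chr(0x1182D)
ki_visual = visual_order(KA + SIGN_I)    # sign I moves before KA
kaa_visual = visual_order(KA + SIGN_AA)  # sign AA stays after KA
```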
4.5 Script extensions
The following characters should be extended for usage with the Dogra script in ScriptExtensions.txt:
0964 # Po DEVANAGARI DANDA
0965 # Po DEVANAGARI DOUBLE DANDA
0966..096F # Nd [10] DEVANAGARI DIGIT ZERO..DEVANAGARI DIGIT NINE
A830..A835 # No [6] NORTH INDIC FRACTION ONE QUARTER..NORTH INDIC FRACTION THREE SIXTEENTHS
A836..A837 # So [2] NORTH INDIC QUARTER MARK..NORTH INDIC PLACEHOLDER MARK
A838 # Sc NORTH INDIC RUPEE MARK
A839 # So NORTH INDIC QUANTITY MARK
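One use of this list is to decide whether a run of text can be treated as Dogra even though it contains characters from other blocks. The Python sketch below assumes the block range 11800..1184F shown in the code chart and the extensions listed above; the names `usable_with_dogra` and `is_dogra_text` are hypothetical, and treating whitespace as neutral is a choice made for this example.

```python
DOGRA_BLOCK = range(0x11800, 0x11850)  # block range shown in the code chart

# Characters extended for use with Dogra, as listed above.
EXTENDED_FOR_DOGRA = (
    range(0x0964, 0x0966),  # DEVANAGARI DANDA, DOUBLE DANDA
    range(0x0966, 0x0970),  # DEVANAGARI DIGIT ZERO..NINE
    range(0xA830, 0xA83A),  # NORTH INDIC fractions and marks
)


def usable_with_dogra(ch):
    """True if ch is in the Dogra block or extended for use with Dogra."""
    cp = ord(ch)
    return cp in DOGRA_BLOCK or any(cp in r for r in EXTENDED_FOR_DOGRA)


def is_dogra_text(text):
    """True if every non-space character may be used with Dogra."""
    return all(ch.isspace() or usable_with_dogra(ch) for ch in text)


# DOGRA LETTER KA followed by the Devanagari digits 1923 and a danda.
sample = chr(0x1180A) + " \u0967\u096F\u0968\u0969\u0964"
sample_ok = is_dogra_text(sample)
```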
4.6 ‘Confusable’ characters
Some Dogra characters that resemble characters encoded in other script blocks are:
DOGRA LETTER U    11187 SHARADA LETTER U
DOGRA LETTER UU    11188 SHARADA LETTER UU
DOGRA LETTER O    11684 TAKRI LETTER U
DOGRA LETTER KA    0A15 GURMUKHI LETTER KA
DOGRA LETTER CA    11196 SHARADA LETTER CA
DOGRA LETTER TTHA    1119C SHARADA LETTER TTHA
DOGRA LETTER TA    11699 TAKRI LETTER TA
DOGRA LETTER DHA    11680 TAKRI LETTER A
DOGRA LETTER PHA    1169F TAKRI LETTER PHA
DOGRA LETTER BHA    116A1 TAKRI LETTER BHA
DOGRA LETTER YA    116A3 TAKRI LETTER YA
DOGRA LETTER SHA    0936 DEVANAGARI LETTER SHA
DOGRA LETTER SSA    111B0 SHARADA LETTER SSA
DOGRA VOWEL SIGN U    0941 DEVANAGARI VOWEL SIGN U
DOGRA VOWEL SIGN UU    0942 DEVANAGARI VOWEL SIGN UU
DOGRA VOWEL SIGN E    116B2 TAKRI VOWEL SIGN E
DOGRA VOWEL SIGN AI    116B3 TAKRI VOWEL SIGN AI
DOGRA VOWEL SIGN O    116B4 TAKRI VOWEL SIGN O
DOGRA VOWEL SIGN AU    116B5 TAKRI VOWEL SIGN AU
DOGRA SIGN ANUSVARA    0902 DEVANAGARI SIGN ANUSVARA
DOGRA SIGN VIRAMA    094D DEVANAGARI SIGN VIRAMA
DOGRA SIGN NUKTA    093C DEVANAGARI SIGN NUKTA
DOGRA ABBREVIATION SIGN    0970 DEVANAGARI ABBREVIATION SIGN
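A list of this kind is typically used to compare strings after replacing each character with the one it resembles. The Python sketch below shows the idea with a handful of the pairs above, taking the Dogra code points from the names list in this proposal. It is a hypothetical illustration rather than the Unicode confusables data, and the names `LOOKALIKE`, `skeleton` and `may_be_confused` are introduced here for the example.

```python
# A subset of the pairs listed above: proposed Dogra code point -> lookalike.
LOOKALIKE = {
    0x11804: 0x11187,  # DOGRA LETTER U -> SHARADA LETTER U
    0x1180A: 0x0A15,   # DOGRA LETTER KA -> GURMUKHI LETTER KA
    0x11827: 0x0936,   # DOGRA LETTER SHA -> DEVANAGARI LETTER SHA
    0x11836: 0x0902,   # DOGRA SIGN ANUSVARA -> DEVANAGARI SIGN ANUSVARA
    0x11838: 0x094D,   # DOGRA SIGN VIRAMA -> DEVANAGARI SIGN VIRAMA
    0x11839: 0x093C,   # DOGRA SIGN NUKTA -> DEVANAGARI SIGN NUKTA
    0x1183A: 0x0970,   # DOGRA ABBREVIATION SIGN -> DEVANAGARI ABBREVIATION SIGN
}


def skeleton(text):
    """Replace each listed Dogra character with its lookalike."""
    return "".join(chr(LOOKALIKE.get(ord(ch), ord(ch))) for ch in text)


def may_be_confused(a, b):
    """True if two strings are identical after lookalike replacement."""
    return skeleton(a) == skeleton(b)


dogra_sha_anusvara = chr(0x11827) + chr(0x11836)
devanagari_sha_anusvara = "\u0936\u0902"
clash = may_be_confused(dogra_sha_anusvara, devanagari_sha_anusvara)
```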
5 References
Bhāskarācārya. 1872. Līlavatī. Translation into Dogri. Jammu: Vidya Vilas Press.
Grierson, George A. 1916. The Linguistic Survey of India. Vol. IX. Indo-Aryan Family, Central Group. Part I. Specimens of Western Hindī and Pañjābī. Calcutta: Office of the Superintendent of Government Printing, India.
Leitner, Gottlieb William. 1882. “A Collection of Specimens of Commercial and Other Alphabets and Handwritings as also of Multiplication Tables Current in Various Parts of the Panjab, Sind and the North West Provinces”. Published as Appendix V of History of Indigenous Education in the Punjab. Lahore: Anjuman-i-Punjab Press.
Pandey, Anshuman. 2007. “Proposal to Encode North Indic Number Forms in ISO/IEC 10646” (L2/07-354). http://www.unicode.org/L2/L2007/07354-north-indic.pdf
Pandey, Anshuman. 2009. “Proposal to Encode the Takri Script in ISO/IEC 10646” (L2/09-424). http://www.unicode.org/L2/L2009/09424-takri.pdf
Pandey, Anshuman. 2015. “Preliminary Proposal to Encode the Dogra Script in Unicode” (L2/15-213). http://www.unicode.org/L2/L2015/15213-dogra.pdf
Staal, Fritz. 1984. Stamps of Jammu & Kashmir. New York: The Collectors Club.
von der Lin, Carol A. “Collection Kashmir”. “Philatelic Glossary: Dōgrī-Takrī Scripts”. http://www.kashmirstamps.com/DogriGloss.html
6 Acknowledgments
I would like to thank Christopher Shackle (School of Oriental and African Studies, London) for reviewing this proposal.
This project was made possible in part through a Google Research Award granted to Deborah Anderson for the Script Encoding Initiative and a grant from the United States National Endowment for the Humanities (PR-50205-15), which funds the Universal Scripts Project (part of the Script Encoding Initiative at the University of California, Berkeley). Any views, findings, conclusions or recommendations expressed in this publication do not necessarily reflect those of Google or the National Endowment for the Humanities.
[Code chart for the Dogra block, 11800–1184F. Printed using UniBook™ (http://www.unicode.org/unibook/) on 04-Nov-2015.]
Dogra names list (11800–1183A)
Independent vowels
11800 DOGRA LETTER A
11801 DOGRA LETTER AA
11802 DOGRA LETTER I
11803 DOGRA LETTER II
11804 DOGRA LETTER U
11805 DOGRA LETTER UU
11806 DOGRA LETTER E
11807 DOGRA LETTER AI
11808 DOGRA LETTER O
11809 DOGRA LETTER AU
Consonants
1180A DOGRA LETTER KA
1180B DOGRA LETTER KHA
1180C DOGRA LETTER GA
1180D DOGRA LETTER GHA
1180E DOGRA LETTER NGA
1180F DOGRA LETTER CA
11810 DOGRA LETTER CHA
11811 DOGRA LETTER JA
11812 DOGRA LETTER JHA
11813 DOGRA LETTER NYA
11814 DOGRA LETTER TTA
11815 DOGRA LETTER TTHA
11816 DOGRA LETTER DDA
11817 DOGRA LETTER DDHA
11818 DOGRA LETTER NNA
11819 DOGRA LETTER TA
1181A DOGRA LETTER THA
1181B DOGRA LETTER DA
1181C DOGRA LETTER DHA
1181D DOGRA LETTER NA
1181E DOGRA LETTER PA
1181F DOGRA LETTER PHA
11820 DOGRA LETTER BA
11821 DOGRA LETTER BHA
11822 DOGRA LETTER MA
11823 DOGRA LETTER YA
11824 DOGRA LETTER RA
11825 DOGRA LETTER LA
11826 DOGRA LETTER VA
11827 DOGRA LETTER SHA
11828 DOGRA LETTER SSA
11829 DOGRA LETTER SA
1182A DOGRA LETTER HA
1182B DOGRA LETTER RRA
Dependent vowel signs
1182C DOGRA VOWEL SIGN AA
1182D DOGRA VOWEL SIGN I
1182E DOGRA VOWEL SIGN II
1182F DOGRA VOWEL SIGN U
11830 DOGRA VOWEL SIGN UU
11831 DOGRA VOWEL SIGN VOCALIC R
11832 DOGRA VOWEL SIGN E
11833 DOGRA VOWEL SIGN AI
11834 DOGRA VOWEL SIGN O
11835 DOGRA VOWEL SIGN AU
Various signs
11836 DOGRA SIGN ANUSVARA
11837 DOGRA SIGN VISARGA
11838 DOGRA SIGN VIRAMA
11839 DOGRA SIGN NUKTA
Punctuation
1183A DOGRA ABBREVIATION SIGN
Figure 1: Comparison of Old Dogra with Takri and other scripts (from Leitner 1882: Set 2).
Figure 2: Comparison of Old Dogra with Takri and other scripts (from Leitner 1882: Set 2).
Figure 3: Specimen of the Dogri language written in Old Dogra (from Grierson 1916: 760).
Figure 4: Another specimen of the Dogri language written in Old Dogra (from Grierson 1916: 772).
Figure 5: Chart of the Old Dogra script (from Grierson 1916: 641).
Figure 6: Comparison of Gurmukhi, Kangra, and Old Dogra scripts (from Grierson 1916: 642).
Figure 7: Comparison of New Dogra (‘Dogri’) and Devanagari vowel and consonant letters (from Staal 1984: 33).
Figure 8: Comparison of New Dogra (‘Dogri’) and Devanagari vowel signs (from Staal 1984: 34).
Figure 9: The first page of a Dogri translation of the Līlavatī by Bhāskarācārya, a Sanskrit treatise on mathematics, printed in the New Dogra script in 1872 (p. 1). Figure 10 contains the second page of this book.
Figure 10: Another page from the Līlavatī in New Dogra (p. 2).
Figure 11: Page from the Līlavatī showing the use of printed New Dogra digits (p. 3).
Figure 12: Usage of fractions for denoting currency in New Dogra (from Staal 1984: 75). The text appears to be typeset using metal fonts. The fraction signs and currency marks may be represented using characters from the ‘Common Indic Number Forms’ block (U+A830) in Unicode.
Figure 13: Two specimens of a trilingual stamp for expedited mail from Jammu State in Old Dogra, Persian, and Devanagari scripts. The Dogra text reads <1180A, 1180A, 11825> <11811, 11824, 1182F, 11824, 1182E> kakal jarurī. The Persian reads خط ضروری ḵẖat zarūrī; the Devanagari reads आवश्यक पत्र āvaśyaka patra.
Figure 14: Four bilingual regular postage stamps from Jammu State with text in the Old Dogra and Persian scripts. Stamps […] and […] are half ‘anna’ stamps and […] is a one ‘anna’ stamp. In all stamps the Dogra text at the top reads <11811, 11822, 11804> jamu ‘Jammu’ and <1180A, 11810, 11822, 1182E, 11824> kachamīr (= kaśmīr) ‘Kashmir’. In stamps […] and […] the first line of the center is Persian ۱۹۲۳ نم انه 1923 [samvat] nim ānah ‘half ānnā’; the second line of the center is Dogra <11800, 1181B, 11800, 1181D> ada ana ‘half ānnā’. In […] the first line of the center text is Persian یك انه yak ānah ‘one ānnā’; the second line of the center is Dogra <11806, 1180A, 11800, 1181D> ekana ‘one ānnā’. In all four stamps the third line of the center is the date 1923 [samvat] written in Devanagari-like digits १९२३. The Persian text at the bottom of the stamp reads قلمرو سرکار جمون و کشمیر qalamrau sarkār jammūn o kaśmīr ‘dominions of the ruler of Jammu and Kashmir’.
Detail of top left (Devanagari)
Detail of top right (New Dogra)
Detail of center (New Dogra, Devanagari, and Urdu)
Figure 15: The mast of a two rupee non-judicial stamp paper from Kashmir State.
Figure 16: The mast of a five rupee non-judicial stamp paper from Kashmir State containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.
Figure 17: A one anna stamp from Jammu and Kashmir State with text in New Dogra, Devanagari, and Arabic (Urdu).
Figure 18: Telegraph stamps from Jammu and Kashmir State with text in Dogra, Devanagari, Urdu, and English. The Dogra denominations show usage of Common Indic fractions and the rupee mark.
postakarda kalamrau jammu kasmir va tibbataha “Postcard of the realm of Jammu, Kashmir, and Tibet”
isa tarapha siva pate ke kucha na likho “Do not write anything on this side except for the address”
jamu sinagar “Jammu” “Srinagar”
bhaisakh (4 ) 4 vaisakh “Vaisakh 4” “4 Vaisakh”
Figure 19: A state postcard from Jammu and Kashmir with post mark of 19 April 1891. The inscription at top is in New Dogra and the content is in Persian. The postmarks are in English and Dogra.
One rupee banknote
Twenty rupee banknote
Figure 20: Bank notes from Jammu and Kashmir State containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.
Chamba    Dogra
11680    11800
11681    11801
11682    11802
11683    none
11684    11804
11685    none
11686    11806
11687    11807
11688    11808
11689    none
Chamba    Dogra
no dependent form
116AD    1182C
116AE    1182D
116AF    1182E
116B0    1182F
116B1    11830
116B2    11832
116B3    11833
116B4    11834
116B5    11835
Table 2: Comparison of Chamba and Dogra vowel letters and signs.
Chamba    Dogra
1168A    1180A
1168B    1180B
1168C    1180C
1168D    1180D
1168E    1180E
1168F    1180F
11690    11810
11691    11811
11692    11812
11693    11813
11694    11814
11695    11815
11696    11816
11697    11817
11698    11818
11699    11819
1169A    1181A
1169B    1181B
1169C    1181C
1169D    1181D
1169E    1181E
1169F    1181F
116A0    11820
116A1    11821
116A2    11822
116A3    11823
116A4    11824
116A5    11825
116A6    11826
116A7    11827
none    11828
116A8    11829
116A9    1182A
116AA    1182B
Table 3: Comparison of Chamba and Dogra consonant letters.
ISO/IEC JTC 1/SC 2/WG 2
PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646
Please fill all the sections A, B and C below. Please read Principles and Procedures Document (P & P) from http://std.dkuug.dk/JTC1/SC2/WG2/docs/principles.html for guidelines and details before filling this form. Please ensure you are using the latest Form from http://std.dkuug.dk/JTC1/SC2/WG2/docs/summaryform.html. See also http://std.dkuug.dk/JTC1/SC2/WG2/docs/roadmaps.html for latest Roadmaps.
Form number: N4502-F (Original 1994-10-14; Revised 1995-01, 1995-04, 1996-04, 1996-08, 1999-03, 2001-05, 2001-09, 2003-11, 2005-01, 2005-09, 2005-10, 2007-03, 2008-05, 2009-11, 2011-03, 2012-01)
A. Administrative
1. Title: Proposal to encode the Dogra script in Unicode
2. Requester's name: Anshuman Pandey, Script Encoding Initiative
3. Requester type (Member body/Liaison/Individual contribution): Liaison contribution
4. Submission date: 4 November 2015
5. Requester's reference (if applicable):
6. Choose one of the following:
This is a complete proposal: Yes
(or) More information will be provided later:
B. Technical – General
1. Choose one of the following:
a. This proposal is for a new script (set of characters): Yes
Proposed name of script: Dogra
b. The proposal is for addition of character(s) to an existing block:
Name of the existing block:
2. Number of characters in proposal: 59
3. Proposed category (select one from below; see section 2.2 of P&P document):
A-Contemporary
B.1-Specialized (small collection)
B.2-Specialized (large collection)
C-Major extinct
D-Attested extinct
E-Minor extinct: X
F-Archaic Hieroglyphic or Ideographic
G-Obscure or questionable usage symbols
4. Is a repertoire including character names provided? Yes
a. If YES, are the names in accordance with the “character naming guidelines” in Annex L of P&P document?
b. Are the character shapes attached in a legible form suitable for review?
5. Fonts related:
a. Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the standard? Anshuman Pandey
b. Identify the party granting a license for use of the font by the editors (include address, e-mail, ftp-site, etc.): Anshuman Pandey
6. References:
a. Are references (to other character sets, dictionaries, descriptive texts etc.) provided? Yes
b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? Yes
7. Special encoding issues: Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration etc. (if yes please enclose information)? Yes
8. Additional Information: Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org for such information on other scripts. Also see Unicode Character Database (http://www.unicode.org/reports/tr44/) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard.
C. Technical - Justification
1. Has this proposal for addition of character(s) been submitted before? No
If YES, explain:
2. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? Yes
If YES, with whom? Christopher Shackle (SOAS, London)
If YES, available relevant documents:
3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? No
Reference: Historical script; currently not used
4. The context of use for the proposed characters (type of use; common or rare): Common
Reference:
5. Are the proposed characters in current use by the user community? No
If YES, where? Reference:
6. After giving due considerations to the principles in the P&P document, must the proposed characters be entirely in the BMP? N/A
If YES, is a rationale provided?
If YES, reference:
7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? Yes
8. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? No
If YES, is a rationale for its inclusion provided?
If YES, reference:
9. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? No
If YES, is a rationale for its inclusion provided?
If YES, reference:
10. Can any of the proposed character(s) be considered to be similar (in appearance or function) to, or could be confused with, an existing character? No
If YES, is a rationale for its inclusion provided?
If YES, reference:
11. Does the proposal include use of combining characters and/or use of composite sequences? Yes
If YES, is a rationale for such use provided? Yes
If YES, reference: See text of proposal
Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided?
If YES, reference:
12. Does the proposal contain characters with any special properties such as control function or similar semantics? Yes
If YES, describe in detail (include attachment if necessary): Virama; see text of proposal
13. Does the proposal contain any Ideographic compatibility characters? No
If YES, are the equivalent corresponding unified ideographic characters identified?
If YES, reference:
Proposal to encode the Dogra script in Unicode Anshuman Pandey
317 Collation
The primary collating order for Dogra is as follows
71680 lt 71681 lt 71682 lt 71683 lt 71684 lt 71685 lt 71686 lt 71687 lt 71688 lt 71689 lt
71690 lt 71691 lt 71692 lt 71693 lt 71694 lt 71695 lt 71696 lt 71697 lt
71698 lt 71699 lt 71700 lt 71701 lt 71702 lt 71703 lt 71704 lt 71705 lt
71706 lt 71707 lt 71708 lt 71709 lt 71710 lt 71711 lt 71712 lt 71713 lt
71714 lt 71715 lt 71716 lt 71717 lt 71718 lt 71719 lt 71720 lt 71721 lt 71722 lt
71723 lt 71724 lt 71725 lt 71726 lt
71727 lt 71728 lt 71729 lt 71730 lt
71731 lt 71732 lt 71733
The following characters have secondary weights 71734 71735 71736 71737
4 Character data
41 Character properties
The properties for Dogra in the Unicode Character Database format are
11800DOGRA LETTER ALo0LN11801DOGRA LETTER AALo0LN11802DOGRA LETTER ILo0LN11804DOGRA LETTER ULo0LN11806DOGRA LETTER ELo0LN11807DOGRA LETTER AILo0LN11808DOGRA LETTER OLo0LN1180ADOGRA LETTER KALo0LN1180BDOGRA LETTER KHALo0LN1180CDOGRA LETTER GALo0LN1180DDOGRA LETTER GHALo0LN1180EDOGRA LETTER NGALo0LN1180FDOGRA LETTER CALo0LN11810DOGRA LETTER CHALo0LN11811DOGRA LETTER JALo0LN11812DOGRA LETTER JHALo0LN11813DOGRA LETTER NYALo0LN
11
Proposal to encode the Dogra script in Unicode Anshuman Pandey
11814DOGRA LETTER TTALo0LN11815DOGRA LETTER TTHALo0LN11816DOGRA LETTER DDALo0LN11817DOGRA LETTER DDHALo0LN11818DOGRA LETTER NNALo0LN11819DOGRA LETTER TALo0LN1181ADOGRA LETTER THALo0LN1181BDOGRA LETTER DALo0LN1181CDOGRA LETTER DHALo0LN1181DDOGRA LETTER NALo0LN1181EDOGRA LETTER PALo0LN1181FDOGRA LETTER PHALo0LN11820DOGRA LETTER BALo0LN11821DOGRA LETTER BHALo0LN11822DOGRA LETTER MALo0LN11823DOGRA LETTER YALo0LN11824DOGRA LETTER RALo0LN11825DOGRA LETTER LALo0LN11826DOGRA LETTER VALo0LN11827DOGRA LETTER SHALo0LN11828DOGRA LETTER SSALo0LN11829DOGRA LETTER SALo0LN1182ADOGRA LETTER HALo0LN1182BDOGRA LETTER RRALo0LN1182CDOGRA VOWEL SIGN AAMc0LN1182DDOGRA VOWEL SIGN IMc0LN1182EDOGRA VOWEL SIGN IIMc0LN1182FDOGRA VOWEL SIGN UMn0NSMN11830DOGRA VOWEL SIGN UUMn0NSMN11831DOGRA VOWEL SIGN VOCALIC RMn0NSMN11832DOGRA VOWEL SIGN EMn0NSMN11833DOGRA VOWEL SIGN AIMn0NSMN11834DOGRA VOWEL SIGN OMn0NSMN11835DOGRA VOWEL SIGN AUMn0NSMN11836DOGRA SIGN ANUSVARAMn0NSMN11837DOGRA SIGN VISARGAMc0LN11838DOGRA SIGN VIRAMAMn9NSMN11839DOGRA SIGN NUKTAMn7NSMN1183ADOGRA ABBREVIATION SIGNPo0LN
42 Linebreaking
Linebreaking properties given in the data format of LineBreaktxt
118001182B AL Lo [44] DOGRA LETTER A DOGRA LETTER RRA1182C1182E CM Mc [3] DOGRA VOWEL SIGN AA DOGRA VOWEL SIGN II1182F11835 CM Mn [7] DOGRA VOWEL SIGN U DOGRA VOWEL SIGN AU11836 CM Mn DOGRA SIGN ANUSVARA11837 CM Mc DOGRA SIGN VISARGA1183811839 CM Mn [2] DOGRA SIGN VIRAMA DOGRA SIGN NUKTA1183A AL Po DOGRA ABBREVIATION SIGN
43 Syllabic categories
Syllabic categories given in the format of IndicSyllabicCategorytxt
Indic_Syllabic_Category=Bindu11836 Bindu Mn DOGRA SIGN ANUSVARA
12
Proposal to encode the Dogra script in Unicode Anshuman Pandey
Indic_Syllabic_Category=Visarga11837 Visarga Mc DOGRA SIGN VISARGA
Indic_Syllabic_Category=Nukta11839 Nukta Mn DOGRA SIGN NUKTA
Indic_Syllabic_Category=Virama11838 Virama Mn DOGRA SIGN VIRAMA
Indic_Syllabic_Category=Vowel_Independent1180011808 Vowel_Independent Lo [3] DOGRA LETTER ADOGRA LETTER AU
Indic_Syllabic_Category=Vowel_Dependent1182C1182E Vowel_Dependent Mc [3] DOGRA VOWEL SIGN AADOGRA VOWEL SIGN I1182F11835 Vowel_Dependent Mn [7] DOGRA VOWEL SIGN UDOGRA VOWEL SIGN AU
Indic_Syllabic_Category=Consonant1180A1182B Consonant Lo [34] DOGRA LETTER KADOGRA LETTER RRA
44 Positional categories
Positional data for characters in the format of IndicPositionalCategorytxt
Indic_Positional_Category=Right1182C Right Mc DOGRA VOWEL SIGN A1182E Right Mc DOGRA VOWEL SIGN II11837 Right Mc DOGRA SIGN VISARGA
Indic_Positional_Category=Left1182D Left Mc DOGRA VOWEL SIGN I
Indic_Positional_Category=Top11836 Top Mn DOGRA SIGN ANUSVARA1183211835 Top Mn [4] DOGRA VOWEL SIGN EDOGRA VOWEL SIGN AU
Indic_Positional_Category=Bottom1182F11831 Bottom Mn [3] DOGRA VOWEL SIGN UDOGRA VOWEL SIGN VOCALIC R11838 Bottom Mn DOGRA SIGN VIRAMA11839 Bottom Mn DOGRA SIGN NUKTA
45 Script extensions
The following characters should be extended for usage with the Dogra script in ScriptExtensionstxt
0964 Po DEVANAGARI DANDA0965 Po DEVANAGARI DOUBLE DANDA
0966096F Nd [10] DEVANAGARI DIGIT ZERODEVANAGARI DIGIT NINE
A830A835 No [6] NORTH INDIC FRACTION ONE QUARTERNORTH INDIC FRACTION THREE SIXTEENTHS
A836A837 So [2] NORTH INDIC QUARTER MARKNORTH INDIC PLACEHOLDER MARKA838 Sc NORTH INDIC RUPEE MARKA839 So NORTH INDIC QUANTITY MARK
13
Proposal to encode the Dogra script in Unicode Anshuman Pandey
46 lsquoConfusablersquo characters
Some Dogra characters that resemble characters encoded in other script blocks are
DOGRA LETTER U 11187 SHARADA LETTER UDOGRA LETTER UU 11188 SHARADA LETTER UUDOGRA LETTER O 11684 TAKRI LETTER UDOGRA LETTER KA 0A15 GURMUKHI LETTER KADOGRA LETTER CA 11196 SHARADA LETTER CADOGRA LETTER TTHA 1119C SHARADA LETTER TTHADOGRA LETTER TA 11699 TAKRI LETTER TADOGRA LETTER DHA 11680 TAKRI LETTER ADOGRA LETTER PHA 1169F TAKRI LETTER PHADOGRA LETTER BHA 116A1 TAKRI LETTER BHADOGRA LETTER YA 116A3 TAKRI LETTER YADOGRA LETTER SHA 0936 DEVANAGARI LETTER SHADOGRA LETTER SSA 111B0 SHARADA LETTER SSADOGRA VOWEL SIGN U 0941 DEVANAGARI VOWEL SIGN UDOGRA VOWEL SIGN UU 0942 DEVANAGARI VOWEL SIGN UUDOGRA VOWEL SIGN E 116B2 TAKRI VOWEL SIGN EDOGRA VOWEL SIGN AI 116B3 TAKRI VOWEL SIGN AIDOGRA VOWEL SIGN O 116B4 TAKRI VOWEL SIGN ODOGRA VOWEL SIGN AU 116B5 TAKRI VOWEL SIGN AUDOGRA SIGN ANUSVARA 0902 DEVANAGARI SIGN ANUSVARADOGRA SIGN VIRAMA 094D DEVANAGARI SIGN VIRAMADOGRA SIGN NUKTA 093C DEVANAGARI SIGN NUKTADOGRA ABBREVIATION SIGN 0970 DEVANAGARI ABBREVIATION SIGN
5 References
Bhāskarācārya 1872 Līlavatī Translation into Dogri Jammu Vidya Vilas Press
Grierson George A 1916 The Linguistic Survey of India Vol IX Indo-Aryan Family Central GroupPart I Specimens of Western Hindī and Pantildejābī Calcutta Office of the Superintendent of GovernmentPrinting India
Leitner Gottlieb William 1882 ldquoA Collection of Specimens of Commercial and Other Alphabets andHandwritings as also of Multiplication Tables Current in Various Parts of the Panjab Sind and the NorthWest Provincesrdquo Published as Appendix V of History of Indigenous Education in the Punjab LahoreAnjuman-i-Punjab Press
Pandey Anshuman 2007 ldquoProposal to Encode North Indic Number Forms in ISOIEC 10646rdquo (L207-354)httpwwwunicodeorgL2L200707354-north-indicpdf
mdashmdashmdash 2009 ldquoProposal to Encode the Takri Script in ISOIEC 10646rdquo (L209-424)httpwwwunicodeorgL2L200909424-takripdf
mdashmdashmdash 2015 ldquoPreliminary Proposal to Encode the Dogra Script in Unicoderdquo (L215-213)httpwwwunicodeorgL2L201515213-dograpdf
Staal Fritz 1984 Stamps of Jammu amp Kashmir New York The Collectors Club
von der Lin Carol A ldquoCollection Kashmirrdquo ldquoPhilatelic Glossary Dōgrī-Takrī ScriptsrdquohttpwwwkashmirstampscomDogriGlosshtml
14
Proposal to encode the Dogra script in Unicode Anshuman Pandey
6 Acknowledgments
I would like to thank Christopher Shackle (School of Oriental and African Studies London) for reviewingthis proposal
This project was made possible in part through a Google Research Award granted to Deborah Anderson forthe Script Encoding Initiative and a grant from the United States National Endowment for the Humanities(PR-50205-15) which funds the Universal Scripts Project (part of the Script Encoding Initiative at the Uni-versity of California Berkeley) Any views findings conclusions or recommendations expressed in thispublication do not necessarily reflect those of Google or the National Endowment for the Humanities
15
Printed using UniBooktrade(httpwwwunicodeorgunibook)
Printed 04-Nov-2015 1
1184FDogra11800
1180 1181 1182 1183 1184
a
71681
i
71683
u
71685
e
71687
o
71689
71690
71691
71692
71693
71694
71695
71696
71697
71698
71699
71700
71701
71702
71703
71704
71705
71706
71707
71708
71709
71710
71711
71712
71713
71714
71715
71716
71717
71718
71719
71720
71721
71722
71723
$71724
$ i
$71726
$ u
$ 71728
$ r
$ e
$ 71731
$ o
$ 71733
$ 71734
$71735
$ 71736
$ 71737
71738
11800
11801
11802
11803
11804
11805
11806
11807
11808
11809
1180A
1180B
1180C
1180D
1180E
1180F
11810
11811
11812
11813
11814
11815
11816
11817
11818
11819
1181A
1181B
1181C
1181D
1181E
1181F
11820
11821
11822
11823
11824
11825
11826
11827
11828
11829
1182A
1182B
1182C
1182D
1182E
1182F
11830
11831
11832
11833
11834
11835
11836
11837
11838
11839
1183A
0
1
2
3
4
5
6
7
8
9
A
B
C
D
E
F
Printed using UniBooktrade(httpwwwunicodeorgunibook)
Printed 04-Nov-2015 2
1183ADogra11800
Punctuation1183A 71738 DOGRA ABBREVIATION SIGN
Independent vowels11800 a DOGRA LETTER A11801 71681 DOGRA LETTER AA11802 i DOGRA LETTER I11803 71683 DOGRA LETTER II11804 u DOGRA LETTER U11805 71685 DOGRA LETTER UU11806 e DOGRA LETTER E11807 71687 DOGRA LETTER AI11808 o DOGRA LETTER O11809 71689 DOGRA LETTER AU
Consonants1180A 71690 DOGRA LETTER KA1180B 71691 DOGRA LETTER KHA1180C 71692 DOGRA LETTER GA1180D 71693 DOGRA LETTER GHA1180E 71694 DOGRA LETTER NGA1180F 71695 DOGRA LETTER CA11810 71696 DOGRA LETTER CHA11811 71697 DOGRA LETTER JA11812 71698 DOGRA LETTER JHA11813 71699 DOGRA LETTER NYA11814 71700 DOGRA LETTER TTA11815 71701 DOGRA LETTER TTHA11816 71702 DOGRA LETTER DDA11817 71703 DOGRA LETTER DDHA11818 71704 DOGRA LETTER NNA11819 71705 DOGRA LETTER TA1181A 71706 DOGRA LETTER THA1181B 71707 DOGRA LETTER DA1181C 71708 DOGRA LETTER DHA1181D 71709 DOGRA LETTER NA1181E 71710 DOGRA LETTER PA1181F 71711 DOGRA LETTER PHA11820 71712 DOGRA LETTER BA11821 71713 DOGRA LETTER BHA11822 71714 DOGRA LETTER MA11823 71715 DOGRA LETTER YA11824 71716 DOGRA LETTER RA11825 71717 DOGRA LETTER LA11826 71718 DOGRA LETTER VA11827 71719 DOGRA LETTER SHA11828 71720 DOGRA LETTER SSA11829 71721 DOGRA LETTER SA1182A 71722 DOGRA LETTER HA1182B 71723 DOGRA LETTER RRA
Dependent vowel signs1182C $71724 DOGRA VOWEL SIGN AA1182D $ i DOGRA VOWEL SIGN I1182E $71726 DOGRA VOWEL SIGN II1182F $ u DOGRA VOWEL SIGN U11830 $ 71728 DOGRA VOWEL SIGN UU11831 $ r DOGRA VOWEL SIGN VOCALIC R11832 $ e DOGRA VOWEL SIGN E11833 $ 71731 DOGRA VOWEL SIGN AI11834 $ o DOGRA VOWEL SIGN O11835 $ 71733 DOGRA VOWEL SIGN AU
Various signs11836 $ 71734 DOGRA SIGN ANUSVARA11837 $71735 DOGRA SIGN VISARGA11838 $ 71736 DOGRA SIGN VIRAMA11839 $ 71737 DOGRA SIGN NUKTA
Figure 1: Comparison of Old Dogra with Takri and other scripts (from Leitner 1882: Set 2).
Figure 2: Comparison of Old Dogra with Takri and other scripts (from Leitner 1882: Set 2).
Figure 3: Specimen of the Dogri language written in Old Dogra (from Grierson 1916: 760).
Figure 4: Another specimen of the Dogri language written in Old Dogra (from Grierson 1916: 772).
Figure 5: Chart of the Old Dogra script (from Grierson 1916: 641).
Figure 6: Comparison of Gurmukhi, Kangra, and Old Dogra scripts (from Grierson 1916: 642).
Figure 7: Comparison of New Dogra (‘Dogri’) and Devanagari vowel and consonant letters (from Staal 1984: 33).
Figure 8: Comparison of New Dogra (‘Dogri’) and Devanagari vowel signs (from Staal 1984: 34).
Figure 9: The first page of a Dogri translation of the Līlavatī by Bhāskarācārya, a Sanskrit treatise on mathematics, printed in the New Dogra script in 1872 (p. 1). Figure 10 contains the second page of this book.
Figure 10: Another page from the Līlavatī in New Dogra (p. 2).
Figure 11: Page from the Līlavatī showing the use of printed New Dogra digits (p. 3).
Figure 12: Usage of fractions for denoting currency in New Dogra (from Staal 1984: 75). The text appears to be typeset using metal fonts. The fraction signs and currency marks may be represented using characters from the ‘Common Indic Number Forms’ block (U+A830) in Unicode.
Figure 13: Two specimens of a trilingual stamp for expedited mail from Jammu State in Old Dogra, Persian, and Devanagari scripts. The Dogra text reads U+1180A U+1180A U+11825, U+11811 U+11824 U+1182F U+11824 U+1182E kakal jarurī. The Persian reads خط ضروری ḵẖat zarūrī; the Devanagari reads आवश्यक पत्र āvaśyaka patra.
Figure 14: Four bilingual regular postage stamps from Jammu State with text in the Old Dogra and Persian scripts. The stamps comprise half ‘anna’ stamps and a one ‘anna’ stamp. In all stamps the Dogra text at the top reads U+11811 U+11822 U+11804 jamu ‘Jammu’ and U+1180A U+11810 U+11822 U+1182E U+11824 kachamīr (= kaśmīr) ‘Kashmir’. In the half ‘anna’ stamps the first line of the center is Persian ۱۹۲۳ نم انه 1923 [samvat] nim ānah ‘half ānnā’; the second line of the center is Dogra U+11800 U+1181B U+11800 U+1181D ada ana ‘half ānnā’. In the one ‘anna’ stamp the first line of the center text is Persian یك انه yak ānah ‘one ānnā’; the second line of the center is Dogra U+11806 U+1180A U+11800 U+1181D ekana ‘one ānnā’. In all four stamps the third line of the center is the date 1923 [samvat] written in Devanagari-like digits १९२३. The Persian text at the bottom of the stamp reads قلمرو سرکار جمون و کشمیر qalamrau sarkār jammūn o kaśmīr ‘dominions of the ruler of Jammu and Kashmir’.
Detail of top left (Devanagari)
Detail of top right (New Dogra)
Detail of center (New Dogra, Devanagari, and Urdu)
Figure 15: The mast of a two rupee non-judicial stamp paper from Kashmir State.
Figure 16: The mast of a five rupee non-judicial stamp paper from Kashmir State containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.
Figure 17: A one anna stamp from Jammu and Kashmir State with text in New Dogra, Devanagari, and Arabic (Urdu).
Figure 18: Telegraph stamps from Jammu and Kashmir State with text in Dogra, Devanagari, Urdu, and English. The Dogra denominations show usage of Common Indic fractions and the rupee mark.
postakarda kalamrau jammu kasmir va tibbataha: “Postcard of the realm of Jammu, Kashmir, and Tibet”
isa tarapha siva pate ke kucha na likho: “Do not write anything on this side except for the address”
jamu, sinagar: “Jammu”, “Srinagar”
bhaisakh (4 ), 4 vaisakh: “Vaisakh 4”, “4 Vaisakh”
Figure 19: A state postcard from Jammu and Kashmir with postmark of 19 April 1891. The inscription at top is in New Dogra and the content is in Persian. The postmarks are in English and Dogra.
One rupee banknote
Twenty rupee banknote
Figure 20: Bank notes from Jammu and Kashmir State containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.
Vowel letters
Chamba     Dogra
U+11680    U+11800
U+11681    U+11801
U+11682    U+11802
U+11683    (none)
U+11684    U+11804
U+11685    (none)
U+11686    U+11806
U+11687    U+11807
U+11688    U+11808
U+11689    (none)

Vowel signs
Chamba     Dogra
(no dependent form)
U+116AD    U+1182C
U+116AE    U+1182D
U+116AF    U+1182E
U+116B0    U+1182F
U+116B1    U+11830
U+116B2    U+11832
U+116B3    U+11833
U+116B4    U+11834
U+116B5    U+11835

Table 2: Comparison of Chamba and Dogra vowel letters and signs. Chamba forms are identified by their code points in the ‘Takri’ block; Dogra forms by their proposed code points.
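Chamba     Dogra
U+1168A    U+1180A
U+1168B    U+1180B
U+1168C    U+1180C
U+1168D    U+1180D
U+1168E    U+1180E
U+1168F    U+1180F
U+11690    U+11810
U+11691    U+11811
U+11692    U+11812
U+11693    U+11813
U+11694    U+11814
U+11695    U+11815
U+11696    U+11816
U+11697    U+11817
U+11698    U+11818
U+11699    U+11819
U+1169A    U+1181A
U+1169B    U+1181B
U+1169C    U+1181C
U+1169D    U+1181D
U+1169E    U+1181E
U+1169F    U+1181F
U+116A0    U+11820
U+116A1    U+11821
U+116A2    U+11822
U+116A3    U+11823
U+116A4    U+11824
U+116A5    U+11825
U+116A6    U+11826
U+116A7    U+11827
(none)     U+11828
U+116A8    U+11829
U+116A9    U+1182A
U+116AA    U+1182B

Table 3: Comparison of Chamba and Dogra consonant letters. Chamba forms are identified by their code points in the ‘Takri’ block; Dogra forms by their proposed code points.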
ISO/IEC JTC 1/SC 2/WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646

Please fill all the sections A, B and C below. Please read Principles and Procedures Document (P & P) from http://std.dkuug.dk/JTC1/SC2/WG2/docs/principles.html for guidelines and details before filling this form. Please ensure you are using the latest Form from http://std.dkuug.dk/JTC1/SC2/WG2/docs/summaryform.html. See also http://std.dkuug.dk/JTC1/SC2/WG2/docs/roadmaps.html for latest Roadmaps.

Form number: N4502-F (Original 1994-10-14; Revised 1995-01, 1995-04, 1996-04, 1996-08, 1999-03, 2001-05, 2001-09, 2003-11, 2005-01, 2005-09, 2005-10, 2007-03, 2008-05, 2009-11, 2011-03, 2012-01)

A. Administrative
1. Title: Proposal to encode the Dogra script in Unicode
2. Requester's name: Anshuman Pandey, Script Encoding Initiative
3. Requester type (Member body/Liaison/Individual contribution): Liaison contribution
4. Submission date: 4 November 2015
5. Requester's reference (if applicable):
6. Choose one of the following:
   This is a complete proposal: Yes
   (or) More information will be provided later:

B. Technical – General
1. Choose one of the following:
   a. This proposal is for a new script (set of characters): Yes
      Proposed name of script: Dogra
   b. The proposal is for addition of character(s) to an existing block:
      Name of the existing block:
2. Number of characters in proposal: 59
3. Proposed category (select one from below; see section 2.2 of P&P document):
   A-Contemporary
   B.1-Specialized (small collection)
   B.2-Specialized (large collection)
   C-Major extinct
   D-Attested extinct
   E-Minor extinct: X
   F-Archaic Hieroglyphic or Ideographic
   G-Obscure or questionable usage symbols
4. Is a repertoire including character names provided? Yes
   a. If YES, are the names in accordance with the “character naming guidelines” in Annex L of P&P document?
   b. Are the character shapes attached in a legible form suitable for review?
5. Fonts related:
   a. Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the standard? Anshuman Pandey
   b. Identify the party granting a license for use of the font by the editors (include address, e-mail, ftp-site, etc.): Anshuman Pandey
6. References:
   a. Are references (to other character sets, dictionaries, descriptive texts etc.) provided? Yes
   b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? Yes
7. Special encoding issues: Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration etc. (if yes please enclose information)? Yes
8. Additional Information: Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org for such information on other scripts. Also see Unicode Character Database (http://www.unicode.org/reports/tr44/) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard.

C. Technical - Justification
1. Has this proposal for addition of character(s) been submitted before? No
   If YES explain:
2. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? Yes
   If YES, with whom? Christopher Shackle (SOAS, London)
   If YES, available relevant documents:
3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? No
   Reference: Historical script, currently not used
4. The context of use for the proposed characters (type of use; common or rare): Common
   Reference:
5. Are the proposed characters in current use by the user community? No
   If YES, where? Reference:
6. After giving due considerations to the principles in the P&P document must the proposed characters be entirely in the BMP? N/A
   If YES, is a rationale provided? If YES, reference:
7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? Yes
8. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? No
   If YES, is a rationale for its inclusion provided? If YES, reference:
9. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? No
   If YES, is a rationale for its inclusion provided? If YES, reference:
10. Can any of the proposed character(s) be considered to be similar (in appearance or function) to, or could be confused with, an existing character? No
    If YES, is a rationale for its inclusion provided? If YES, reference:
11. Does the proposal include use of combining characters and/or use of composite sequences? Yes
    If YES, is a rationale for such use provided? Yes
    If YES, reference: See text of proposal
    Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided? If YES, reference:
12. Does the proposal contain characters with any special properties such as control function or similar semantics? Yes
    If YES, describe in detail (include attachment if necessary): Virama; see text of proposal
13. Does the proposal contain any Ideographic compatibility characters? No
    If YES, are the equivalent corresponding unified ideographic characters identified? If YES, reference:
11814;DOGRA LETTER TTA;Lo;0;L;;;;;N;;;;;
11815;DOGRA LETTER TTHA;Lo;0;L;;;;;N;;;;;
11816;DOGRA LETTER DDA;Lo;0;L;;;;;N;;;;;
11817;DOGRA LETTER DDHA;Lo;0;L;;;;;N;;;;;
11818;DOGRA LETTER NNA;Lo;0;L;;;;;N;;;;;
11819;DOGRA LETTER TA;Lo;0;L;;;;;N;;;;;
1181A;DOGRA LETTER THA;Lo;0;L;;;;;N;;;;;
1181B;DOGRA LETTER DA;Lo;0;L;;;;;N;;;;;
1181C;DOGRA LETTER DHA;Lo;0;L;;;;;N;;;;;
1181D;DOGRA LETTER NA;Lo;0;L;;;;;N;;;;;
1181E;DOGRA LETTER PA;Lo;0;L;;;;;N;;;;;
1181F;DOGRA LETTER PHA;Lo;0;L;;;;;N;;;;;
11820;DOGRA LETTER BA;Lo;0;L;;;;;N;;;;;
11821;DOGRA LETTER BHA;Lo;0;L;;;;;N;;;;;
11822;DOGRA LETTER MA;Lo;0;L;;;;;N;;;;;
11823;DOGRA LETTER YA;Lo;0;L;;;;;N;;;;;
11824;DOGRA LETTER RA;Lo;0;L;;;;;N;;;;;
11825;DOGRA LETTER LA;Lo;0;L;;;;;N;;;;;
11826;DOGRA LETTER VA;Lo;0;L;;;;;N;;;;;
11827;DOGRA LETTER SHA;Lo;0;L;;;;;N;;;;;
11828;DOGRA LETTER SSA;Lo;0;L;;;;;N;;;;;
11829;DOGRA LETTER SA;Lo;0;L;;;;;N;;;;;
1182A;DOGRA LETTER HA;Lo;0;L;;;;;N;;;;;
1182B;DOGRA LETTER RRA;Lo;0;L;;;;;N;;;;;
1182C;DOGRA VOWEL SIGN AA;Mc;0;L;;;;;N;;;;;
1182D;DOGRA VOWEL SIGN I;Mc;0;L;;;;;N;;;;;
1182E;DOGRA VOWEL SIGN II;Mc;0;L;;;;;N;;;;;
1182F;DOGRA VOWEL SIGN U;Mn;0;NSM;;;;;N;;;;;
11830;DOGRA VOWEL SIGN UU;Mn;0;NSM;;;;;N;;;;;
11831;DOGRA VOWEL SIGN VOCALIC R;Mn;0;NSM;;;;;N;;;;;
11832;DOGRA VOWEL SIGN E;Mn;0;NSM;;;;;N;;;;;
11833;DOGRA VOWEL SIGN AI;Mn;0;NSM;;;;;N;;;;;
11834;DOGRA VOWEL SIGN O;Mn;0;NSM;;;;;N;;;;;
11835;DOGRA VOWEL SIGN AU;Mn;0;NSM;;;;;N;;;;;
11836;DOGRA SIGN ANUSVARA;Mn;0;NSM;;;;;N;;;;;
11837;DOGRA SIGN VISARGA;Mc;0;L;;;;;N;;;;;
11838;DOGRA SIGN VIRAMA;Mn;9;NSM;;;;;N;;;;;
11839;DOGRA SIGN NUKTA;Mn;7;NSM;;;;;N;;;;;
1183A;DOGRA ABBREVIATION SIGN;Po;0;L;;;;;N;;;;;
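The property records above follow the semicolon-delimited layout of UnicodeData.txt. As a minimal sketch, assuming the standard 15-field layout of that file, the records could be read as follows. The names `UcdRecord` and `parse_record` are illustrative only and are not part of the proposal.

```python
from typing import NamedTuple


class UcdRecord(NamedTuple):
    """The subset of UnicodeData.txt fields that the Dogra records populate."""
    code_point: int
    name: str
    general_category: str
    combining_class: int
    bidi_class: str
    mirrored: str


def parse_record(line: str) -> UcdRecord:
    # UnicodeData.txt uses 15 semicolon-separated fields; empty fields are kept.
    fields = line.strip().split(";")
    if len(fields) != 15:
        raise ValueError(f"expected 15 fields, got {len(fields)}: {line!r}")
    return UcdRecord(
        code_point=int(fields[0], 16),
        name=fields[1],
        general_category=fields[2],
        combining_class=int(fields[3]),
        bidi_class=fields[4],
        mirrored=fields[9],
    )


# A few records taken from the data above.
SAMPLE = [
    "11814;DOGRA LETTER TTA;Lo;0;L;;;;;N;;;;;",
    "1182F;DOGRA VOWEL SIGN U;Mn;0;NSM;;;;;N;;;;;",
    "11838;DOGRA SIGN VIRAMA;Mn;9;NSM;;;;;N;;;;;",
    "11839;DOGRA SIGN NUKTA;Mn;7;NSM;;;;;N;;;;;",
]

records = [parse_record(line) for line in SAMPLE]
for rec in records:
    print(f"U+{rec.code_point:04X} {rec.name}: "
          f"gc={rec.general_category} ccc={rec.combining_class} bidi={rec.bidi_class}")
```

Only the virama and nukta carry a non-zero canonical combining class (9 and 7), which matches the convention used for other Indic scripts.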
4.2 Linebreaking

Linebreaking properties, given in the data format of LineBreak.txt:

11800..1182B;AL # Lo  [44] DOGRA LETTER A..DOGRA LETTER RRA
1182C..1182E;CM # Mc   [3] DOGRA VOWEL SIGN AA..DOGRA VOWEL SIGN II
1182F..11835;CM # Mn   [7] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN AU
11836;CM       # Mn       DOGRA SIGN ANUSVARA
11837;CM       # Mc       DOGRA SIGN VISARGA
11838..11839;CM # Mn   [2] DOGRA SIGN VIRAMA..DOGRA SIGN NUKTA
1183A;AL       # Po       DOGRA ABBREVIATION SIGN
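The bracketed counts in the linebreaking data can be checked mechanically against the ranges they annotate. The following is a minimal sketch, assuming the usual `first..last;class # gc [count]` layout of LineBreak.txt; the function name `parse_line_break` and the regular expression are illustrative and not part of the proposal.

```python
import re

LINE_BREAK_DATA = """\
11800..1182B;AL # Lo [44] DOGRA LETTER A..DOGRA LETTER RRA
1182C..1182E;CM # Mc [3] DOGRA VOWEL SIGN AA..DOGRA VOWEL SIGN II
1182F..11835;CM # Mn [7] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN AU
11836;CM # Mn DOGRA SIGN ANUSVARA
11837;CM # Mc DOGRA SIGN VISARGA
11838..11839;CM # Mn [2] DOGRA SIGN VIRAMA..DOGRA SIGN NUKTA
1183A;AL # Po DOGRA ABBREVIATION SIGN
"""

# code point or range ; line break class # general category [optional count]
ENTRY = re.compile(
    r"^([0-9A-F]+)(?:\.\.([0-9A-F]+))?\s*;\s*(\w+)\s*#\s*(\w+)\s*(?:\[(\d+)\])?"
)


def parse_line_break(data: str) -> dict:
    """Map each code point to its line break class, checking declared counts."""
    table = {}
    for line in data.splitlines():
        match = ENTRY.match(line)
        if match is None:
            continue  # skip blank lines and pure comments
        first = int(match.group(1), 16)
        last = int(match.group(2), 16) if match.group(2) else first
        declared = int(match.group(5)) if match.group(5) else 1
        if last - first + 1 != declared:
            raise ValueError(f"count mismatch in: {line!r}")
        for cp in range(first, last + 1):
            table[cp] = match.group(3)
    return table


line_break = parse_line_break(LINE_BREAK_DATA)
print(len(line_break), "code points classified")
```

The seven entries together cover the 59 proposed characters from 11800 to 1183A without gaps or overlaps.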
4.3 Syllabic categories

Syllabic categories, given in the format of IndicSyllabicCategory.txt:

# Indic_Syllabic_Category=Bindu
11836        ; Bindu             # Mn       DOGRA SIGN ANUSVARA

# Indic_Syllabic_Category=Visarga
11837        ; Visarga           # Mc       DOGRA SIGN VISARGA

# Indic_Syllabic_Category=Nukta
11839        ; Nukta             # Mn       DOGRA SIGN NUKTA

# Indic_Syllabic_Category=Virama
11838        ; Virama            # Mn       DOGRA SIGN VIRAMA

# Indic_Syllabic_Category=Vowel_Independent
11800..11809 ; Vowel_Independent # Lo  [10] DOGRA LETTER A..DOGRA LETTER AU

# Indic_Syllabic_Category=Vowel_Dependent
1182C..1182E ; Vowel_Dependent   # Mc   [3] DOGRA VOWEL SIGN AA..DOGRA VOWEL SIGN II
1182F..11835 ; Vowel_Dependent   # Mn   [7] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN AU

# Indic_Syllabic_Category=Consonant
1180A..1182B ; Consonant         # Lo  [34] DOGRA LETTER KA..DOGRA LETTER RRA
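To illustrate how the syllabic categories might be consumed, the following sketch flags dependent vowel signs that do not follow a consonant (optionally with a nukta in between). It is a deliberately simplified model rather than a full syllable grammar, the ranges are those proposed in this document, and the function names are hypothetical.

```python
from typing import List

# Single-character categories proposed in this section.
_SIGNS = {
    0x11836: "Bindu",
    0x11837: "Visarga",
    0x11838: "Virama",
    0x11839: "Nukta",
}


def syllabic_category(cp: int) -> str:
    """Return the proposed Indic_Syllabic_Category for a code point."""
    if 0x11800 <= cp <= 0x11809:
        return "Vowel_Independent"
    if 0x1180A <= cp <= 0x1182B:
        return "Consonant"
    if 0x1182C <= cp <= 0x11835:
        return "Vowel_Dependent"
    return _SIGNS.get(cp, "Other")


def stray_vowel_signs(text: str) -> List[int]:
    """Indexes of dependent vowel signs not preceded by a consonant (+ nukta).

    Simplification: a vowel sign after an independent vowel is also reported.
    """
    stray = []
    for i, ch in enumerate(text):
        if syllabic_category(ord(ch)) != "Vowel_Dependent":
            continue
        j = i - 1
        if j >= 0 and syllabic_category(ord(text[j])) == "Nukta":
            j -= 1  # allow consonant + nukta + vowel sign
        if j < 0 or syllabic_category(ord(text[j])) != "Consonant":
            stray.append(i)
    return stray


# JA RA U RA II, the word transliterated as "jarurī" in Figure 13.
jaruri = "".join(chr(cp) for cp in (0x11811, 0x11824, 0x1182F, 0x11824, 0x1182E))
print(stray_vowel_signs(jaruri))
```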
4.4 Positional categories

Positional data for characters, in the format of IndicPositionalCategory.txt:

# Indic_Positional_Category=Right
1182C        ; Right  # Mc       DOGRA VOWEL SIGN AA
1182E        ; Right  # Mc       DOGRA VOWEL SIGN II
11837        ; Right  # Mc       DOGRA SIGN VISARGA

# Indic_Positional_Category=Left
1182D        ; Left   # Mc       DOGRA VOWEL SIGN I

# Indic_Positional_Category=Top
11836        ; Top    # Mn       DOGRA SIGN ANUSVARA
11832..11835 ; Top    # Mn   [4] DOGRA VOWEL SIGN E..DOGRA VOWEL SIGN AU

# Indic_Positional_Category=Bottom
1182F..11831 ; Bottom # Mn   [3] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN VOCALIC R
11838        ; Bottom # Mn       DOGRA SIGN VIRAMA
11839        ; Bottom # Mn       DOGRA SIGN NUKTA
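The positional data matters mainly for rendering: a sign with the `Left` category is stored after its base consonant but drawn before it. The sketch below shows this for a single consonant-plus-signs cluster only. It assumes the positions listed above, ignores conjuncts and all other shaping behaviour, and uses hypothetical names throughout.

```python
from typing import List

# Positions as proposed in this section, keyed by code point.
POSITION = {0x1182C: "Right", 0x1182E: "Right", 0x11837: "Right", 0x1182D: "Left"}
POSITION.update({cp: "Top" for cp in (0x11836, 0x11832, 0x11833, 0x11834, 0x11835)})
POSITION.update({cp: "Bottom" for cp in (0x1182F, 0x11830, 0x11831, 0x11838, 0x11839)})


def visual_order(cluster: List[int]) -> List[int]:
    """Move left-positioned signs in front of the rest of a simple cluster.

    `cluster` is one base letter followed by its dependent signs, in logical
    (stored) order. Only left-side reordering is modelled here.
    """
    left = [cp for cp in cluster if POSITION.get(cp) == "Left"]
    rest = [cp for cp in cluster if POSITION.get(cp) != "Left"]
    return left + rest


ka_i = [0x1180A, 0x1182D]   # KA + VOWEL SIGN I, logical order
ka_aa = [0x1180A, 0x1182C]  # KA + VOWEL SIGN AA, logical order
print([f"U+{cp:04X}" for cp in visual_order(ka_i)])
print([f"U+{cp:04X}" for cp in visual_order(ka_aa)])
```

In practice this reordering is performed by the shaping engine and font, so text should always be stored in logical order.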
4.5 Script extensions

The following characters should be extended for usage with the Dogra script in ScriptExtensions.txt:

0964         # Po       DEVANAGARI DANDA
0965         # Po       DEVANAGARI DOUBLE DANDA
0966..096F   # Nd  [10] DEVANAGARI DIGIT ZERO..DEVANAGARI DIGIT NINE
A830..A835   # No   [6] NORTH INDIC FRACTION ONE QUARTER..NORTH INDIC FRACTION THREE SIXTEENTHS
A836..A837   # So   [2] NORTH INDIC QUARTER MARK..NORTH INDIC PLACEHOLDER MARK
A838         # Sc       NORTH INDIC RUPEE MARK
A839         # So       NORTH INDIC QUANTITY MARK
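One practical consequence of these script extensions is that a Dogra text run may legitimately contain Devanagari dandas and digits and the North Indic number forms. A minimal sketch of such a membership check follows, assuming the proposed block range 11800..1183A and the ranges listed above; the function names are hypothetical.

```python
# Proposed Dogra block (11800..1183A) and the shared characters listed above.
DOGRA_BLOCK = range(0x11800, 0x1183B)
SHARED_RANGES = [
    range(0x0964, 0x0966),  # DEVANAGARI DANDA, DOUBLE DANDA
    range(0x0966, 0x0970),  # DEVANAGARI DIGIT ZERO..NINE
    range(0xA830, 0xA83A),  # NORTH INDIC fractions and marks
]


def usable_with_dogra(ch: str) -> bool:
    """True if the character is in the Dogra block or extended for Dogra."""
    cp = ord(ch)
    return cp in DOGRA_BLOCK or any(cp in r for r in SHARED_RANGES)


def foreign_characters(text: str) -> list:
    """Non-space characters that are neither Dogra nor extended for Dogra."""
    return [ch for ch in text if not ch.isspace() and not usable_with_dogra(ch)]


# JA MA U ("jamu"), a danda, the Devanagari digits 1923, and a stray Latin letter.
sample = "\U00011811\U00011822\U00011804 \u0964 \u0967\u096F\u0968\u0969 x"
print(foreign_characters(sample))
```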
4.6 ‘Confusable’ characters

Some Dogra characters that resemble characters encoded in other script blocks are:

DOGRA LETTER U             11187  SHARADA LETTER U
DOGRA LETTER UU            11188  SHARADA LETTER UU
DOGRA LETTER O             11684  TAKRI LETTER U
DOGRA LETTER KA            0A15   GURMUKHI LETTER KA
DOGRA LETTER CA            11196  SHARADA LETTER CA
DOGRA LETTER TTHA          1119C  SHARADA LETTER TTHA
DOGRA LETTER TA            11699  TAKRI LETTER TA
DOGRA LETTER DHA           11680  TAKRI LETTER A
DOGRA LETTER PHA           1169F  TAKRI LETTER PHA
DOGRA LETTER BHA           116A1  TAKRI LETTER BHA
DOGRA LETTER YA            116A3  TAKRI LETTER YA
DOGRA LETTER SHA           0936   DEVANAGARI LETTER SHA
DOGRA LETTER SSA           111B0  SHARADA LETTER SSA
DOGRA VOWEL SIGN U         0941   DEVANAGARI VOWEL SIGN U
DOGRA VOWEL SIGN UU        0942   DEVANAGARI VOWEL SIGN UU
DOGRA VOWEL SIGN E         116B2  TAKRI VOWEL SIGN E
DOGRA VOWEL SIGN AI        116B3  TAKRI VOWEL SIGN AI
DOGRA VOWEL SIGN O         116B4  TAKRI VOWEL SIGN O
DOGRA VOWEL SIGN AU        116B5  TAKRI VOWEL SIGN AU
DOGRA SIGN ANUSVARA        0902   DEVANAGARI SIGN ANUSVARA
DOGRA SIGN VIRAMA          094D   DEVANAGARI SIGN VIRAMA
DOGRA SIGN NUKTA           093C   DEVANAGARI SIGN NUKTA
DOGRA ABBREVIATION SIGN    0970   DEVANAGARI ABBREVIATION SIGN
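A list of this kind can be turned into a simple lookup for flagging Dogra characters that have a look-alike in another script. The sketch below pairs each listed character with its proposed Dogra code point, taken from the names list in this document. The dictionary and function names are hypothetical, and this is not the data format of the Unicode confusables file.

```python
# Proposed Dogra code point -> look-alike code point in another script block.
LOOKALIKES = {
    0x11804: 0x11187,  # LETTER U    ~ SHARADA LETTER U
    0x11805: 0x11188,  # LETTER UU   ~ SHARADA LETTER UU
    0x11808: 0x11684,  # LETTER O    ~ TAKRI LETTER U
    0x1180A: 0x0A15,   # LETTER KA   ~ GURMUKHI LETTER KA
    0x1180F: 0x11196,  # LETTER CA   ~ SHARADA LETTER CA
    0x11815: 0x1119C,  # LETTER TTHA ~ SHARADA LETTER TTHA
    0x11819: 0x11699,  # LETTER TA   ~ TAKRI LETTER TA
    0x1181C: 0x11680,  # LETTER DHA  ~ TAKRI LETTER A
    0x1181F: 0x1169F,  # LETTER PHA  ~ TAKRI LETTER PHA
    0x11821: 0x116A1,  # LETTER BHA  ~ TAKRI LETTER BHA
    0x11823: 0x116A3,  # LETTER YA   ~ TAKRI LETTER YA
    0x11827: 0x0936,   # LETTER SHA  ~ DEVANAGARI LETTER SHA
    0x11828: 0x111B0,  # LETTER SSA  ~ SHARADA LETTER SSA
    0x1182F: 0x0941,   # VOWEL SIGN U  ~ DEVANAGARI VOWEL SIGN U
    0x11830: 0x0942,   # VOWEL SIGN UU ~ DEVANAGARI VOWEL SIGN UU
    0x11832: 0x116B2,  # VOWEL SIGN E  ~ TAKRI VOWEL SIGN E
    0x11833: 0x116B3,  # VOWEL SIGN AI ~ TAKRI VOWEL SIGN AI
    0x11834: 0x116B4,  # VOWEL SIGN O  ~ TAKRI VOWEL SIGN O
    0x11835: 0x116B5,  # VOWEL SIGN AU ~ TAKRI VOWEL SIGN AU
    0x11836: 0x0902,   # SIGN ANUSVARA ~ DEVANAGARI SIGN ANUSVARA
    0x11838: 0x094D,   # SIGN VIRAMA   ~ DEVANAGARI SIGN VIRAMA
    0x11839: 0x093C,   # SIGN NUKTA    ~ DEVANAGARI SIGN NUKTA
    0x1183A: 0x0970,   # ABBREVIATION SIGN ~ DEVANAGARI ABBREVIATION SIGN
}


def cross_script_lookalikes(text: str) -> list:
    """Return (Dogra code point, look-alike code point) pairs found in text."""
    return [(ord(ch), LOOKALIKES[ord(ch)]) for ch in text if ord(ch) in LOOKALIKES]


# KA has a listed look-alike (GURMUKHI LETTER KA); KHA does not.
pairs = cross_script_lookalikes(chr(0x1180A) + chr(0x1180B))
print([(f"U+{a:04X}", f"U+{b:04X}") for a, b in pairs])
```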
5 References

Bhāskarācārya. 1872. Līlavatī. Translation into Dogri. Jammu: Vidya Vilas Press.

Grierson, George A. 1916. The Linguistic Survey of India. Vol. IX. Indo-Aryan Family. Central Group. Part I. Specimens of Western Hindī and Pañjābī. Calcutta: Office of the Superintendent of Government Printing, India.

Leitner, Gottlieb William. 1882. “A Collection of Specimens of Commercial and Other Alphabets and Handwritings as also of Multiplication Tables Current in Various Parts of the Panjab, Sind and the North West Provinces”. Published as Appendix V of History of Indigenous Education in the Punjab. Lahore: Anjuman-i-Punjab Press.

Pandey, Anshuman. 2007. “Proposal to Encode North Indic Number Forms in ISO/IEC 10646” (L2/07-354). http://www.unicode.org/L2/L2007/07354-north-indic.pdf

Pandey, Anshuman. 2009. “Proposal to Encode the Takri Script in ISO/IEC 10646” (L2/09-424). http://www.unicode.org/L2/L2009/09424-takri.pdf

Pandey, Anshuman. 2015. “Preliminary Proposal to Encode the Dogra Script in Unicode” (L2/15-213). http://www.unicode.org/L2/L2015/15213-dogra.pdf

Staal, Fritz. 1984. Stamps of Jammu & Kashmir. New York: The Collectors Club.

von der Lin, Carol A. “Collection Kashmir”. “Philatelic Glossary Dōgrī-Takrī Scripts”. http://www.kashmirstamps.com/DogriGloss.html
6 Acknowledgments

I would like to thank Christopher Shackle (School of Oriental and African Studies, London) for reviewing this proposal.

This project was made possible in part through a Google Research Award, granted to Deborah Anderson for the Script Encoding Initiative, and a grant from the United States National Endowment for the Humanities (PR-50205-15), which funds the Universal Scripts Project (part of the Script Encoding Initiative at the University of California, Berkeley). Any views, findings, conclusions or recommendations expressed in this publication do not necessarily reflect those of Google or the National Endowment for the Humanities.
# Indic_Syllabic_Category=Visarga
11837         ; Visarga            # Mc       DOGRA SIGN VISARGA

# Indic_Syllabic_Category=Nukta
11839         ; Nukta              # Mn       DOGRA SIGN NUKTA

# Indic_Syllabic_Category=Virama
11838         ; Virama             # Mn       DOGRA SIGN VIRAMA

# Indic_Syllabic_Category=Vowel_Independent
11800..11809  ; Vowel_Independent  # Lo  [10] DOGRA LETTER A..DOGRA LETTER AU

# Indic_Syllabic_Category=Vowel_Dependent
1182C..1182E  ; Vowel_Dependent    # Mc   [3] DOGRA VOWEL SIGN AA..DOGRA VOWEL SIGN II
1182F..11835  ; Vowel_Dependent    # Mn   [7] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN AU

# Indic_Syllabic_Category=Consonant
1180A..1182B  ; Consonant          # Lo  [34] DOGRA LETTER KA..DOGRA LETTER RRA
4.4 Positional categories

Positional data for characters, in the format of IndicPositionalCategory.txt:

# Indic_Positional_Category=Right
1182C         ; Right   # Mc       DOGRA VOWEL SIGN AA
1182E         ; Right   # Mc       DOGRA VOWEL SIGN II
11837         ; Right   # Mc       DOGRA SIGN VISARGA

# Indic_Positional_Category=Left
1182D         ; Left    # Mc       DOGRA VOWEL SIGN I

# Indic_Positional_Category=Top
11836         ; Top     # Mn       DOGRA SIGN ANUSVARA
11832..11835  ; Top     # Mn   [4] DOGRA VOWEL SIGN E..DOGRA VOWEL SIGN AU

# Indic_Positional_Category=Bottom
1182F..11831  ; Bottom  # Mn   [3] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN VOCALIC R
11838         ; Bottom  # Mn       DOGRA SIGN VIRAMA
11839         ; Bottom  # Mn       DOGRA SIGN NUKTA
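Property data in the layout of the UCD files gives, on each line, a code point or a first..last range, a semicolon, the property value, and a # comment. A minimal Python sketch of reading such lines into a per-code-point lookup might look like the following. The function name parse_property_lines, the sample text, and its spacing are illustrative assumptions, not part of the proposal.

```python
import re

# Sample lines in the "code_or_range ; value # comment" layout.
# Values come from the positional data above; the spacing is an assumption.
SAMPLE = """\
1182D         ; Left    # Mc       DOGRA VOWEL SIGN I
11832..11835  ; Top     # Mn   [4] DOGRA VOWEL SIGN E..DOGRA VOWEL SIGN AU
1182F..11831  ; Bottom  # Mn   [3] DOGRA VOWEL SIGN U..DOGRA VOWEL SIGN VOCALIC R
"""

LINE_RE = re.compile(r"^([0-9A-F]{4,6})(?:\.\.([0-9A-F]{4,6}))?\s*;\s*(\w+)")


def parse_property_lines(text):
    """Return {code_point: property_value}, expanding first..last ranges."""
    table = {}
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop the trailing comment
        if not line:
            continue
        match = LINE_RE.match(line)
        if match is None:
            raise ValueError(f"unrecognised line: {raw!r}")
        first = int(match.group(1), 16)
        last = int(match.group(2) or match.group(1), 16)
        for cp in range(first, last + 1):
            table[cp] = match.group(3)
    return table


positions = parse_property_lines(SAMPLE)
```

With this lookup, a shaping or validation routine could ask for positions.get(0x1182D) to learn that DOGRA VOWEL SIGN I is placed to the left of its base consonant.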
4.5 Script extensions

The following characters should be extended for usage with the Dogra script in ScriptExtensions.txt:

0964          Po       DEVANAGARI DANDA
0965          Po       DEVANAGARI DOUBLE DANDA
0966..096F    Nd  [10] DEVANAGARI DIGIT ZERO..DEVANAGARI DIGIT NINE
A830..A835    No   [6] NORTH INDIC FRACTION ONE QUARTER..NORTH INDIC FRACTION THREE SIXTEENTHS
A836..A837    So   [2] NORTH INDIC QUARTER MARK..NORTH INDIC PLACEHOLDER MARK
A838          Sc       NORTH INDIC RUPEE MARK
A839          So       NORTH INDIC QUANTITY MARK
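One way to picture the effect of these script extensions: a routine that asks whether a character may occur in Dogra text would accept both the proposed Dogra block and the shared characters listed above. The sketch below hard-codes those ranges. The names DOGRA_BLOCK, SHARED_WITH_DOGRA, and usable_in_dogra are hypothetical, and a real implementation would presumably read ScriptExtensions.txt rather than embed the data.

```python
# Proposed Dogra allocation: 11800..1183A.
DOGRA_BLOCK = range(0x11800, 0x1183B)

# Characters the proposal asks to be extended for use with Dogra.
SHARED_WITH_DOGRA = [
    range(0x0964, 0x0966),  # DEVANAGARI DANDA, DEVANAGARI DOUBLE DANDA
    range(0x0966, 0x0970),  # DEVANAGARI DIGIT ZERO..DEVANAGARI DIGIT NINE
    range(0xA830, 0xA83A),  # NORTH INDIC fractions and marks
]


def usable_in_dogra(ch):
    """True if ch is a proposed Dogra character or one of the shared characters."""
    cp = ord(ch)
    return cp in DOGRA_BLOCK or any(cp in shared for shared in SHARED_WITH_DOGRA)
```

Under this model a Devanagari digit or danda inside a Dogra run would not be treated as a script change, while an ordinary Devanagari letter would.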
4.6 ‘Confusable’ characters

Some Dogra characters that resemble characters encoded in other script blocks are:

DOGRA LETTER U             11187  SHARADA LETTER U
DOGRA LETTER UU            11188  SHARADA LETTER UU
DOGRA LETTER O             11684  TAKRI LETTER U
DOGRA LETTER KA            0A15   GURMUKHI LETTER KA
DOGRA LETTER CA            11196  SHARADA LETTER CA
DOGRA LETTER TTHA          1119C  SHARADA LETTER TTHA
DOGRA LETTER TA            11699  TAKRI LETTER TA
DOGRA LETTER DHA           11680  TAKRI LETTER A
DOGRA LETTER PHA           1169F  TAKRI LETTER PHA
DOGRA LETTER BHA           116A1  TAKRI LETTER BHA
DOGRA LETTER YA            116A3  TAKRI LETTER YA
DOGRA LETTER SHA           0936   DEVANAGARI LETTER SHA
DOGRA LETTER SSA           111B0  SHARADA LETTER SSA
DOGRA VOWEL SIGN U         0941   DEVANAGARI VOWEL SIGN U
DOGRA VOWEL SIGN UU        0942   DEVANAGARI VOWEL SIGN UU
DOGRA VOWEL SIGN E         116B2  TAKRI VOWEL SIGN E
DOGRA VOWEL SIGN AI        116B3  TAKRI VOWEL SIGN AI
DOGRA VOWEL SIGN O         116B4  TAKRI VOWEL SIGN O
DOGRA VOWEL SIGN AU        116B5  TAKRI VOWEL SIGN AU
DOGRA SIGN ANUSVARA        0902   DEVANAGARI SIGN ANUSVARA
DOGRA SIGN VIRAMA          094D   DEVANAGARI SIGN VIRAMA
DOGRA SIGN NUKTA           093C   DEVANAGARI SIGN NUKTA
DOGRA ABBREVIATION SIGN    0970   DEVANAGARI ABBREVIATION SIGN
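A list of look-alike pairs such as this one could be used to flag characters in Dogra text that a reader might mistake for characters of another script. The following is a minimal sketch under stated assumptions. It uses only a subset of the pairs, takes the Dogra code points from the proposed allocation, and is not the confusables data of any Unicode specification. The names LOOKALIKES and lookalike_report are hypothetical.

```python
# Subset of the pairs listed above: proposed Dogra code point -> similar existing character.
LOOKALIKES = {
    0x11804: 0x11187,  # DOGRA LETTER U    ~ SHARADA LETTER U
    0x1180A: 0x0A15,   # DOGRA LETTER KA   ~ GURMUKHI LETTER KA
    0x11819: 0x11699,  # DOGRA LETTER TA   ~ TAKRI LETTER TA
    0x11827: 0x0936,   # DOGRA LETTER SHA  ~ DEVANAGARI LETTER SHA
    0x11838: 0x094D,   # DOGRA SIGN VIRAMA ~ DEVANAGARI SIGN VIRAMA
    0x11839: 0x093C,   # DOGRA SIGN NUKTA  ~ DEVANAGARI SIGN NUKTA
}


def lookalike_report(text):
    """List (index, dogra_cp, similar_cp) for each character with a listed look-alike."""
    return [
        (index, ord(ch), LOOKALIKES[ord(ch)])
        for index, ch in enumerate(text)
        if ord(ch) in LOOKALIKES
    ]
```

The report keeps the original text untouched and only points at positions worth a second look, which seems the safer design for historical material where substitution would alter the record.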
5 References
Bhāskarācārya. 1872. Līlavatī. Translation into Dogri. Jammu: Vidya Vilas Press.
Grierson, George A. 1916. The Linguistic Survey of India. Vol. IX. Indo-Aryan Family. Central Group. Part I. Specimens of Western Hindī and Pañjābī. Calcutta: Office of the Superintendent of Government Printing, India.
Leitner, Gottlieb William. 1882. “A Collection of Specimens of Commercial and Other Alphabets and Handwritings as also of Multiplication Tables Current in Various Parts of the Panjab, Sind and the North West Provinces”. Published as Appendix V of History of Indigenous Education in the Punjab. Lahore: Anjuman-i-Punjab Press.
Pandey, Anshuman. 2007. “Proposal to Encode North Indic Number Forms in ISO/IEC 10646” (L2/07-354). http://www.unicode.org/L2/L2007/07354-north-indic.pdf
Pandey, Anshuman. 2009. “Proposal to Encode the Takri Script in ISO/IEC 10646” (L2/09-424). http://www.unicode.org/L2/L2009/09424-takri.pdf
Pandey, Anshuman. 2015. “Preliminary Proposal to Encode the Dogra Script in Unicode” (L2/15-213). http://www.unicode.org/L2/L2015/15213-dogra.pdf
Staal, Fritz. 1984. Stamps of Jammu & Kashmir. New York: The Collectors Club.
von der Lin, Carol A. “Collection Kashmir”. “Philatelic Glossary Dōgrī-Takrī Scripts”. http://www.kashmirstamps.com/DogriGloss.html
6 Acknowledgments
I would like to thank Christopher Shackle (School of Oriental and African Studies, London) for reviewing this proposal.

This project was made possible in part through a Google Research Award, granted to Deborah Anderson for the Script Encoding Initiative, and a grant from the United States National Endowment for the Humanities (PR-50205-15), which funds the Universal Scripts Project (part of the Script Encoding Initiative at the University of California, Berkeley). Any views, findings, conclusions or recommendations expressed in this publication do not necessarily reflect those of Google or the National Endowment for the Humanities.
Printed using UniBook™ (http://www.unicode.org/unibook/)
Printed: 04-Nov-2015
[Code chart: Dogra, range 11800..1184F. Columns 1180 to 1184, rows 0 to F. Code points 11800..1183A are assigned; 1183B..1184F are unassigned.]
Dogra names list (11800..1183A)

Independent vowels
11800  DOGRA LETTER A
11801  DOGRA LETTER AA
11802  DOGRA LETTER I
11803  DOGRA LETTER II
11804  DOGRA LETTER U
11805  DOGRA LETTER UU
11806  DOGRA LETTER E
11807  DOGRA LETTER AI
11808  DOGRA LETTER O
11809  DOGRA LETTER AU

Consonants
1180A  DOGRA LETTER KA
1180B  DOGRA LETTER KHA
1180C  DOGRA LETTER GA
1180D  DOGRA LETTER GHA
1180E  DOGRA LETTER NGA
1180F  DOGRA LETTER CA
11810  DOGRA LETTER CHA
11811  DOGRA LETTER JA
11812  DOGRA LETTER JHA
11813  DOGRA LETTER NYA
11814  DOGRA LETTER TTA
11815  DOGRA LETTER TTHA
11816  DOGRA LETTER DDA
11817  DOGRA LETTER DDHA
11818  DOGRA LETTER NNA
11819  DOGRA LETTER TA
1181A  DOGRA LETTER THA
1181B  DOGRA LETTER DA
1181C  DOGRA LETTER DHA
1181D  DOGRA LETTER NA
1181E  DOGRA LETTER PA
1181F  DOGRA LETTER PHA
11820  DOGRA LETTER BA
11821  DOGRA LETTER BHA
11822  DOGRA LETTER MA
11823  DOGRA LETTER YA
11824  DOGRA LETTER RA
11825  DOGRA LETTER LA
11826  DOGRA LETTER VA
11827  DOGRA LETTER SHA
11828  DOGRA LETTER SSA
11829  DOGRA LETTER SA
1182A  DOGRA LETTER HA
1182B  DOGRA LETTER RRA

Dependent vowel signs
1182C  DOGRA VOWEL SIGN AA
1182D  DOGRA VOWEL SIGN I
1182E  DOGRA VOWEL SIGN II
1182F  DOGRA VOWEL SIGN U
11830  DOGRA VOWEL SIGN UU
11831  DOGRA VOWEL SIGN VOCALIC R
11832  DOGRA VOWEL SIGN E
11833  DOGRA VOWEL SIGN AI
11834  DOGRA VOWEL SIGN O
11835  DOGRA VOWEL SIGN AU

Various signs
11836  DOGRA SIGN ANUSVARA
11837  DOGRA SIGN VISARGA
11838  DOGRA SIGN VIRAMA
11839  DOGRA SIGN NUKTA

Punctuation
1183A  DOGRA ABBREVIATION SIGN
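Because the names list arranges the repertoire in contiguous groups, the group of any proposed code point can be found from the group boundaries alone. The sketch below illustrates this with a sorted list of starting code points and a binary search. The names GROUPS and dogra_group are hypothetical, and the boundaries are simply read off the names list.

```python
from bisect import bisect_right

# First code point of each group, as given in the names list.
GROUPS = [
    (0x11800, "Independent vowels"),
    (0x1180A, "Consonants"),
    (0x1182C, "Dependent vowel signs"),
    (0x11836, "Various signs"),
    (0x1183A, "Punctuation"),
]
LAST_ASSIGNED = 0x1183A
STARTS = [start for start, _ in GROUPS]


def dogra_group(cp):
    """Return the names-list group of a proposed Dogra code point, or None."""
    if not 0x11800 <= cp <= LAST_ASSIGNED:
        return None
    return GROUPS[bisect_right(STARTS, cp) - 1][1]
```

For example, dogra_group(0x1182B) falls in the consonant range, while any value past 1183A returns None because the rest of the block is unassigned in this proposal.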
Figure 1: Comparison of Old Dogra with Takri and other scripts (from Leitner 1882: Set 2).
Figure 2: Comparison of Old Dogra with Takri and other scripts (from Leitner 1882: Set 2).
Figure 3: Specimen of the Dogri language written in Old Dogra (from Grierson 1916: 760).
Figure 4: Another specimen of the Dogri language written in Old Dogra (from Grierson 1916: 772).
Figure 5: Chart of the Old Dogra script (from Grierson 1916: 641).
Figure 6: Comparison of Gurmukhi, Kangra, and Old Dogra scripts (from Grierson 1916: 642).
Figure 7: Comparison of New Dogra (‘Dogri’) and Devanagari vowel and consonant letters (from Staal 1984: 33).
Figure 8: Comparison of New Dogra (‘Dogri’) and Devanagari vowel signs (from Staal 1984: 34).
Figure 9: The first page of a Dogri translation of the Līlavatī by Bhāskarācārya, a Sanskrit treatise on mathematics, printed in the New Dogra script in 1872 (p. 1). Figure 10 contains the second page of this book.
Figure 10: Another page from the Līlavatī in New Dogra (p. 2).
Figure 11: Page from the Līlavatī showing the use of printed New Dogra digits (p. 3).
Figure 12: Usage of fractions for denoting currency in New Dogra (from Staal 1984: 75). The text appears to be typeset using metal fonts. The fraction signs and currency marks may be represented using characters from the ‘Common Indic Number Forms’ block (U+A830) in Unicode.
Figure 13: Two specimens of a trilingual stamp for expedited mail from Jammu State in Old Dogra, Persian, and Devanagari scripts. The Dogra text reads kakal jarurī (U+1180A U+1180A U+11825, U+11811 U+11824 U+1182F U+11824 U+1182E). The Persian reads خط ضروری ḵẖat zarūrī; the Devanagari reads आवश्यक पत्र āvaśyaka patra.
Figure 14: Four bilingual regular postage stamps from Jammu State with text in the Old Dogra and Persian scripts. Stamps […] and […] are half ‘anna’ stamps and […] is a one ‘anna’ stamp. In all stamps, the Dogra text at the top reads jamu ‘Jammu’ (U+11811 U+11822 U+11804) and kachamīr (= kaśmīr) ‘Kashmir’ (U+1180A U+11810 U+11822 U+1182E U+11824). In stamps […] and […], the first line of the center is Persian ۱۹۲۳ نم انه 1923 [samvat] nim ānah ‘half ānnā’; the second line of the center is Dogra ada ana ‘half ānnā’ (U+11800 U+1181B U+11800 U+1181D). In […], the first line of the center text is Persian یك انه yak ānah ‘one ānnā’; the second line of the center is Dogra ekana ‘one ānnā’ (U+11806 U+1180A U+11800 U+1181D). In all four stamps, the third line of the center is the date 1923 [samvat] written in Devanagari-like digits १९२३. The Persian text at the bottom of the stamp reads قلمرو سرکار جمون و کشمیر qalamrau sarkār jammūn o kaśmīr ‘dominions of the ruler of Jammu and Kashmir’.
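As a sketch of how the stamp text in Figure 13 could be represented with the proposed repertoire, the snippet below assembles the reading kakal jarurī from character names. The code points are those of the proposed allocation, and the letter-by-letter spelling follows the character sequence given in the caption. The table DOGRA and the helper spell are hypothetical names introduced only for this illustration.

```python
# Proposed code points for the characters needed here.
DOGRA = {
    "KA": 0x1180A,
    "JA": 0x11811,
    "RA": 0x11824,
    "LA": 0x11825,
    "SIGN II": 0x1182E,
    "SIGN U": 0x1182F,
}


def spell(*names):
    """Join the named characters into a string, in logical (spoken) order."""
    return "".join(chr(DOGRA[name]) for name in names)


kakal = spell("KA", "KA", "LA")                        # inherent vowels only
jaruri = spell("JA", "RA", "SIGN U", "RA", "SIGN II")  # vowel signs follow their consonant
text = kakal + " " + jaruri

# Show the sequence as U+ code points, which does not depend on font support.
print(" ".join(f"U+{ord(ch):04X}" for ch in text if ch != " "))
```

Each dependent vowel sign is stored after the consonant it modifies, whatever its visual position, which is the usual logical-order convention for Indic scripts in Unicode.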
Detail of top left (Devanagari)
Detail of top right (New Dogra)
Detail of center (New Dogra, Devanagari, and Urdu)
Figure 15: The mast of a two rupee non-judicial stamp paper from Kashmir State.
Figure 16: The mast of a five rupee non-judicial stamp paper from Kashmir State containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.
Figure 17: A one anna stamp from Jammu and Kashmir State with text in New Dogra, Devanagari, and Arabic (Urdu).
Figure 18: Telegraph stamps from Jammu and Kashmir State with text in Dogra, Devanagari, Urdu, and English. The Dogra denominations show usage of Common Indic fractions and the rupee mark.
postakarda kalamrau jammu kasmir va tibbataha: “Postcard of the realm of Jammu, Kashmir and Tibet”
isa tarapha siva pate ke kucha na likho: “Do not write anything on this side except for the address”
jamu, sinagar: “Jammu”, “Srinagar”
bhaisakh (4 ), 4 vaisakh: “Vaisakh 4”, “4 Vaisakh”
Figure 19: A state postcard from Jammu and Kashmir with post mark of 19 April 1891. The inscription at top is in New Dogra and the content is in Persian. The postmarks are in English and Dogra.
One rupee banknote
Twenty rupee banknote
Figure 20: Bank notes from Jammu and Kashmir State containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.
Vowel letters (glyphs identified by code point)

Chamba     Dogra
U+11680    U+11800
U+11681    U+11801
U+11682    U+11802
U+11683    (none)
U+11684    U+11804
U+11685    (none)
U+11686    U+11806
U+11687    U+11807
U+11688    U+11808
U+11689    (none)

Vowel signs (glyphs identified by code point)

Chamba     Dogra
no dependent form
U+116AD    U+1182C
U+116AE    U+1182D
U+116AF    U+1182E
U+116B0    U+1182F
U+116B1    U+11830
U+116B2    U+11832
U+116B3    U+11833
U+116B4    U+11834
U+116B5    U+11835

Table 2: Comparison of Chamba and Dogra vowel letters and signs.
Consonant letters (glyphs identified by code point)

Chamba     Dogra
U+1168A    U+1180A
U+1168B    U+1180B
U+1168C    U+1180C
U+1168D    U+1180D
U+1168E    U+1180E
U+1168F    U+1180F
U+11690    U+11810
U+11691    U+11811
U+11692    U+11812
U+11693    U+11813
U+11694    U+11814
U+11695    U+11815
U+11696    U+11816
U+11697    U+11817
U+11698    U+11818
U+11699    U+11819
U+1169A    U+1181A
U+1169B    U+1181B
U+1169C    U+1181C
U+1169D    U+1181D
U+1169E    U+1181E
U+1169F    U+1181F
U+116A0    U+11820
U+116A1    U+11821
U+116A2    U+11822
U+116A3    U+11823
U+116A4    U+11824
U+116A5    U+11825
U+116A6    U+11826
U+116A7    U+11827
(none)     U+11828
U+116A8    U+11829
U+116A9    U+1182A
U+116AA    U+1182B

Table 3: Comparison of Chamba and Dogra consonant letters.
ISO/IEC JTC 1/SC 2/WG 2
PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646

Please fill all the sections A, B and C below. Please read Principles and Procedures Document (P & P) from http://std.dkuug.dk/JTC1/SC2/WG2/docs/principles.html for guidelines and details before filling this form. Please ensure you are using the latest Form from http://std.dkuug.dk/JTC1/SC2/WG2/docs/summaryform.html. See also http://std.dkuug.dk/JTC1/SC2/WG2/docs/roadmaps.html for latest Roadmaps.

Form number: N4502-F (Original 1994-10-14; Revised 1995-01, 1995-04, 1996-04, 1996-08, 1999-03, 2001-05, 2001-09, 2003-11, 2005-01, 2005-09, 2005-10, 2007-03, 2008-05, 2009-11, 2011-03, 2012-01)

A. Administrative
1. Title: Proposal to encode the Dogra script in Unicode
2. Requester's name: Anshuman Pandey, Script Encoding Initiative
3. Requester type (Member body/Liaison/Individual contribution): Liaison contribution
4. Submission date: 4 November 2015
5. Requester's reference (if applicable):
6. Choose one of the following:
   This is a complete proposal: Yes
   (or) More information will be provided later:

B. Technical – General
1. Choose one of the following:
   a. This proposal is for a new script (set of characters): Yes
      Proposed name of script: Dogra
   b. The proposal is for addition of character(s) to an existing block:
      Name of the existing block:
2. Number of characters in proposal: 59
3. Proposed category (select one from below; see section 2.2 of P&P document):
   A-Contemporary
   B.1-Specialized (small collection)
   B.2-Specialized (large collection)
   C-Major extinct
   D-Attested extinct
   E-Minor extinct: X
   F-Archaic Hieroglyphic or Ideographic
   G-Obscure or questionable usage symbols
4. Is a repertoire including character names provided? Yes
   a. If YES, are the names in accordance with the “character naming guidelines” in Annex L of P&P document?
   b. Are the character shapes attached in a legible form suitable for review?
5. Fonts related:
   a. Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the standard? Anshuman Pandey
   b. Identify the party granting a license for use of the font by the editors (include address, e-mail, ftp-site, etc.): Anshuman Pandey
6. References:
   a. Are references (to other character sets, dictionaries, descriptive texts etc.) provided? Yes
   b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? Yes
7. Special encoding issues: Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration etc. (if yes please enclose information)? Yes
8. Additional Information: Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org for such information on other scripts. Also see Unicode Character Database (http://www.unicode.org/reports/tr44/) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard.

C. Technical - Justification
1. Has this proposal for addition of character(s) been submitted before? No
   If YES, explain:
2. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? Yes
   If YES, with whom? Christopher Shackle (SOAS, London)
   If YES, available relevant documents:
3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? No
   Reference: Historical script, currently not used
4. The context of use for the proposed characters (type of use; common or rare): Common
   Reference:
5. Are the proposed characters in current use by the user community? No
   If YES, where? Reference:
6. After giving due considerations to the principles in the P&P document, must the proposed characters be entirely in the BMP? N/A
   If YES, is a rationale provided? If YES, reference:
7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? Yes
8. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? No
   If YES, is a rationale for its inclusion provided? If YES, reference:
9. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? No
   If YES, is a rationale for its inclusion provided? If YES, reference:
10. Can any of the proposed character(s) be considered to be similar (in appearance or function) to, or could be confused with, an existing character? No
    If YES, is a rationale for its inclusion provided? If YES, reference:
11. Does the proposal include use of combining characters and/or use of composite sequences? Yes
    If YES, is a rationale for such use provided? Yes
    If YES, reference: See text of proposal
    Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided?
    If YES, reference:
12. Does the proposal contain characters with any special properties such as control function or similar semantics? Yes
    If YES, describe in detail (include attachment if necessary): Virama; see text of proposal
13. Does the proposal contain any Ideographic compatibility characters? No
    If YES, are the equivalent corresponding unified ideographic characters identified? If YES, reference:
ISO/IEC JTC 1/SC 2/WG 2
PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS
FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646 (1)

Please fill all the sections A, B and C below.
Please read Principles and Procedures Document (P & P) from http://std.dkuug.dk/JTC1/SC2/WG2/docs/principles.html for guidelines and details before filling this form.
Please ensure you are using the latest Form from http://std.dkuug.dk/JTC1/SC2/WG2/docs/summaryform.html.
See also http://std.dkuug.dk/JTC1/SC2/WG2/docs/roadmaps.html for latest Roadmaps.
A. Administrative
1. Title: Proposal to encode the Dogra script in Unicode
2. Requester's name: Anshuman Pandey (Script Encoding Initiative)
3. Requester type (Member body/Liaison/Individual contribution): Liaison contribution
4. Submission date: 4 November 2015
5. Requester's reference (if applicable):
6. Choose one of the following:
   This is a complete proposal: Yes
   (or) More information will be provided later:
B. Technical – General
1. Choose one of the following:
   a. This proposal is for a new script (set of characters): Yes
      Proposed name of script: Dogra
   b. The proposal is for addition of character(s) to an existing block:
      Name of the existing block:
2. Number of characters in proposal: 59
3. Proposed category (select one from below - see section 2.2 of P&P document):
   A-Contemporary
   B.1-Specialized (small collection)
   B.2-Specialized (large collection)
   C-Major extinct
   D-Attested extinct
   E-Minor extinct: X
   F-Archaic Hieroglyphic or Ideographic
   G-Obscure or questionable usage symbols
4. Is a repertoire including character names provided? Yes
   a. If YES, are the names in accordance with the “character naming guidelines” in Annex L of P&P document?
   b. Are the character shapes attached in a legible form suitable for review?
5. Fonts related:
   a. Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the standard? Anshuman Pandey
   b. Identify the party granting a license for use of the font by the editors (include address, e-mail, ftp-site, etc.): Anshuman Pandey
6. References:
   a. Are references (to other character sets, dictionaries, descriptive texts, etc.) provided? Yes
   b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? Yes
7. Special encoding issues: Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration, etc. (if yes please enclose information)? Yes
8. Additional Information: Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths, etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org for such information on other scripts. Also see Unicode Character Database (http://www.unicode.org/reports/tr44/) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard.

(1) Form number: N4502-F (Original 1994-10-14; Revised 1995-01, 1995-04, 1996-04, 1996-08, 1999-03, 2001-05, 2001-09, 2003-11, 2005-01, 2005-09, 2005-10, 2007-03, 2008-05, 2009-11, 2011-03, 2012-01)
C. Technical - Justification
1. Has this proposal for addition of character(s) been submitted before? No
   If YES, explain:
2. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? Yes
   If YES, with whom? Christopher Shackle (SOAS, London)
   If YES, available relevant documents:
3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? No
   Reference: Historical script, currently not used
4. The context of use for the proposed characters (type of use; common or rare): Common
   Reference:
5. Are the proposed characters in current use by the user community? No
   If YES, where? Reference:
6. After giving due considerations to the principles in the P&P document, must the proposed characters be entirely in the BMP? N/A
   If YES, is a rationale provided? If YES, reference:
7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? Yes
8. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? No
   If YES, is a rationale for its inclusion provided? If YES, reference:
9. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? No
   If YES, is a rationale for its inclusion provided? If YES, reference:
10. Can any of the proposed character(s) be considered to be similar (in appearance or function) to, or could be confused with, an existing character? No
    If YES, is a rationale for its inclusion provided? If YES, reference:
11. Does the proposal include use of combining characters and/or use of composite sequences? Yes
    If YES, is a rationale for such use provided? Yes
    If YES, reference: See text of proposal
    Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided?
    If YES, reference:
12. Does the proposal contain characters with any special properties such as control function or similar semantics? Yes
    If YES, describe in detail (include attachment if necessary): Virama; see text of proposal
13. Does the proposal contain any Ideographic compatibility characters? No
    If YES, are the equivalent corresponding unified ideographic characters identified? If YES, reference:
6 Acknowledgments
I would like to thank Christopher Shackle (School of Oriental and African Studies, London) for reviewing this proposal.
This project was made possible in part through a Google Research Award, granted to Deborah Anderson for the Script Encoding Initiative, and a grant from the United States National Endowment for the Humanities (PR-50205-15), which funds the Universal Scripts Project (part of the Script Encoding Initiative at the University of California, Berkeley). Any views, findings, conclusions, or recommendations expressed in this publication do not necessarily reflect those of Google or the National Endowment for the Humanities.
Printed using UniBook™ (http://www.unicode.org/unibook/)
Printed: 04-Nov-2015
11800 Dogra 1184F
[Code chart: columns 1180 to 1184, rows 0 to F, with characters at code points 11800 through 1183A]
11800 Dogra 1183A

Independent vowels
11800  DOGRA LETTER A
11801  DOGRA LETTER AA
11802  DOGRA LETTER I
11803  DOGRA LETTER II
11804  DOGRA LETTER U
11805  DOGRA LETTER UU
11806  DOGRA LETTER E
11807  DOGRA LETTER AI
11808  DOGRA LETTER O
11809  DOGRA LETTER AU

Consonants
1180A  DOGRA LETTER KA
1180B  DOGRA LETTER KHA
1180C  DOGRA LETTER GA
1180D  DOGRA LETTER GHA
1180E  DOGRA LETTER NGA
1180F  DOGRA LETTER CA
11810  DOGRA LETTER CHA
11811  DOGRA LETTER JA
11812  DOGRA LETTER JHA
11813  DOGRA LETTER NYA
11814  DOGRA LETTER TTA
11815  DOGRA LETTER TTHA
11816  DOGRA LETTER DDA
11817  DOGRA LETTER DDHA
11818  DOGRA LETTER NNA
11819  DOGRA LETTER TA
1181A  DOGRA LETTER THA
1181B  DOGRA LETTER DA
1181C  DOGRA LETTER DHA
1181D  DOGRA LETTER NA
1181E  DOGRA LETTER PA
1181F  DOGRA LETTER PHA
11820  DOGRA LETTER BA
11821  DOGRA LETTER BHA
11822  DOGRA LETTER MA
11823  DOGRA LETTER YA
11824  DOGRA LETTER RA
11825  DOGRA LETTER LA
11826  DOGRA LETTER VA
11827  DOGRA LETTER SHA
11828  DOGRA LETTER SSA
11829  DOGRA LETTER SA
1182A  DOGRA LETTER HA
1182B  DOGRA LETTER RRA

Dependent vowel signs
1182C  DOGRA VOWEL SIGN AA
1182D  DOGRA VOWEL SIGN I
1182E  DOGRA VOWEL SIGN II
1182F  DOGRA VOWEL SIGN U
11830  DOGRA VOWEL SIGN UU
11831  DOGRA VOWEL SIGN VOCALIC R
11832  DOGRA VOWEL SIGN E
11833  DOGRA VOWEL SIGN AI
11834  DOGRA VOWEL SIGN O
11835  DOGRA VOWEL SIGN AU

Various signs
11836  DOGRA SIGN ANUSVARA
11837  DOGRA SIGN VISARGA
11838  DOGRA SIGN VIRAMA
11839  DOGRA SIGN NUKTA

Punctuation
1183A  DOGRA ABBREVIATION SIGN
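The names list above is regular enough to be rebuilt from short lists, which gives a quick consistency check against the figure of 59 characters stated in the summary form. This is a minimal sketch under the assumption that the proposed characters occupy consecutive code points from 11800 in the order listed. The function name `build_repertoire` is hypothetical, and the names are those of this proposal, which may differ from what is eventually published in the standard.

```python
# Name components copied from the proposal's names list, in code point order.
VOWELS = "A AA I II U UU E AI O AU".split()
CONSONANTS = (
    "KA KHA GA GHA NGA CA CHA JA JHA NYA TTA TTHA DDA DDHA NNA "
    "TA THA DA DHA NA PA PHA BA BHA MA YA RA LA VA SHA SSA SA HA RRA"
).split()
VOWEL_SIGNS = ["AA", "I", "II", "U", "UU", "VOCALIC R", "E", "AI", "O", "AU"]
SIGNS = ["SIGN ANUSVARA", "SIGN VISARGA", "SIGN VIRAMA", "SIGN NUKTA",
         "ABBREVIATION SIGN"]


def build_repertoire(start: int = 0x11800) -> dict:
    """Map each proposed code point to its proposed character name.

    Assumes the characters are contiguous from `start`, as in the names list.
    """
    names = (
        [f"DOGRA LETTER {v}" for v in VOWELS]
        + [f"DOGRA LETTER {c}" for c in CONSONANTS]
        + [f"DOGRA VOWEL SIGN {s}" for s in VOWEL_SIGNS]
        + [f"DOGRA {s}" for s in SIGNS]
    )
    return {start + i: name for i, name in enumerate(names)}


if __name__ == "__main__":
    repertoire = build_repertoire()
    print(len(repertoire), "characters")
    for cp, name in repertoire.items():
        print(f"{cp:05X}  {name}")
```

Building the table this way makes it easy to confirm that the last entry falls on 1183A and that the total matches the count given in section B of the form.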
Figure 1: Comparison of Old Dogra with Takri and other scripts (from Leitner 1882: Set 2).
Figure 2: Comparison of Old Dogra with Takri and other scripts (from Leitner 1882: Set 2).
Figure 3: Specimen of the Dogri language written in Old Dogra (from Grierson 1916: 760).
Figure 4: Another specimen of the Dogri language written in Old Dogra (from Grierson 1916: 772).
Figure 5: Chart of the Old Dogra script (from Grierson 1916: 641).
Figure 6: Comparison of Gurmukhi, Kangra, and Old Dogra scripts (from Grierson 1916: 642).
Figure 7: Comparison of New Dogra (‘Dogri’) and Devanagari vowel and consonant letters (from Staal 1984: 33).
Figure 8: Comparison of New Dogra (‘Dogri’) and Devanagari vowel signs (from Staal 1984: 34).
Figure 9: The first page of a Dogri translation of the Līlavatī by Bhāskarācārya, a Sanskrit treatise on mathematics, printed in the New Dogra script in 1872 (p. 1). Figure 10 contains the second page of this book.
Figure 10: Another page from the Līlavatī in New Dogra (p. 2).
Figure 11: Page from the Līlavatī showing the use of printed New Dogra digits (p. 3).
Figure 12: Usage of fractions for denoting currency in New Dogra (from Staal 1984: 75). The text appears to be typeset using metal fonts. The fraction signs and currency marks may be represented using characters from the ‘Common Indic Number Forms’ block (U+A830) in Unicode.
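The characters referred to in the caption of Figure 12 can be inspected with a short script. The sketch below assumes Python's standard `unicodedata` module. It also assumes that the relevant characters are the ten code points starting at U+A830; the caption names only the start of the block, so that range is an assumption of this example rather than a statement from the proposal.

```python
import unicodedata

BLOCK_START = 0xA830  # start of 'Common Indic Number Forms', as cited in the caption
ASSUMED_COUNT = 10    # assumption: ten assigned characters, U+A830 through U+A839


def describe(cp: int):
    """Return (code point, character name, numeric value or None) for one code point."""
    ch = chr(cp)
    name = unicodedata.name(ch, "<unnamed>")
    value = unicodedata.numeric(ch, None)  # None for marks that carry no numeric value
    return cp, name, value


def list_number_forms():
    """Describe every code point in the assumed range."""
    return [describe(cp) for cp in range(BLOCK_START, BLOCK_START + ASSUMED_COUNT)]


if __name__ == "__main__":
    for cp, name, value in list_number_forms():
        print(f"U+{cp:04X}  {name}  {value}")
```

Entries with a numeric value are fraction signs, while entries whose value is `None` are marks such as the rupee mark.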
Figure 13: Two specimens of a trilingual stamp for expedited mail from Jammu State in Old Dogra, Persian, and Devanagari scripts. The Dogra text reads U+1180A U+1180A U+11825, U+11811 U+11824 U+1182F U+11824 U+1182E kakal jarurī. The Persian reads خط ضروری ḵẖat zarūrī; the Devanagari reads आवश्यक पत्र āvaśyaka patra.
Figure 14: Four bilingual regular postage stamps from Jammu State with text in the Old Dogra and Persian scripts. Stamps and are half ‘anna’ stamps and is a one ‘anna’ stamp. In all stamps the Dogra text at the top reads U+11811 U+11822 U+11804 jamu ‘Jammu’ and U+1180A U+11810 U+11822 U+1182E U+11824 kachamīr (= kaśmīr) ‘Kashmir’. In stamps and the first line of the center is Persian ۱۹۲۳ نم انه 1923 [samvat] nim ānah ‘half ānnā’; the second line of the center is Dogra U+11800 U+1181B U+11800 U+1181D ada ana ‘half ānnā’. In the first line of the center text is Persian یك انه yak ānah ‘one ānnā’; the second line of the center is Dogra U+11806 U+1180A U+11800 U+1181D ekana ‘one ānnā’. In all four stamps the third line of the center is the date 1923 [samvat] written in Devanagari-like digits १९२३. The Persian text at the bottom of the stamp reads قلمرو سرکار جمون و کشمیر qalamrau sarkār jammūn o kaśmīr ‘dominions of the ruler of Jammu and Kashmir’.
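The Dogra readings quoted in the captions of Figures 13 and 14 can be spelled out letter by letter under the proposed code point assignments. This is a small illustrative sketch: the code point sequences are the ones given for those words in the captions, the names come from the proposal's names list, and the helper `spell` is hypothetical.

```python
# Names from the proposal's names list, limited to the code points used below.
NAMES = {
    0x11800: "LETTER A",
    0x11804: "LETTER U",
    0x11806: "LETTER E",
    0x1180A: "LETTER KA",
    0x11810: "LETTER CHA",
    0x11811: "LETTER JA",
    0x1181B: "LETTER DA",
    0x1181D: "LETTER NA",
    0x11822: "LETTER MA",
    0x11824: "LETTER RA",
    0x11825: "LETTER LA",
    0x1182E: "VOWEL SIGN II",
    0x1182F: "VOWEL SIGN U",
}

# Code point sequences for the words cited in the captions, keyed by the
# transliteration given there.
WORDS = {
    "kakal": [0x1180A, 0x1180A, 0x11825],
    "jarurī": [0x11811, 0x11824, 0x1182F, 0x11824, 0x1182E],
    "jamu": [0x11811, 0x11822, 0x11804],
    "kachamīr": [0x1180A, 0x11810, 0x11822, 0x1182E, 0x11824],
    "ada ana": [0x11800, 0x1181B, 0x11800, 0x1181D],
    "ekana": [0x11806, 0x1180A, 0x11800, 0x1181D],
}


def spell(code_points):
    """Return the proposed character names for a sequence of code points."""
    return [NAMES[cp] for cp in code_points]


if __name__ == "__main__":
    for word, cps in WORDS.items():
        print(word, "=", " + ".join(spell(cps)))
```

Spelling the words out this way shows, for example, that the reading jamu ends in the independent letter U rather than a dependent vowel sign.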
Detail of top left (Devanagari)
Detail of top right (New Dogra)
Detail of center (New Dogra, Devanagari, and Urdu)
Figure 15: The mast of a two rupee non-judicial stamp paper from Kashmir State.
Figure 16: The mast of a five rupee non-judicial stamp paper from Kashmir state containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.
Figure 17: A one anna stamp from Jammu and Kashmir state with text in New Dogra, Devanagari, and Arabic (Urdu).
Figure 18: Telegraph stamps from Jammu and Kashmir state with text in Dogra, Devanagari, Urdu, and English. The Dogra denominations show usage of Common Indic fractions and the rupee mark.
Proposal to encode the Dogra script in Unicode Anshuman Pandey
postakarda kalamrau jammu kasmir va tibbatahaldquoPostcard of the realm of Jammu Kashmir and Tibetrdquo
isa tarapha siva pate ke kucha na likholdquoDo no write anything on this side except for the addressrdquo
jamu sinagarldquoJammurdquo ldquoSrinagarrdquo
bhaisakh (4 ) 4 vaisakhldquoVaisakh 4rdquo ldquo4 Vaisakhrdquo
Figure 19 A state postcard from Jammu and Kashmir with post mark of 19 April 1891 Theinscription at top is in New Dogra and the content is in Persian The postmarks are in English andDogra
33
Proposal to encode the Dogra script in Unicode Anshuman Pandey
One rupee banknote
Twenty rupee banknote
Figure 20 Bank notes from Jammu and Kashmir State containing text in New Dogra Devanagariand Arabic (Persian) scripts
34
Proposal to encode the Dogra script in Unicode Anshuman Pandey
Chamba Dogra
71296 71680
71297 71681
71298 71682
71299 mdash
71300 71684
71301 mdash
71302 71686
71303 71687
71304 71688
71305 mdash
Chamba Dogra
no dependent form
71341 71724
71342 71725
71343 71726
71344 71727
71345 71728
71346 71730
71347 71731
71348 71732
71349 71733
Table 2 Comparison of Chamba and Dogra vowel letters and signs
35
Proposal to encode the Dogra script in Unicode Anshuman Pandey
Chamba Dogra
71306 71690
71307 71691
71308 71692
71309 71693
71310 71694
71311 71695
71312 71696
71313 71697
71314 71698
71315 71699
71316 71700
71317 71701
71318 71702
71319 71703
71320 71704
71321 71705
71322 71706
Chamba Dogra
71323 7170771324 7170871325 7170971326 7171071327 7171171328 7171271329 7171371330 7171471331 7171571332 7171671333 7171771334 7171871335 71719mdash 7172071336 7172171337 7172271338 71723
Table 3 Comparison of Chamba and Dogra consonant letters
36
ISOIEC JTC 1SC 2WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS
FOR ADDITIONS TO THE REPERTOIRE OF ISOIEC 10646TP
1PT
Please fill all the sections A B and C below Please read Principles and Procedures Document (P amp P) from HTUhttpstddkuugdkJTC1SC2WG2docsprincipleshtml UTH for
guidelines and details before filling this form Please ensure you are using the latest Form from HTUhttpstddkuugdkJTC1SC2WG2docssummaryformhtml UTH
See also HTUhttpstddkuugdkJTC1SC2WG2docsroadmapshtml UTH for latest Roadmaps
A Administrative
1 Title Proposal to encode the Dogra script in Unicode 2 Requesters name Anshuman Pandey Script Encoding Initiative 3 Requester type (Member bodyLiaisonIndividual contribution) Liaison contribution 4 Submission date 4 November 2015 5 Requesters reference (if applicable) 6 Choose one of the following This is a complete proposal Yes (or) More information will be provided later
B Technical ndash General 1 Choose one of the following a This proposal is for a new script (set of characters) Yes Proposed name of script Dogra b The proposal is for addition of character(s) to an existing block Name of the existing block
2 Number of characters in proposal 59
3. Proposed category (select one from below - see section 2.2 of P&P document):
   A-Contemporary
   B.1-Specialized (small collection)
   B.2-Specialized (large collection)
   C-Major extinct
   D-Attested extinct
   E-Minor extinct: X
   F-Archaic Hieroglyphic or Ideographic
   G-Obscure or questionable usage symbols
4. Is a repertoire including character names provided? Yes
   a. If YES, are the names in accordance with the "character naming guidelines" in Annex L of P&P document?
   b. Are the character shapes attached in a legible form suitable for review?
5. Fonts related:
   a. Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the standard? Anshuman Pandey
   b. Identify the party granting a license for use of the font by the editors (include address, e-mail, ftp-site, etc.): Anshuman Pandey
6. References:
   a. Are references (to other character sets, dictionaries, descriptive texts, etc.) provided? Yes
   b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? Yes
7. Special encoding issues: Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration, etc. (if yes, please enclose information)? Yes
8. Additional Information: Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths, etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org for such information on other scripts. Also see Unicode Character Database (http://www.unicode.org/reports/tr44/) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard.

1 Form number: N4502-F (Original 1994-10-14; Revised 1995-01, 1995-04, 1996-04, 1996-08, 1999-03, 2001-05, 2001-09, 2003-11, 2005-01, 2005-09, 2005-10, 2007-03, 2008-05, 2009-11, 2011-03, 2012-01)
C. Technical - Justification
1. Has this proposal for addition of character(s) been submitted before? No
   If YES, explain:
2. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? Yes
   If YES, with whom? Christopher Shackle (SOAS, London)
   If YES, available relevant documents:
3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? No
   Reference: Historical script, currently not used
4. The context of use for the proposed characters (type of use; common or rare): Common
   Reference:
5. Are the proposed characters in current use by the user community? No
   If YES, where? Reference:
6. After giving due considerations to the principles in the P&P document, must the proposed characters be entirely in the BMP? N/A
   If YES, is a rationale provided? If YES, reference:
7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? Yes
8. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? No
   If YES, is a rationale for its inclusion provided? If YES, reference:
9. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? No
   If YES, is a rationale for its inclusion provided? If YES, reference:
10. Can any of the proposed character(s) be considered to be similar (in appearance or function) to, or could be confused with, an existing character? No
   If YES, is a rationale for its inclusion provided? If YES, reference:
11. Does the proposal include use of combining characters and/or use of composite sequences? Yes
   If YES, is a rationale for such use provided? Yes
   If YES, reference: See text of proposal
   Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided?
   If YES, reference:
12. Does the proposal contain characters with any special properties such as control function or similar semantics? Yes
   If YES, describe in detail (include attachment if necessary): Virama; see text of proposal
13. Does the proposal contain any Ideographic compatibility characters? No
   If YES, are the equivalent corresponding unified ideographic characters identified? If YES, reference:
Printed using UniBook™ (http://www.unicode.org/unibook/)
Printed: 04-Nov-2015
11800    Dogra    1183A

Independent vowels
11800  DOGRA LETTER A
11801  DOGRA LETTER AA
11802  DOGRA LETTER I
11803  DOGRA LETTER II
11804  DOGRA LETTER U
11805  DOGRA LETTER UU
11806  DOGRA LETTER E
11807  DOGRA LETTER AI
11808  DOGRA LETTER O
11809  DOGRA LETTER AU

Consonants
1180A  DOGRA LETTER KA
1180B  DOGRA LETTER KHA
1180C  DOGRA LETTER GA
1180D  DOGRA LETTER GHA
1180E  DOGRA LETTER NGA
1180F  DOGRA LETTER CA
11810  DOGRA LETTER CHA
11811  DOGRA LETTER JA
11812  DOGRA LETTER JHA
11813  DOGRA LETTER NYA
11814  DOGRA LETTER TTA
11815  DOGRA LETTER TTHA
11816  DOGRA LETTER DDA
11817  DOGRA LETTER DDHA
11818  DOGRA LETTER NNA
11819  DOGRA LETTER TA
1181A  DOGRA LETTER THA
1181B  DOGRA LETTER DA
1181C  DOGRA LETTER DHA
1181D  DOGRA LETTER NA
1181E  DOGRA LETTER PA
1181F  DOGRA LETTER PHA
11820  DOGRA LETTER BA
11821  DOGRA LETTER BHA
11822  DOGRA LETTER MA
11823  DOGRA LETTER YA
11824  DOGRA LETTER RA
11825  DOGRA LETTER LA
11826  DOGRA LETTER VA
11827  DOGRA LETTER SHA
11828  DOGRA LETTER SSA
11829  DOGRA LETTER SA
1182A  DOGRA LETTER HA
1182B  DOGRA LETTER RRA

Dependent vowel signs
1182C  DOGRA VOWEL SIGN AA
1182D  DOGRA VOWEL SIGN I
1182E  DOGRA VOWEL SIGN II
1182F  DOGRA VOWEL SIGN U
11830  DOGRA VOWEL SIGN UU
11831  DOGRA VOWEL SIGN VOCALIC R
11832  DOGRA VOWEL SIGN E
11833  DOGRA VOWEL SIGN AI
11834  DOGRA VOWEL SIGN O
11835  DOGRA VOWEL SIGN AU

Various signs
11836  DOGRA SIGN ANUSVARA
11837  DOGRA SIGN VISARGA
11838  DOGRA SIGN VIRAMA
11839  DOGRA SIGN NUKTA

Punctuation
1183A  DOGRA ABBREVIATION SIGN
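The allocation in the names list can be cross-checked against the total of 59 characters stated in the summary form. The following is a minimal sketch in Python, assuming the inclusive ranges are read off the section headings of the names list as proposed in this document; the dictionary and the helper names (`PROPOSED_RANGES`, `count_code_points`, `is_contiguous`) are hypothetical and are not part of the proposal.

```python
# Hypothetical summary of the proposed allocation, keyed by the section
# headings of the names list. Each range is (first, last), inclusive.
PROPOSED_RANGES = {
    "Independent vowels": (0x11800, 0x11809),
    "Consonants": (0x1180A, 0x1182B),
    "Dependent vowel signs": (0x1182C, 0x11835),
    "Various signs": (0x11836, 0x11839),
    "Punctuation": (0x1183A, 0x1183A),
}


def count_code_points(ranges):
    """Count the code points covered by all inclusive ranges."""
    return sum(last - first + 1 for first, last in ranges.values())


def is_contiguous(ranges):
    """Check that the ranges, sorted by start, leave no gaps between them."""
    ordered = sorted(ranges.values())
    return all(
        prev_last + 1 == next_first
        for (_, prev_last), (next_first, _) in zip(ordered, ordered[1:])
    )


total = count_code_points(PROPOSED_RANGES)
print(f"{total} characters, contiguous: {is_contiguous(PROPOSED_RANGES)}")
```

Under these assumptions the sections add up to 10 + 34 + 10 + 4 + 1 characters and form a single unbroken run from 11800 to 1183A, which agrees with the answer to item C.7 that the characters should be kept together in a contiguous range.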
Figure 1: Comparison of Old Dogra with Takri and other scripts (from Leitner 1882: Set 2).
Figure 2: Comparison of Old Dogra with Takri and other scripts (from Leitner 1882: Set 2).
Figure 3: Specimen of the Dogri language written in Old Dogra (from Grierson 1916: 760).
Figure 4: Another specimen of the Dogri language written in Old Dogra (from Grierson 1916: 772).
Figure 5: Chart of the Old Dogra script (from Grierson 1916: 641).
Figure 6: Comparison of Gurmukhi, Kangra, and Old Dogra scripts (from Grierson 1916: 642).
Figure 7: Comparison of New Dogra ('Dogri') and Devanagari vowel and consonant letters (from Staal 1984: 33).
Figure 8: Comparison of New Dogra ('Dogri') and Devanagari vowel signs (from Staal 1984: 34).
Figure 9: The first page of a Dogri translation of the Līlavatī, a Sanskrit treatise on mathematics by Bhāskarācārya, printed in the New Dogra script in 1872 (p. 1). Figure 10 contains the second page of this book.
Figure 10: Another page from the Līlavatī in New Dogra (p. 2).
Figure 11: Page from the Līlavatī showing the use of printed New Dogra digits (p. 3).
Figure 12: Usage of fractions for denoting currency in New Dogra (from Staal 1984: 75). The text appears to be typeset using metal fonts. The fraction signs and currency marks may be represented using characters from the 'Common Indic Number Forms' block (U+A830) in Unicode.
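The characters available in the 'Common Indic Number Forms' block can be listed with Python's standard `unicodedata` module. This is only an illustrative sketch: the block start U+A830 comes from the caption, while the block end U+A83F is an assumption taken from the Unicode block listing rather than from this document, and `assigned_characters` is a hypothetical helper name.

```python
import unicodedata

# U+A830 is the block start cited in the caption; U+A83F as the block end
# is an assumption taken from the Unicode block listing.
BLOCK_START, BLOCK_END = 0xA830, 0xA83F


def assigned_characters(start, end):
    """Return (code point, name) pairs for named characters in the range."""
    result = []
    for cp in range(start, end + 1):
        name = unicodedata.name(chr(cp), None)  # None for unassigned code points
        if name is not None:
            result.append((cp, name))
    return result


for cp, name in assigned_characters(BLOCK_START, BLOCK_END):
    print(f"U+{cp:04X}  {name}")
```

The listing includes the fraction signs and the rupee mark, which are the kinds of signs shown in the figure.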
Figure 13: Two specimens of a trilingual stamp for expedited mail from Jammu State in Old Dogra, Persian, and Devanagari scripts. The Dogra text reads <71690 71690 71717> <71697 71716 71727 71716 71726> kakal jarurī. The Persian reads خط ضروری ḵẖat zarūrī; the Devanagari reads आवश्यक पत्र āvaśyaka patra.
Figure 14: Four bilingual regular postage stamps from Jammu State with text in the Old Dogra and Persian scripts. Stamps […] and […] are half 'anna' stamps and […] is a one 'anna' stamp. In all stamps the Dogra text at the top reads <71697 71714 71684> jamu 'Jammu' and <71690 71696 71714 71726 71716> kachamīr (= kaśmīr) 'Kashmir'. In stamps […] and […] the first line of the center is Persian ۱۹۲۳ نم انه 1923 [samvat] nim ānah 'half ānnā'; the second line of the center is Dogra <71680 71707 71680 71709> ada ana 'half ānnā'. In […] the first line of the center text is Persian یك انه yak ānah 'one ānnā'; the second line of the center is Dogra <71686 71690 71680 71709> ekana 'one ānnā'. In all four stamps the third line of the center is the date 1923 [samvat] written in Devanagari-like digits १९२३. The Persian text at the bottom of the stamp reads قلمرو سرکار جمون و کشمیر qalamrau sarkār jammūn o kaśmīr 'dominions of the ruler of Jammu and Kashmir'.
Detail of top left (Devanagari)
Detail of top right (New Dogra)
Detail of center (New Dogra, Devanagari, and Urdu)
Figure 15: The mast of a two rupee non-judicial stamp paper from Kashmir State.
Figure 16: The mast of a five rupee non-judicial stamp paper from Kashmir State containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.
Figure 17: A one anna stamp from Jammu and Kashmir State with text in New Dogra, Devanagari, and Arabic (Urdu).
Figure 18: Telegraph stamps from Jammu and Kashmir State with text in Dogra, Devanagari, Urdu, and English. The Dogra denominations show usage of Common Indic fractions and the rupee mark.
postakarda kalamrau jammu kasmir va tibbataha "Postcard of the realm of Jammu, Kashmir, and Tibet"
isa tarapha siva pate ke kucha na likho "Do not write anything on this side except for the address"
jamu "Jammu"; sinagar "Srinagar"
bhaisakh (4) "Vaisakh 4"; 4 vaisakh "4 Vaisakh"
Figure 19: A state postcard from Jammu and Kashmir with post mark of 19 April 1891. The inscription at top is in New Dogra and the content is in Persian. The postmarks are in English and Dogra.
One rupee banknote
Twenty rupee banknote
Figure 20: Bank notes from Jammu and Kashmir State containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.
Vowel letters
Chamba  Dogra
71296   71680
71297   71681
71298   71682
71299   (none)
71300   71684
71301   (none)
71302   71686
71303   71687
71304   71688
71305   (none)

Vowel signs
Chamba  Dogra
(no dependent form)
71341   71724
71342   71725
71343   71726
71344   71727
71345   71728
71346   71730
71347   71731
71348   71732
71349   71733

Table 2: Comparison of Chamba and Dogra vowel letters and signs.
Chamba  Dogra
71306   71690
71307   71691
71308   71692
71309   71693
71310   71694
71311   71695
71312   71696
71313   71697
71314   71698
71315   71699
71316   71700
71317   71701
71318   71702
71319   71703
71320   71704
71321   71705
71322   71706
71323   71707
71324   71708
71325   71709
71326   71710
71327   71711
71328   71712
71329   71713
71330   71714
71331   71715
71332   71716
71333   71717
71334   71718
71335   71719
(none)  71720
71336   71721
71337   71722
71338   71723

Table 3: Comparison of Chamba and Dogra consonant letters.
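The numeric values in Tables 2 and 3 appear to be decimal code point values standing in for glyphs that did not survive in this copy. For example, 71681 is decimal for 11801, which the names list assigns to DOGRA LETTER AA. The sketch below, in Python, converts a few such values to the usual U+ notation. The reading of the 'Chamba' column as code points in the existing 'Takri' block is an assumption based on that block starting at U+11680, and `to_u_plus` and `SAMPLE_ROWS` are hypothetical names used only for illustration.

```python
def to_u_plus(decimal_value):
    """Format a decimal code point value in U+ hexadecimal notation."""
    return f"U+{decimal_value:04X}"


# A few (Chamba, Dogra) pairs copied from Tables 2 and 3.
SAMPLE_ROWS = [
    (71296, 71680),  # first row of the vowel letters
    (71341, 71724),  # first row of the vowel signs
    (71306, 71690),  # first row of the consonant letters
]

for chamba, dogra in SAMPLE_ROWS:
    print(f"{chamba} -> {to_u_plus(chamba)}    {dogra} -> {to_u_plus(dogra)}")
```

Read this way, each row pairs a character of the 'Takri' block with the Dogra code point proposed for the corresponding letter or sign.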
Proposal to encode the Dogra script in Unicode Anshuman Pandey
Figure 2 Comparison of Old Dogra with Takri and other scripts (from Leitner 1882 Set 2)
19
Proposal to encode the Dogra script in Unicode Anshuman Pandey
Figure 3 Specimen of the Dogri language written in Old Dogra (from Grierson 1916 760)
20
Proposal to encode the Dogra script in Unicode Anshuman Pandey
Figure 4 Another specimen of the Dogri language written in Old Dogra (fromGrierson 1916 772)
21
Proposal to encode the Dogra script in Unicode Anshuman Pandey
Figure 5 Chart of the Old Dogra script (from Grierson 1916 641)
22
Proposal to encode the Dogra script in Unicode Anshuman Pandey
Figure 6 Comparison of Gurmukhi Kangra and Old Dogra scripts (from Grierson 1916 642)
23
Proposal to encode the Dogra script in Unicode Anshuman Pandey
Figure 7 Comparison of New Dogra (lsquoDogrirsquo) and Devanagari vowel and consonant letters (fromStaal 1984 33)
Figure 8 Comparison of New Dogra (lsquoDogrirsquo) and Devanagari vowel signs (from Staal 1984 34)
24
Proposal to encode the Dogra script in Unicode Anshuman Pandey
Figure 9 The first page of a Dogri translation of the Līlavatī by Bhāskarācārya a Sanskrit treastiseon mathematics printed in the New Dogra script in 1872 (p 1) Figure 10 contains the second pageof this book
25
Proposal to encode the Dogra script in Unicode Anshuman Pandey
Figure 10 Another page from the Līlavatī in New Dogra (p 2)
26
Proposal to encode the Dogra script in Unicode Anshuman Pandey
Figure 11 Page from the Līlavatī showing the use of printed New Dogra digits (p 3)
27
Proposal to encode the Dogra script in Unicode Anshuman Pandey
Figure 12 Usage of fractions for denoting currency in New Dogra (from Staal 1984 75) The textappears to be typeset using metal fonts The fraction signs are currency marks may be representedusing characters from the lsquoCommon Indic Number Formsrsquo block (U+A830) in Unicode
28
Proposal to encode the Dogra script in Unicode Anshuman Pandey
Figure 13 Two specimens of a trilingual stamp for expedited mail from Jammu State in Old DograPersian and Devanagari scripts The Dogra text reads 716907169071717 7169771716717277171671726 kakal jarurī The Persian readsضروری خط ḵẖat zarūrī the Devanagari reads āvaśyaka patraआव यक प
Figure 14 Four bilingual regular postage stamps from Jammu State with text in the Old Dogra andPersian scripts Stamps and are half lsquoannarsquo stamps and is a one lsquoannarsquo stamp In all stampsthe Dogra text at the top reads 716977171471684 jamu lsquoJammursquo and 7169071696717147172671716 kachamīr (= kaśmīr) lsquoKashmirrsquoIn stamps and the first line of the center is Persian انه نم ۱۹۲۳ 1923 [samvat] nim ānah lsquohalfānnārsquo the second line of the center is Dogra71680717077168071709 ada ana lsquohalf ānnārsquo In the first line of thecenter text is Persian انه یك yak ānah lsquoone ānnārsquo the second line of the center is Dogra71686716907168071709 ekana lsquoone ānnārsquo In all four stamps the third line of the center is the date 1923 [samvat] written inDevanagari-like digits १९२३ The Persian text at the bottom of the stamp reads جمون سرکار قلمروکشمیر و qalamrau sarkār jammūn o kaśmīr lsquodominions of the ruler of Jammu and Kashmirrsquo
29
Proposal to encode the Dogra script in Unicode Anshuman Pandey
Detail of top left (Devanagari)
Detail of top right (New Dogra)
Detail of center (New Dogra Devanagari and Urdu)
Figure 15 The mast of a two rupee non-judicial stamp paper from Kashmir State
30
Proposal to encode the Dogra script in Unicode Anshuman Pandey
Figure 16 The mast of a five rupee non-judicial stamp paper from Kashmir state containing text inNew Dogra Devanagari and Arabic (Persian) scripts
Figure 17 A one anna stamp from Jammu and Kashmir state with text in New Dogra Devanagariand Arabic (Urdu)
31
Proposal to encode the Dogra script in Unicode Anshuman Pandey
Figure 18 Telegraph stamps from Jammu and Kashmir state with text in Dogra Devanagari Urduand English The Dogra denominations show usage of Common Indic fractions and the rupee mark
postakarda kalamrau jammu kasmir va tibbataha “Postcard of the realm of Jammu, Kashmir, and Tibet”
isa tarapha siva pate ke kucha na likho “Do not write anything on this side except for the address”
jamu “Jammu”; sinagar “Srinagar”
bhaisakh (4 ) “Vaisakh 4”; 4 vaisakh “4 Vaisakh”
Figure 19: A state postcard from Jammu and Kashmir with a postmark of 19 April 1891. The inscription at top is in New Dogra and the content is in Persian. The postmarks are in English and Dogra.
One rupee banknote
Twenty rupee banknote
Figure 20: Banknotes from Jammu and Kashmir State containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.
Vowel letters
Chamba     Dogra
U+11680    U+11800
U+11681    U+11801
U+11682    U+11802
U+11683    (none)
U+11684    U+11804
U+11685    (none)
U+11686    U+11806
U+11687    U+11807
U+11688    U+11808
U+11689    (none)

Vowel signs
Chamba     Dogra
no dependent form
U+116AD    U+1182C
U+116AE    U+1182D
U+116AF    U+1182E
U+116B0    U+1182F
U+116B1    U+11830
U+116B2    U+11832
U+116B3    U+11833
U+116B4    U+11834
U+116B5    U+11835

Table 2: Comparison of Chamba and Dogra vowel letters and signs.
Chamba     Dogra
U+1168A    U+1180A
U+1168B    U+1180B
U+1168C    U+1180C
U+1168D    U+1180D
U+1168E    U+1180E
U+1168F    U+1180F
U+11690    U+11810
U+11691    U+11811
U+11692    U+11812
U+11693    U+11813
U+11694    U+11814
U+11695    U+11815
U+11696    U+11816
U+11697    U+11817
U+11698    U+11818
U+11699    U+11819
U+1169A    U+1181A
U+1169B    U+1181B
U+1169C    U+1181C
U+1169D    U+1181D
U+1169E    U+1181E
U+1169F    U+1181F
U+116A0    U+11820
U+116A1    U+11821
U+116A2    U+11822
U+116A3    U+11823
U+116A4    U+11824
U+116A5    U+11825
U+116A6    U+11826
U+116A7    U+11827
(none)     U+11828
U+116A8    U+11829
U+116A9    U+1182A
U+116AA    U+1182B

Table 3: Comparison of Chamba and Dogra consonant letters.
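The correspondences in Tables 2 and 3 do not follow a single fixed offset between the two ranges, so converting Chamba-encoded text to Dogra would need an explicit lookup rather than arithmetic. The following is a minimal Python sketch of that idea, assuming the values in the tables are read as code points (Chamba in the ‘Takri’ block, Dogra as given in this proposal). The names CHAMBA_TO_DOGRA, chamba_to_dogra and offsets are hypothetical, and only a few rows are included for illustration.

```python
# Hypothetical lookup built from a few rows of Tables 2 and 3.
# Keys are Chamba ('Takri' block) code points; values are the Dogra
# code points listed beside them in the tables.
CHAMBA_TO_DOGRA = {
    71296: 71680,  # U+11680 -> U+11800, first vowel letter row
    71306: 71690,  # U+1168A -> U+1180A, first consonant row
    71335: 71719,  # U+116A7 -> U+11827
    71336: 71721,  # U+116A8 -> U+11829 (Dogra U+11828 has no Chamba counterpart)
    71341: 71724,  # U+116AD -> U+1182C, first vowel sign row
    71346: 71730,  # U+116B2 -> U+11832
}


def chamba_to_dogra(text):
    """Replace each mapped Chamba code point; leave all other characters as is."""
    return text.translate(CHAMBA_TO_DOGRA)


def offsets():
    """Return the distinct Dogra-minus-Chamba differences among the sample rows."""
    return {dogra - chamba for chamba, dogra in CHAMBA_TO_DOGRA.items()}


if __name__ == "__main__":
    converted = chamba_to_dogra(chr(71306))
    print(f"U+{ord(converted):04X}")
    print(sorted(offsets()))
```

More than one distinct offset among the sample rows is what rules out a simple shift; a complete converter would list every row of both tables and decide how to treat Chamba characters that have no Dogra counterpart.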
ISO/IEC JTC 1/SC 2/WG 2
PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646¹

Please fill all the sections A, B and C below. Please read Principles and Procedures Document (P & P) from http://std.dkuug.dk/JTC1/SC2/WG2/docs/principles.html for guidelines and details before filling this form. Please ensure you are using the latest Form from http://std.dkuug.dk/JTC1/SC2/WG2/docs/summaryform.html. See also http://std.dkuug.dk/JTC1/SC2/WG2/docs/roadmaps.html for latest Roadmaps.

A. Administrative
1. Title: Proposal to encode the Dogra script in Unicode
2. Requester's name: Anshuman Pandey, Script Encoding Initiative
3. Requester type (Member body/Liaison/Individual contribution): Liaison contribution
4. Submission date: 4 November 2015
5. Requester's reference (if applicable):
6. Choose one of the following:
   This is a complete proposal: Yes
   (or) More information will be provided later:
B. Technical - General
1. Choose one of the following:
   a. This proposal is for a new script (set of characters): Yes
      Proposed name of script: Dogra
   b. The proposal is for addition of character(s) to an existing block:
      Name of the existing block:
2. Number of characters in proposal: 59
3. Proposed category (select one from below - see section 2.2 of P&P document):
   A-Contemporary
   B.1-Specialized (small collection)
   B.2-Specialized (large collection)
   C-Major extinct
   D-Attested extinct
   E-Minor extinct: X
   F-Archaic Hieroglyphic or Ideographic
   G-Obscure or questionable usage symbols
4. Is a repertoire including character names provided? Yes
   a. If YES, are the names in accordance with the “character naming guidelines” in Annex L of P&P document?
   b. Are the character shapes attached in a legible form suitable for review?
5. Fonts related:
   a. Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the standard? Anshuman Pandey
   b. Identify the party granting a license for use of the font by the editors (include address, e-mail, ftp-site, etc.): Anshuman Pandey
6. References:
   a. Are references (to other character sets, dictionaries, descriptive texts etc.) provided? Yes
   b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? Yes
7. Special encoding issues: Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration etc. (if yes please enclose information)? Yes
8. Additional Information: Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org for such information on other scripts. Also see Unicode Character Database (http://www.unicode.org/reports/tr44/) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard.

¹ Form number: N4502-F (Original 1994-10-14; Revised 1995-01, 1995-04, 1996-04, 1996-08, 1999-03, 2001-05, 2001-09, 2003-11, 2005-01, 2005-09, 2005-10, 2007-03, 2008-05, 2009-11, 2011-03, 2012-01)
C. Technical - Justification
1. Has this proposal for addition of character(s) been submitted before? No
   If YES, explain:
2. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? Yes
   If YES, with whom? Christopher Shackle (SOAS, London)
   If YES, available relevant documents:
3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? No
   Reference: Historical script, currently not used
4. The context of use for the proposed characters (type of use; common or rare): Common
   Reference:
5. Are the proposed characters in current use by the user community? No
   If YES, where? Reference:
6. After giving due considerations to the principles in the P&P document must the proposed characters be entirely in the BMP? N/A
   If YES, is a rationale provided?
   If YES, reference:
7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? Yes
8. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
9. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
10. Can any of the proposed character(s) be considered to be similar (in appearance or function) to, or could be confused with, an existing character? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
11. Does the proposal include use of combining characters and/or use of composite sequences? Yes
   If YES, is a rationale for such use provided? Yes
   If YES, reference: See text of proposal
   Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided?
   If YES, reference:
12. Does the proposal contain characters with any special properties such as control function or similar semantics? Yes
   If YES, describe in detail (include attachment if necessary): Virama; see text of proposal
13. Does the proposal contain any Ideographic compatibility characters? No
   If YES, are the equivalent corresponding unified ideographic characters identified?
   If YES, reference:
Figure 3: Specimen of the Dogri language written in Old Dogra (from Grierson 1916: 760).
Figure 4: Another specimen of the Dogri language written in Old Dogra (from Grierson 1916: 772).
Figure 5: Chart of the Old Dogra script (from Grierson 1916: 641).
Figure 6: Comparison of Gurmukhi, Kangra, and Old Dogra scripts (from Grierson 1916: 642).
Figure 7: Comparison of New Dogra (‘Dogri’) and Devanagari vowel and consonant letters (from Staal 1984: 33).
Figure 8: Comparison of New Dogra (‘Dogri’) and Devanagari vowel signs (from Staal 1984: 34).
Figure 9: The first page of a Dogri translation of the Līlavatī by Bhāskarācārya, a Sanskrit treatise on mathematics, printed in the New Dogra script in 1872 (p. 1). Figure 10 contains the second page of this book.
Figure 10: Another page from the Līlavatī in New Dogra (p. 2).
Figure 11: Page from the Līlavatī showing the use of printed New Dogra digits (p. 3).
Figure 12: Usage of fractions for denoting currency in New Dogra (from Staal 1984: 75). The text appears to be typeset using metal fonts. The fraction signs and currency marks may be represented using characters from the ‘Common Indic Number Forms’ block (U+A830) in Unicode.
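As a rough illustration of the block named in the caption, the sketch below lists its assigned characters using Python's standard unicodedata module. The helper name indic_number_forms and the choice of the range U+A830..U+A839 (the assigned code points of the block) are assumptions of this sketch, not part of the proposal, and it does not determine which of these signs actually occur in the Dogra sources.

```python
import unicodedata

# Assigned code points of the 'Common Indic Number Forms' block.
FIRST, LAST = 0xA830, 0xA839


def indic_number_forms():
    """Return (code point, character name) pairs for the assigned range."""
    return [(cp, unicodedata.name(chr(cp))) for cp in range(FIRST, LAST + 1)]


if __name__ == "__main__":
    for cp, name in indic_number_forms():
        ch = chr(cp)
        # Fraction signs carry a numeric value; the marks do not,
        # so numeric() falls back to the supplied default of None.
        value = unicodedata.numeric(ch, None)
        print(f"U+{cp:04X}  {name}  numeric={value}")
```

The fraction signs in this range carry numeric values in the Unicode Character Database, while the rupee and quantity marks do not, which is why the numeric lookup is given a default.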
Proposal to encode the Dogra script in Unicode Anshuman Pandey
Figure 13: Two specimens of a trilingual stamp for expedited mail from Jammu State in Old Dogra, Persian, and Devanagari scripts. The Dogra text reads 71690 71690 71717, 71697 71716 71727 71716 71726 kakal jarurī. The Persian reads خط ضروری ḵẖat zarūrī; the Devanagari reads आवश्यक पत्र āvaśyaka patra.
Figure 14: Four bilingual regular postage stamps from Jammu State with text in the Old Dogra and Persian scripts. Stamps and are half ‘anna’ stamps and is a one ‘anna’ stamp. In all stamps the Dogra text at the top reads 71697 71714 71684 jamu ‘Jammu’ and 71690 71696 71714 71726 71716 kachamīr (= kaśmīr) ‘Kashmir’. In stamps and the first line of the center is Persian ۱۹۲۳ نم انه 1923 [samvat] nim ānah ‘half ānnā’; the second line of the center is Dogra 71680 71707 71680 71709 ada ana ‘half ānnā’. In the first line of the center text is Persian یك انه yak ānah ‘one ānnā’; the second line of the center is Dogra 71686 71690 71680 71709 ekana ‘one ānnā’. In all four stamps the third line of the center is the date 1923 [samvat] written in Devanagari-like digits १९२३. The Persian text at the bottom of the stamp reads قلمرو سرکار جمون و کشمیر qalamrau sarkār jammūn o kaśmīr ‘dominions of the ruler of Jammu and Kashmir’.
Detail of top left (Devanagari)
Detail of top right (New Dogra)
Detail of center (New Dogra Devanagari and Urdu)
Figure 15: The mast of a two rupee non-judicial stamp paper from Kashmir State.
Figure 16: The mast of a five rupee non-judicial stamp paper from Kashmir state containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.
Figure 17: A one anna stamp from Jammu and Kashmir state with text in New Dogra, Devanagari, and Arabic (Urdu).
Figure 18: Telegraph stamps from Jammu and Kashmir state with text in Dogra, Devanagari, Urdu, and English. The Dogra denominations show usage of Common Indic fractions and the rupee mark.
postakarda kalamrau jammu kasmir va tibbataha “Postcard of the realm of Jammu, Kashmir, and Tibet”
isa tarapha siva pate ke kucha na likho “Do not write anything on this side except for the address”
jamu, sinagar “Jammu”, “Srinagar”
bhaisakh (4 ), 4 vaisakh “Vaisakh 4”, “4 Vaisakh”
Figure 19: A state postcard from Jammu and Kashmir with postmark of 19 April 1891. The inscription at top is in New Dogra and the content is in Persian. The postmarks are in English and Dogra.
One rupee banknote
Twenty rupee banknote
Figure 20: Bank notes from Jammu and Kashmir State containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.
Chamba Dogra
71296 71680
71297 71681
71298 71682
71299 (none)
71300 71684
71301 (none)
71302 71686
71303 71687
71304 71688
71305 (none)
Chamba Dogra
no dependent form
71341 71724
71342 71725
71343 71726
71344 71727
71345 71728
71346 71730
71347 71731
71348 71732
71349 71733
Table 2: Comparison of Chamba and Dogra vowel letters and signs.
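The entries of Table 2 survive in this copy as decimal numbers. A minimal Python sketch, assuming those numbers can be read as code point values (an assumption of this sketch, not a statement about any final assignment), shows them in U+ notation and computes the difference between the two columns. The names `VOWEL_LETTERS`, `VOWEL_SIGNS`, `to_hex`, and `offsets` are hypothetical helpers introduced for illustration.

```python
# Rows transcribed from Table 2 as decimal values; None marks a Chamba
# entry with no Dogra counterpart. Reading the values as code point
# numbers is an assumption of this sketch.
VOWEL_LETTERS = [
    (71296, 71680), (71297, 71681), (71298, 71682), (71299, None),
    (71300, 71684), (71301, None), (71302, 71686), (71303, 71687),
    (71304, 71688), (71305, None),
]
VOWEL_SIGNS = [
    (71341, 71724), (71342, 71725), (71343, 71726), (71344, 71727),
    (71345, 71728), (71346, 71730), (71347, 71731), (71348, 71732),
    (71349, 71733),
]


def to_hex(value):
    """Format a decimal value in U+ notation."""
    return f"U+{value:04X}"


def offsets(rows):
    """Return Dogra-minus-Chamba differences for rows that have both values."""
    return [dogra - chamba for chamba, dogra in rows if dogra is not None]


if __name__ == "__main__":
    for chamba, dogra in VOWEL_LETTERS + VOWEL_SIGNS:
        right = to_hex(dogra) if dogra is not None else "(none)"
        print(to_hex(chamba), right)
    print(offsets(VOWEL_LETTERS))
    print(offsets(VOWEL_SIGNS))
```

In these rows the difference is constant for the vowel letters and changes by one partway through the vowel signs, at the point where the Dogra column passes over 71729.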
Chamba Dogra
71306 71690
71307 71691
71308 71692
71309 71693
71310 71694
71311 71695
71312 71696
71313 71697
71314 71698
71315 71699
71316 71700
71317 71701
71318 71702
71319 71703
71320 71704
71321 71705
71322 71706
Chamba Dogra
71323 71707
71324 71708
71325 71709
71326 71710
71327 71711
71328 71712
71329 71713
71330 71714
71331 71715
71332 71716
71333 71717
71334 71718
71335 71719
(none) 71720
71336 71721
71337 71722
71338 71723
Table 3: Comparison of Chamba and Dogra consonant letters.
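Where decimal values of this kind have been run together without separators in a copy of this text, for example `716907169071717`, they can be separated mechanically. The following Python sketch assumes every value is exactly five digits wide, which holds for all values in Tables 2 and 3; the function name `split_fused` is hypothetical.

```python
def split_fused(run, width=5):
    """Split a string of fused fixed-width decimal values into integers.

    Assumes every value occupies exactly `width` digits; raises ValueError
    when the input cannot be divided evenly or contains non-digits.
    """
    if not run.isdigit() or len(run) % width != 0:
        raise ValueError(f"cannot split {run!r} into {width}-digit values")
    return [int(run[i:i + width]) for i in range(0, len(run), width)]


if __name__ == "__main__":
    print(split_fused("716907169071717"))
    print(split_fused("71686716907168071709"))
```

Rejecting inputs whose length is not a multiple of the width is a deliberate choice: a silently truncated value would be harder to notice than an error.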
ISO/IEC JTC 1/SC 2/WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS
FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646
Please fill all the sections A, B and C below. Please read Principles and Procedures Document (P & P) from http://std.dkuug.dk/JTC1/SC2/WG2/docs/principles.html for guidelines and details before filling this form. Please ensure you are using the latest Form from http://std.dkuug.dk/JTC1/SC2/WG2/docs/summaryform.html. See also http://std.dkuug.dk/JTC1/SC2/WG2/docs/roadmaps.html for latest Roadmaps.
A. Administrative
1. Title: Proposal to encode the Dogra script in Unicode
2. Requester's name: Anshuman Pandey, Script Encoding Initiative
3. Requester type (Member body/Liaison/Individual contribution): Liaison contribution
4. Submission date: 4 November 2015
5. Requester's reference (if applicable):
6. Choose one of the following:
   This is a complete proposal: Yes
   (or) More information will be provided later:
B. Technical - General
1. Choose one of the following:
   a. This proposal is for a new script (set of characters): Yes
      Proposed name of script: Dogra
   b. The proposal is for addition of character(s) to an existing block:
      Name of the existing block:
2. Number of characters in proposal: 59
3. Proposed category (select one from below - see section 2.2 of P&P document): A-Contemporary; B.1-Specialized (small collection); B.2-Specialized (large collection); C-Major extinct; D-Attested extinct; E-Minor extinct X; F-Archaic Hieroglyphic or Ideographic; G-Obscure or questionable usage symbols
4. Is a repertoire including character names provided? Yes
   a. If YES, are the names in accordance with the “character naming guidelines” in Annex L of P&P document?
   b. Are the character shapes attached in a legible form suitable for review?
5. Fonts related:
   a. Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the standard? Anshuman Pandey
   b. Identify the party granting a license for use of the font by the editors (include address, e-mail, ftp-site, etc.): Anshuman Pandey
6. References:
   a. Are references (to other character sets, dictionaries, descriptive texts etc.) provided? Yes
   b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? Yes
7. Special encoding issues: Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration etc. (if yes please enclose information)? Yes
8. Additional Information: Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org for such information on other scripts. Also see Unicode Character Database (http://www.unicode.org/reports/tr44/) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard.
Form number: N4502-F (Original 1994-10-14; Revised 1995-01, 1995-04, 1996-04, 1996-08, 1999-03, 2001-05, 2001-09, 2003-11, 2005-01, 2005-09, 2005-10, 2007-03, 2008-05, 2009-11, 2011-03, 2012-01)
C. Technical - Justification
1. Has this proposal for addition of character(s) been submitted before? No
   If YES, explain:
2. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? Yes
   If YES, with whom? Christopher Shackle (SOAS, London)
   If YES, available relevant documents:
3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? No
   Reference: Historical script, currently not used
4. The context of use for the proposed characters (type of use; common or rare): Common
   Reference:
5. Are the proposed characters in current use by the user community? No
   If YES, where? Reference:
6. After giving due considerations to the principles in the P&P document, must the proposed characters be entirely in the BMP? N/A
   If YES, is a rationale provided?
   If YES, reference:
7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? Yes
8. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
9. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
10. Can any of the proposed character(s) be considered to be similar (in appearance or function) to, or could be confused with, an existing character? No
    If YES, is a rationale for its inclusion provided?
    If YES, reference:
11. Does the proposal include use of combining characters and/or use of composite sequences? Yes
    If YES, is a rationale for such use provided? Yes
    If YES, reference: See text of proposal
    Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided?
    If YES, reference:
12. Does the proposal contain characters with any special properties such as control function or similar semantics? Yes
    If YES, describe in detail (include attachment if necessary): Virama; see text of proposal
13. Does the proposal contain any Ideographic compatibility characters? No
    If YES, are the equivalent corresponding unified ideographic characters identified?
    If YES, reference:
Proposal to encode the Dogra script in Unicode Anshuman Pandey
Figure 9 The first page of a Dogri translation of the Līlavatī by Bhāskarācārya a Sanskrit treastiseon mathematics printed in the New Dogra script in 1872 (p 1) Figure 10 contains the second pageof this book
25
Proposal to encode the Dogra script in Unicode Anshuman Pandey
Figure 10: Another page from the Līlavatī in New Dogra (p. 2).
Figure 11: Page from the Līlavatī showing the use of printed New Dogra digits (p. 3).
Figure 12: Usage of fractions for denoting currency in New Dogra (from Staal 1984: 75). The text appears to be typeset using metal fonts. The fraction signs and currency marks may be represented using characters from the ‘Common Indic Number Forms’ block (U+A830) in Unicode.
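As a rough illustration of the block named in the caption of Figure 12, the following Python sketch lists the assigned characters of the ‘Common Indic Number Forms’ block with the standard `unicodedata` module. The block range U+A830..U+A83F is taken from the Unicode code charts rather than from this proposal, and the function name is hypothetical. The sketch does not determine which of these characters correspond to the individual signs in the figure.

```python
import unicodedata

# Assumption: block boundaries as published in the Unicode code charts,
# not stated in the proposal itself.
BLOCK_START = 0xA830
BLOCK_END = 0xA83F


def common_indic_number_forms():
    """Return (code point label, character name) pairs for assigned characters."""
    assigned = []
    for cp in range(BLOCK_START, BLOCK_END + 1):
        # unicodedata.name() returns the default (None) for unassigned code points.
        name = unicodedata.name(chr(cp), None)
        if name is not None:
            assigned.append((f"U+{cp:04X}", name))
    return assigned


if __name__ == "__main__":
    for label, name in common_indic_number_forms():
        print(label, name)
```

The listing includes both the fraction signs and the rupee mark mentioned in connection with the Dogra denominations.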
Figure 13: Two specimens of a trilingual stamp for expedited mail from Jammu State in Old Dogra, Persian, and Devanagari scripts. The Dogra text reads U+1180A U+1180A U+11825, U+11811 U+11824 U+1182F U+11824 U+1182E kakal jarurī. The Persian reads خط ضروری ḵẖat zarūrī; the Devanagari reads आवश्यक पत्र āvaśyaka patra.
Figure 14: Four bilingual regular postage stamps from Jammu State with text in the Old Dogra and Persian scripts. The set comprises half ‘anna’ stamps and a one ‘anna’ stamp. In all stamps the Dogra text at the top reads U+11811 U+11822 U+11804 jamu ‘Jammu’ and U+1180A U+11810 U+11822 U+1182E U+11824 kachamīr (= kaśmīr) ‘Kashmir’. In the half ‘anna’ stamps the first line of the center is Persian ۱۹۲۳ نم انه 1923 [samvat] nim ānah ‘half ānnā’; the second line of the center is Dogra U+11800 U+1181B U+11800 U+1181D ada ana ‘half ānnā’. In the one ‘anna’ stamp the first line of the center is Persian یك انه yak ānah ‘one ānnā’; the second line of the center is Dogra U+11806 U+1180A U+11800 U+1181D ekana ‘one ānnā’. In all four stamps the third line of the center is the date 1923 [samvat] written in Devanagari-like digits १९२३. The Persian text at the bottom of the stamp reads قلمرو سرکار جمون و کشمیر qalamrau sarkār jammūn o kaśmīr ‘dominions of the ruler of Jammu and Kashmir’.
Detail of top left (Devanagari)
Detail of top right (New Dogra)
Detail of center (New Dogra, Devanagari, and Urdu)
Figure 15: The mast of a two rupee non-judicial stamp paper from Kashmir State.
Figure 16: The mast of a five rupee non-judicial stamp paper from Kashmir state containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.
Figure 17: A one anna stamp from Jammu and Kashmir state with text in New Dogra, Devanagari, and Arabic (Urdu).
Figure 18: Telegraph stamps from Jammu and Kashmir state with text in Dogra, Devanagari, Urdu, and English. The Dogra denominations show usage of Common Indic fractions and the rupee mark.
postakarda kalamrau jammu kasmir va tibbataha: “Postcard of the realm of Jammu, Kashmir, and Tibet”
isa tarapha siva pate ke kucha na likho: “Do not write anything on this side except for the address”
jamu: “Jammu”; sinagar: “Srinagar”
bhaisakh (4 ): “Vaisakh 4”; 4 vaisakh: “4 Vaisakh”
Figure 19: A state postcard from Jammu and Kashmir with postmark of 19 April 1891. The inscription at top is in New Dogra and the content is in Persian. The postmarks are in English and Dogra.
One rupee banknote
Twenty rupee banknote
Figure 20: Bank notes from Jammu and Kashmir State containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.
Chamba      Dogra
U+11680     U+11800
U+11681     U+11801
U+11682     U+11802
U+11683     none
U+11684     U+11804
U+11685     none
U+11686     U+11806
U+11687     U+11807
U+11688     U+11808
U+11689     none

Chamba      Dogra
no dependent form
U+116AD     U+1182C
U+116AE     U+1182D
U+116AF     U+1182E
U+116B0     U+1182F
U+116B1     U+11830
U+116B2     U+11832
U+116B3     U+11833
U+116B4     U+11834
U+116B5     U+11835

Table 2: Comparison of Chamba and Dogra vowel letters and signs.
Chamba      Dogra
U+1168A     U+1180A
U+1168B     U+1180B
U+1168C     U+1180C
U+1168D     U+1180D
U+1168E     U+1180E
U+1168F     U+1180F
U+11690     U+11810
U+11691     U+11811
U+11692     U+11812
U+11693     U+11813
U+11694     U+11814
U+11695     U+11815
U+11696     U+11816
U+11697     U+11817
U+11698     U+11818
U+11699     U+11819
U+1169A     U+1181A
U+1169B     U+1181B
U+1169C     U+1181C
U+1169D     U+1181D
U+1169E     U+1181E
U+1169F     U+1181F
U+116A0     U+11820
U+116A1     U+11821
U+116A2     U+11822
U+116A3     U+11823
U+116A4     U+11824
U+116A5     U+11825
U+116A6     U+11826
U+116A7     U+11827
none        U+11828
U+116A8     U+11829
U+116A9     U+1182A
U+116AA     U+1182B

Table 3: Comparison of Chamba and Dogra consonant letters.
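The pairing in Table 3 can be restated arithmetically. The sketch below assumes that the values in the table are code points: the Chamba consonants run from U+1168A to U+116AA (decimal 71306 to 71338), the Dogra consonants from U+1180A to U+1182B (decimal 71690 to 71723), and one Dogra value, U+11828 (decimal 71720), has no Chamba counterpart. The function and constant names are illustrative and do not come from the proposal.

```python
# Assumption: the numbers in Table 3 are code points; all names here are illustrative.
CHAMBA_FIRST = 0x1168A            # 71306, first Chamba consonant in Table 3
CHAMBA_LAST_BEFORE_GAP = 0x116A7  # 71335, last Chamba value before the unpaired Dogra row
CHAMBA_LAST = 0x116AA             # 71338, last Chamba consonant in Table 3
DOGRA_UNPAIRED = 0x11828          # 71720, Dogra value with no Chamba counterpart


def u_plus(cp: int) -> str:
    """Format an integer code point in U+XXXX notation."""
    return f"U+{cp:04X}"


def chamba_consonant_to_dogra(cp: int) -> int:
    """Return the Dogra value paired with a Chamba consonant in Table 3."""
    if CHAMBA_FIRST <= cp <= CHAMBA_LAST_BEFORE_GAP:
        return cp + 0x180  # constant offset of 384 up to the unpaired row
    if CHAMBA_LAST_BEFORE_GAP < cp <= CHAMBA_LAST:
        return cp + 0x181  # offset grows by one after the unpaired Dogra value
    raise ValueError(f"{u_plus(cp)} is not a Chamba consonant listed in Table 3")


if __name__ == "__main__":
    for cp in (CHAMBA_FIRST, CHAMBA_LAST_BEFORE_GAP, 0x116A8, CHAMBA_LAST):
        print(u_plus(cp), "->", u_plus(chamba_consonant_to_dogra(cp)))
```

The change of offset from 0x180 to 0x181 reflects the single Dogra row without a Chamba value; no Chamba input maps onto U+11828.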
ISO/IEC JTC 1/SC 2/WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS
FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646¹
Please fill all the sections A, B and C below. Please read Principles and Procedures Document (P & P) from http://std.dkuug.dk/JTC1/SC2/WG2/docs/principles.html for guidelines and details before filling this form. Please ensure you are using the latest Form from http://std.dkuug.dk/JTC1/SC2/WG2/docs/summaryform.html.
See also http://std.dkuug.dk/JTC1/SC2/WG2/docs/roadmaps.html for latest Roadmaps.
A. Administrative
1. Title: Proposal to encode the Dogra script in Unicode
2. Requester's name: Anshuman Pandey, Script Encoding Initiative
3. Requester type (Member body/Liaison/Individual contribution): Liaison contribution
4. Submission date: 4 November 2015
5. Requester's reference (if applicable):
6. Choose one of the following:
   This is a complete proposal: Yes
   (or) More information will be provided later:
B. Technical - General
1. Choose one of the following:
   a. This proposal is for a new script (set of characters): Yes
      Proposed name of script: Dogra
   b. The proposal is for addition of character(s) to an existing block:
      Name of the existing block:
2. Number of characters in proposal: 59
3. Proposed category (select one from below - see section 2.2 of P&P document):
   A-Contemporary
   B.1-Specialized (small collection)
   B.2-Specialized (large collection)
   C-Major extinct
   D-Attested extinct
   E-Minor extinct: X
   F-Archaic Hieroglyphic or Ideographic
   G-Obscure or questionable usage symbols
4. Is a repertoire including character names provided? Yes
   a. If YES, are the names in accordance with the “character naming guidelines” in Annex L of P&P document?
   b. Are the character shapes attached in a legible form suitable for review?
5. Fonts related:
   a. Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the standard? Anshuman Pandey
   b. Identify the party granting a license for use of the font by the editors (include address, e-mail, ftp-site, etc.): Anshuman Pandey
6. References:
   a. Are references (to other character sets, dictionaries, descriptive texts etc.) provided? Yes
   b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? Yes
7. Special encoding issues: Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration etc. (if yes please enclose information)? Yes
8. Additional Information: Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org for such information on other scripts. Also see Unicode Character Database (http://www.unicode.org/reports/tr44/) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard.
¹ Form number: N4502-F (Original 1994-10-14; Revised 1995-01, 1995-04, 1996-04, 1996-08, 1999-03, 2001-05, 2001-09, 2003-11, 2005-01, 2005-09, 2005-10, 2007-03, 2008-05, 2009-11, 2011-03, 2012-01)
C. Technical - Justification
1. Has this proposal for addition of character(s) been submitted before? No
   If YES, explain:
2. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? Yes
   If YES, with whom? Christopher Shackle (SOAS, London)
   If YES, available relevant documents:
3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? No
   Reference: Historical script, currently not used
4. The context of use for the proposed characters (type of use; common or rare): Common
   Reference:
5. Are the proposed characters in current use by the user community? No
   If YES, where? Reference:
6. After giving due considerations to the principles in the P&P document must the proposed characters be entirely in the BMP? N/A
   If YES, is a rationale provided? If YES, reference:
7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? Yes
8. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? No
   If YES, is a rationale for its inclusion provided? If YES, reference:
9. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? No
   If YES, is a rationale for its inclusion provided? If YES, reference:
10. Can any of the proposed character(s) be considered to be similar (in appearance or function) to, or could be confused with, an existing character? No
   If YES, is a rationale for its inclusion provided? If YES, reference:
11. Does the proposal include use of combining characters and/or use of composite sequences? Yes
   If YES, is a rationale for such use provided? Yes
   If YES, reference: See text of proposal
   Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided? If YES, reference:
12. Does the proposal contain characters with any special properties such as control function or similar semantics? Yes
   If YES, describe in detail (include attachment if necessary): Virama; see text of proposal
13. Does the proposal contain any Ideographic compatibility characters? No
   If YES, are the equivalent corresponding unified ideographic characters identified? If YES, reference:
Proposal to encode the Dogra script in Unicode Anshuman Pandey
Figure 12 Usage of fractions for denoting currency in New Dogra (from Staal 1984 75) The textappears to be typeset using metal fonts The fraction signs are currency marks may be representedusing characters from the lsquoCommon Indic Number Formsrsquo block (U+A830) in Unicode
28
Proposal to encode the Dogra script in Unicode Anshuman Pandey
Figure 13 Two specimens of a trilingual stamp for expedited mail from Jammu State in Old DograPersian and Devanagari scripts The Dogra text reads 716907169071717 7169771716717277171671726 kakal jarurī The Persian readsضروری خط ḵẖat zarūrī the Devanagari reads āvaśyaka patraआव यक प
Figure 14 Four bilingual regular postage stamps from Jammu State with text in the Old Dogra andPersian scripts Stamps and are half lsquoannarsquo stamps and is a one lsquoannarsquo stamp In all stampsthe Dogra text at the top reads 716977171471684 jamu lsquoJammursquo and 7169071696717147172671716 kachamīr (= kaśmīr) lsquoKashmirrsquoIn stamps and the first line of the center is Persian انه نم ۱۹۲۳ 1923 [samvat] nim ānah lsquohalfānnārsquo the second line of the center is Dogra71680717077168071709 ada ana lsquohalf ānnārsquo In the first line of thecenter text is Persian انه یك yak ānah lsquoone ānnārsquo the second line of the center is Dogra71686716907168071709 ekana lsquoone ānnārsquo In all four stamps the third line of the center is the date 1923 [samvat] written inDevanagari-like digits १९२३ The Persian text at the bottom of the stamp reads جمون سرکار قلمروکشمیر و qalamrau sarkār jammūn o kaśmīr lsquodominions of the ruler of Jammu and Kashmirrsquo
Detail of top left (Devanagari)
Detail of top right (New Dogra)
Detail of center (New Dogra Devanagari and Urdu)
Figure 15: The mast of a two rupee non-judicial stamp paper from Kashmir State.
Figure 16: The mast of a five rupee non-judicial stamp paper from Kashmir State containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.
Figure 17: A one anna stamp from Jammu and Kashmir State with text in New Dogra, Devanagari, and Arabic (Urdu).
Figure 18: Telegraph stamps from Jammu and Kashmir State with text in Dogra, Devanagari, Urdu, and English. The Dogra denominations show usage of Common Indic fractions and the rupee mark.
postakarda kalamrau jammu kasmir va tibbataha “Postcard of the realm of Jammu, Kashmir and Tibet”
isa tarapha siva pate ke kucha na likho “Do not write anything on this side except for the address”
jamu sinagar “Jammu”, “Srinagar”
bhaisakh (4 ) 4 vaisakh “Vaisakh 4”, “4 Vaisakh”
Figure 19: A state postcard from Jammu and Kashmir with postmark of 19 April 1891. The inscription at top is in New Dogra and the content is in Persian. The postmarks are in English and Dogra.
One rupee banknote
Twenty rupee banknote
Figure 20: Banknotes from Jammu and Kashmir State containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.
Chamba    Dogra
71296     71680
71297     71681
71298     71682
71299     -
71300     71684
71301     -
71302     71686
71303     71687
71304     71688
71305     -

Chamba    Dogra
(no dependent form)
71341     71724
71342     71725
71343     71726
71344     71727
71345     71728
71346     71730
71347     71731
71348     71732
71349     71733

Table 2: Comparison of Chamba and Dogra vowel letters and signs
Chamba    Dogra
71306     71690
71307     71691
71308     71692
71309     71693
71310     71694
71311     71695
71312     71696
71313     71697
71314     71698
71315     71699
71316     71700
71317     71701
71318     71702
71319     71703
71320     71704
71321     71705
71322     71706
71323     71707
71324     71708
71325     71709
71326     71710
71327     71711
71328     71712
71329     71713
71330     71714
71331     71715
71332     71716
71333     71717
71334     71718
71335     71719
-         71720
71336     71721
71337     71722
71338     71723

Table 3: Comparison of Chamba and Dogra consonant letters
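The numeric values in the comparison tables look like decimal code points. This is an assumption, based on the fact that 71296 equals U+11680, the start of the Takri block, and 71680 equals U+11800. Under that assumption, the following Python sketch converts a few of the pairs to U+ notation and computes the distance between the two columns. The names `to_uplus`, `offsets`, and `PAIRS` are hypothetical helpers introduced only for this illustration.

```python
def to_uplus(cp: int) -> str:
    """Format a decimal code point in the conventional U+XXXX notation."""
    return f"U+{cp:04X}"


# A few (Chamba, Dogra) pairs copied from the tables; None stands for a dash,
# i.e. a Dogra letter with no Chamba counterpart listed.
PAIRS = [
    (71296, 71680),
    (71306, 71690),
    (71335, 71719),
    (None, 71720),
    (71336, 71721),
]


def offsets(pairs):
    """Distance between the Dogra and Chamba values for each pair (None if unpaired)."""
    return [None if chamba is None else dogra - chamba for chamba, dogra in pairs]


for chamba, dogra in PAIRS:
    left = to_uplus(chamba) if chamba is not None else "-"
    print(left, "->", to_uplus(dogra))

print(offsets(PAIRS))
```

The offset between the columns stays constant until the row whose Chamba cell is a dash, after which it grows by one. That matches the table's layout, where one Dogra value has no Chamba equivalent.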
ISO/IEC JTC 1/SC 2/WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS
FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646
Please fill all the sections A, B and C below. Please read Principles and Procedures Document (P & P) from http://std.dkuug.dk/JTC1/SC2/WG2/docs/principles.html for guidelines and details before filling this form. Please ensure you are using the latest Form from http://std.dkuug.dk/JTC1/SC2/WG2/docs/summaryform.html.
See also http://std.dkuug.dk/JTC1/SC2/WG2/docs/roadmaps.html for latest Roadmaps.
A. Administrative
1. Title: Proposal to encode the Dogra script in Unicode
2. Requester's name: Anshuman Pandey, Script Encoding Initiative
3. Requester type (Member body/Liaison/Individual contribution): Liaison contribution
4. Submission date: 4 November 2015
5. Requester's reference (if applicable):
6. Choose one of the following:
   This is a complete proposal: Yes
   (or) More information will be provided later:

B. Technical - General
1. Choose one of the following:
   a. This proposal is for a new script (set of characters): Yes
      Proposed name of script: Dogra
   b. The proposal is for addition of character(s) to an existing block:
      Name of the existing block:
2. Number of characters in proposal: 59
3. Proposed category (select one from below - see section 2.2 of P&P document):
   A-Contemporary; B.1-Specialized (small collection); B.2-Specialized (large collection); C-Major extinct; D-Attested extinct; E-Minor extinct: X; F-Archaic Hieroglyphic or Ideographic; G-Obscure or questionable usage symbols
4. Is a repertoire including character names provided? Yes
   a. If YES, are the names in accordance with the “character naming guidelines” in Annex L of P&P document?
   b. Are the character shapes attached in a legible form suitable for review?
5. Fonts related:
   a. Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the standard? Anshuman Pandey
   b. Identify the party granting a license for use of the font by the editors (include address, e-mail, ftp-site, etc.): Anshuman Pandey
6. References:
   a. Are references (to other character sets, dictionaries, descriptive texts etc.) provided? Yes
   b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? Yes
7. Special encoding issues: Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration etc. (if yes please enclose information)? Yes
8. Additional Information: Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org for such information on other scripts. Also see Unicode Character Database (http://www.unicode.org/reports/tr44/) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard.
Form number: N4502-F (Original 1994-10-14; Revised 1995-01, 1995-04, 1996-04, 1996-08, 1999-03, 2001-05, 2001-09, 2003-11, 2005-01, 2005-09, 2005-10, 2007-03, 2008-05, 2009-11, 2011-03, 2012-01)
C. Technical - Justification
1. Has this proposal for addition of character(s) been submitted before? No
   If YES, explain:
2. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? Yes
   If YES, with whom? Christopher Shackle (SOAS, London)
   If YES, available relevant documents:
3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? No
   Reference: Historical script, currently not used
4. The context of use for the proposed characters (type of use; common or rare): Common
   Reference:
5. Are the proposed characters in current use by the user community? No
   If YES, where? Reference:
6. After giving due considerations to the principles in the P&P document, must the proposed characters be entirely in the BMP? N/A
   If YES, is a rationale provided? If YES, reference:
7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? Yes
8. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? No
   If YES, is a rationale for its inclusion provided? If YES, reference:
9. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? No
   If YES, is a rationale for its inclusion provided? If YES, reference:
10. Can any of the proposed character(s) be considered to be similar (in appearance or function) to, or could be confused with, an existing character? No
   If YES, is a rationale for its inclusion provided? If YES, reference:
11. Does the proposal include use of combining characters and/or use of composite sequences? Yes
   If YES, is a rationale for such use provided? Yes
   If YES, reference: See text of proposal
   Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided? If YES, reference:
12. Does the proposal contain characters with any special properties such as control function or similar semantics? Yes
   If YES, describe in detail (include attachment if necessary): Virama; see text of proposal
13. Does the proposal contain any Ideographic compatibility characters? No
   If YES, are the equivalent corresponding unified ideographic characters identified? If YES, reference:
12 Does the proposal contain characters with any special properties such as control function or similar semantics Yes If YES describe in detail (include attachment if necessary) Virama
see text of proposal 13 Does the proposal contain any Ideographic compatibility characters No If YES are the equivalent corresponding unified ideographic characters identified If YES reference
Figure 18: Telegraph stamps from Jammu and Kashmir State with text in Dogra, Devanagari, Urdu, and English. The Dogra denominations show usage of Common Indic fractions and the rupee mark.
postakarda kalamrau jammu kasmir va tibbataha “Postcard of the realm of Jammu, Kashmir, and Tibet”
isa tarapha siva pate ke kucha na likho “Do not write anything on this side except for the address”
jamu sinagar “Jammu” “Srinagar”
bhaisakh (4 ) 4 vaisakh “Vaisakh 4” “4 Vaisakh”
Figure 19: A state postcard from Jammu and Kashmir with postmark of 19 April 1891. The inscription at top is in New Dogra and the content is in Persian. The postmarks are in English and Dogra.
One rupee banknote
Twenty rupee banknote
Figure 20: Banknotes from Jammu and Kashmir State containing text in New Dogra, Devanagari, and Arabic (Persian) scripts.
Chamba Dogra
71296 71680
71297 71681
71298 71682
71299 -
71300 71684
71301 -
71302 71686
71303 71687
71304 71688
71305 -
Chamba Dogra
no dependent form
71341 71724
71342 71725
71343 71726
71344 71727
71345 71728
71346 71730
71347 71731
71348 71732
71349 71733
Table 2: Comparison of Chamba and Dogra vowel letters and signs
Chamba Dogra
71306 71690
71307 71691
71308 71692
71309 71693
71310 71694
71311 71695
71312 71696
71313 71697
71314 71698
71315 71699
71316 71700
71317 71701
71318 71702
71319 71703
71320 71704
71321 71705
71322 71706
Chamba Dogra
71323 71707
71324 71708
71325 71709
71326 71710
71327 71711
71328 71712
71329 71713
71330 71714
71331 71715
71332 71716
71333 71717
71334 71718
71335 71719
- 71720
71336 71721
71337 71722
71338 71723
Table 3: Comparison of Chamba and Dogra consonant letters
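Assuming the numeric values in the Chamba and Dogra columns are decimal code point values (an interpretation, not something stated in the tables themselves), they can be converted to the conventional U+ notation for easier comparison with code charts. The following is a minimal sketch; the function name to_u_notation and the two sample rows are chosen here for illustration only.

```python
def to_u_notation(value: int) -> str:
    """Format an integer code point value in U+XXXX notation (at least 4 hex digits)."""
    return f"U+{value:04X}"


# Sample rows copied from the first line of each comparison table,
# read as (Chamba, Dogra) decimal values. Treating them as code point
# values is an assumption.
sample_rows = [(71296, 71680), (71306, 71690)]

for chamba, dogra in sample_rows:
    print(to_u_notation(chamba), to_u_notation(dogra))
```

Under that assumption, the first pair corresponds to U+11680 and U+11800.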
ISO/IEC JTC 1/SC 2/WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS
FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646 [1]
Please fill all the sections A, B and C below.
Please read Principles and Procedures Document (P & P) from http://std.dkuug.dk/JTC1/SC2/WG2/docs/principles.html for guidelines and details before filling this form.
Please ensure you are using the latest Form from http://std.dkuug.dk/JTC1/SC2/WG2/docs/summaryform.html.
See also http://std.dkuug.dk/JTC1/SC2/WG2/docs/roadmaps.html for latest Roadmaps.
A. Administrative
1. Title: Proposal to encode the Dogra script in Unicode
2. Requester's name: Anshuman Pandey, Script Encoding Initiative
3. Requester type (Member body/Liaison/Individual contribution): Liaison contribution
4. Submission date: 4 November 2015
5. Requester's reference (if applicable):
6. Choose one of the following:
   This is a complete proposal: Yes
   (or) More information will be provided later:
B. Technical – General
1. Choose one of the following:
   a. This proposal is for a new script (set of characters): Yes
      Proposed name of script: Dogra
   b. The proposal is for addition of character(s) to an existing block:
      Name of the existing block:
2. Number of characters in proposal: 59
3. Proposed category (select one from below - see section 2.2 of P&P document):
   A-Contemporary
   B.1-Specialized (small collection)
   B.2-Specialized (large collection)
   C-Major extinct
   D-Attested extinct
   E-Minor extinct: X
   F-Archaic Hieroglyphic or Ideographic
   G-Obscure or questionable usage symbols
4. Is a repertoire including character names provided? Yes
   a. If YES, are the names in accordance with the “character naming guidelines” in Annex L of P&P document?
   b. Are the character shapes attached in a legible form suitable for review?
5. Fonts related:
   a. Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the standard? Anshuman Pandey
   b. Identify the party granting a license for use of the font by the editors (include address, e-mail, ftp-site, etc.): Anshuman Pandey
6. References:
   a. Are references (to other character sets, dictionaries, descriptive texts, etc.) provided? Yes
   b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? Yes
7. Special encoding issues: Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration, etc. (if yes, please enclose information)? Yes
8. Additional Information: Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths, etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org for such information on other scripts. Also see Unicode Character Database (http://www.unicode.org/reports/tr44/) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard.
[1] Form number: N4502-F (Original 1994-10-14; Revised 1995-01, 1995-04, 1996-04, 1996-08, 1999-03, 2001-05, 2001-09, 2003-11, 2005-01, 2005-09, 2005-10, 2007-03, 2008-05, 2009-11, 2011-03, 2012-01)
C. Technical - Justification
1. Has this proposal for addition of character(s) been submitted before? No
   If YES, explain:
2. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? Yes
   If YES, with whom? Christopher Shackle (SOAS, London)
   If YES, available relevant documents:
3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? No
   Reference: Historical script, currently not used
4. The context of use for the proposed characters (type of use; common or rare): Common
   Reference:
5. Are the proposed characters in current use by the user community? No
   If YES, where? Reference:
6. After giving due considerations to the principles in the P&P document, must the proposed characters be entirely in the BMP? N/A
   If YES, is a rationale provided?
   If YES, reference:
7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? Yes
8. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
9. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
10. Can any of the proposed character(s) be considered to be similar (in appearance or function) to, or could be confused with, an existing character? No
    If YES, is a rationale for its inclusion provided?
    If YES, reference:
11. Does the proposal include use of combining characters and/or use of composite sequences? Yes
    If YES, is a rationale for such use provided? Yes
    If YES, reference: See text of proposal
    Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided?
    If YES, reference:
12. Does the proposal contain characters with any special properties such as control function or similar semantics? Yes
    If YES, describe in detail (include attachment if necessary): Virama; see text of proposal
13. Does the proposal contain any Ideographic compatibility characters? No
    If YES, are the equivalent corresponding unified ideographic characters identified?
    If YES, reference:
Proposal to encode the Dogra script in Unicode Anshuman Pandey
Chamba   Dogra
71306    71690
71307    71691
71308    71692
71309    71693
71310    71694
71311    71695
71312    71696
71313    71697
71314    71698
71315    71699
71316    71700
71317    71701
71318    71702
71319    71703
71320    71704
71321    71705
71322    71706
71323    71707
71324    71708
71325    71709
71326    71710
71327    71711
71328    71712
71329    71713
71330    71714
71331    71715
71332    71716
71333    71717
71334    71718
71335    71719
-        71720
71336    71721
71337    71722
71338    71723

Table 3: Comparison of Chamba and Dogra consonant letters
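The numbers in Table 3 look like decimal Unicode code points standing in for the glyphs of the original table. Read that way, the Chamba column falls in the Takri block (which starts at U+11680) and the Dogra column in a block starting at U+11800. The following is a minimal sketch under that assumption: it converts a few rows to U+ notation and computes the distance between each pair. The names `ROWS`, `u_plus`, `name_of` and `offset` are illustrative and not part of the proposal, and the name lookup returns a placeholder when the local Unicode data does not include the character.

```python
import unicodedata

# A few (Chamba, Dogra) rows copied from Table 3; None stands for the dash
# in the Chamba column. Treating the values as code points is an assumption.
ROWS = [
    (71306, 71690),
    (71322, 71706),
    (71335, 71719),
    (None, 71720),
    (71336, 71721),
    (71338, 71723),
]


def u_plus(cp):
    """Format a decimal code point in U+XXXX notation ('-' when absent)."""
    return "-" if cp is None else f"U+{cp:04X}"


def name_of(cp):
    """Look up a character name; older Unicode data may not know it."""
    if cp is None:
        return "(no counterpart)"
    return unicodedata.name(chr(cp), "(name not in this Unicode database)")


def offset(chamba, dogra):
    """Distance between the paired values, or None when there is no pair."""
    return None if chamba is None else dogra - chamba


if __name__ == "__main__":
    for chamba, dogra in ROWS:
        print(u_plus(chamba), u_plus(dogra), offset(chamba, dogra), name_of(dogra))
```

Under this reading the distance between the columns is constant up to the row with no Chamba counterpart and grows by one afterwards, which matches the table showing one Dogra consonant that has no equivalent in the Chamba column.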
ISO/IEC JTC 1/SC 2/WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646

Please fill all the sections A, B and C below. Please read Principles and Procedures Document (P & P) from http://std.dkuug.dk/JTC1/SC2/WG2/docs/principles.html for guidelines and details before filling this form. Please ensure you are using the latest Form from http://std.dkuug.dk/JTC1/SC2/WG2/docs/summaryform.html. See also http://std.dkuug.dk/JTC1/SC2/WG2/docs/roadmaps.html for latest Roadmaps.

A. Administrative

1. Title: Proposal to encode the Dogra script in Unicode
2. Requester's name: Anshuman Pandey, Script Encoding Initiative
3. Requester type (Member body/Liaison/Individual contribution): Liaison contribution
4. Submission date: 4 November 2015
5. Requester's reference (if applicable):
6. Choose one of the following:
   This is a complete proposal: Yes
   (or) More information will be provided later:

B. Technical - General

1. Choose one of the following:
   a. This proposal is for a new script (set of characters): Yes
      Proposed name of script: Dogra
   b. The proposal is for addition of character(s) to an existing block:
      Name of the existing block:
2. Number of characters in proposal: 59
3. Proposed category (select one from below; see section 2.2 of P&P document):
   A-Contemporary
   B.1-Specialized (small collection)
   B.2-Specialized (large collection)
   C-Major extinct
   D-Attested extinct
   E-Minor extinct: X
   F-Archaic Hieroglyphic or Ideographic
   G-Obscure or questionable usage symbols
4. Is a repertoire including character names provided? Yes
   a. If YES, are the names in accordance with the "character naming guidelines" in Annex L of P&P document?
   b. Are the character shapes attached in a legible form suitable for review?
5. Fonts related:
   a. Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the standard? Anshuman Pandey
   b. Identify the party granting a license for use of the font by the editors (include address, e-mail, ftp-site, etc.): Anshuman Pandey
6. References:
   a. Are references (to other character sets, dictionaries, descriptive texts etc.) provided? Yes
   b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? Yes
7. Special encoding issues: Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration etc. (if yes please enclose information)? Yes
8. Additional Information: Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org for such information on other scripts. Also see Unicode Character Database (http://www.unicode.org/reports/tr44/) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard.

Form number: N4502-F (Original 1994-10-14; Revised 1995-01, 1995-04, 1996-04, 1996-08, 1999-03, 2001-05, 2001-09, 2003-11, 2005-01, 2005-09, 2005-10, 2007-03, 2008-05, 2009-11, 2011-03, 2012-01)

C. Technical - Justification

1. Has this proposal for addition of character(s) been submitted before? No
   If YES, explain:
2. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? Yes
   If YES, with whom? Christopher Shackle (SOAS, London)
   If YES, available relevant documents:
3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? No
   Reference: Historical script; currently not used
4. The context of use for the proposed characters (type of use; common or rare): Common
   Reference:
5. Are the proposed characters in current use by the user community? No
   If YES, where? Reference:
6. After giving due considerations to the principles in the P&P document, must the proposed characters be entirely in the BMP? N/A
   If YES, is a rationale provided?
   If YES, reference:
7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? Yes
8. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
9. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
10. Can any of the proposed character(s) be considered to be similar (in appearance or function) to, or could be confused with, an existing character? No
    If YES, is a rationale for its inclusion provided?
    If YES, reference:
11. Does the proposal include use of combining characters and/or use of composite sequences? Yes
    If YES, is a rationale for such use provided? Yes
    If YES, reference: See text of proposal
    Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided?
    If YES, reference:
12. Does the proposal contain characters with any special properties such as control function or similar semantics? Yes
    If YES, describe in detail (include attachment if necessary): Virama; see text of proposal
13. Does the proposal contain any Ideographic compatibility characters? No
    If YES, are the equivalent corresponding unified ideographic characters identified?
    If YES, reference: