A musical structure model and its visual mechanism for hypertext score systems

A Musical Structure Model and Its Visual Mechanism for

Hypertext Score Systems

Yusuke Ariyoshi,1,*

Shinji Shimojo,2 and Hideo Miyahara

1

1Faculty of Engineering Science, Osaka University, Toyonaka, 560-8531 Japan

2Computation Center, Osaka University, Ibaraki, 567-0047 Japan

SUMMARY

In order to perform with consistency a musical score

having multiple parts, as in an orchestra or a chorus, it is

important that the conductor and performers share its mu-

sical semantic structures. However, past studies of the ex-

pression or visualization of musical semantic structure in

the field of music information processing have dealt with

scores having a single voice part. In this paper, we propose

a new electronic musical score system, �HyperScore,�

based on the hypertext data model. Expanding the hypertext

data model and its visualization mechanism, HyperScore

has a flexible semantic structure model and a strong visu-

alization mechanism which help the conductor and per-

formers understand the semantics of the score and

communicate with each other. The musical semantic struc-

ture model consists of three components: phrase�motif�

note tree structures of the musical score, musical structures

such as particular musical forms, and links among these

structures. The visualization mechanism generates various

views of the semantic structure of the music from different

perspectives. The view is generated by a visualization

mechanism specifying four parameters: the XY axes, dis-

play rules, selection, and granularity. The view illustrates

the musical semantics of series of notes clearly by their

position in the view and their display attributes. © 2001

Scripta Technica, Syst Comp Jpn, 32(7): 39�51, 2001

Key words: Hypertext; musical score system; mu-

sical semantic structure; hypermedia; information visuali-

zation.

1. Introduction

Many people consider that the process of creating a

performance consists of practicing the score of a musical

piece. However, in reality, the performance, that is, the plan

for the execution of the various parts piece from the score,

exists before practice begins. Thus, practice and perform-

ance are based on this performance plan. In addition, the

performance plan is refined by using the results of practice

and performance (Fig. 1).

© 2001 Scripta Technica

Systems and Computers in Japan, Vol. 32, No. 7, 2001Translated from Denshi Joho Tsushin Gakkai Ronbunshi, Vol. J81-D-II, No. 9, September 1998, pp. 1937�1948

*Currently affiliated with NEC Human Media Research Laboratories Fig. 1. Process of making performance.

39

The structure of a musical piece, which is the basis

of its performance plan, is a complicated interweaving of

multiple elements such as musical form, rhythmic structure,

and the like. In addition, the structure of the piece is not

clearly indicated in the score and ambiguity is associated

with differences in interpretation of the score. Although the

structure of the piece is usually written into the score, there

are limits to such written expression due to the complexity

and ambiguity noted above.

In a musical piece with multiple parts, such as a piece

for an orchestra or chorus, which is the object of study of

this paper, performance creation involves a division of labor

between the conductor and the performers. First, the con-

ductor creates a performance plan for the entire piece

according to his interpretation of it. Next, the performers of

the parts create detailed performance plans of their parts

based on the musical structure and the conductor�s perform-

ance plan, and the rehearsals and performances are con-

ducted in accordance with these plans.

The conductor needs to impart his knowledge or

interpretation of the musical structure of a piece with mul-

tiple parts to the performers; a performance without consis-

tency results when the musical structure of the piece is not

imparted well, such as when expressions differ between the

voice parts or there is no unity between the conductor and

the performers. On the other hand, the musical structure is

more complicated because of the multiple parts. Thus, the

limitations on expression of the musical structure in a score

with multiple parts are more serious.

In this paper, in order to give support to performance

creation, we propose a musical structure model expressing

a complicated musical structure of a piece and a musical

score hypertext system, called HyperScore, which has a

mechanism for visualizing the musical structure as in the

score.

The musical structure model of HyperScore is an

expanded structured hypertext model based on methods of

analysis of musical pieces, and expresses three elements of

the score structure, namely, musical elements, the logical

structure of the piece, and attribute links. The mechanism

for visualization of the musical piece structure generates a

view display of a complicated musical piece seen from

various points of view by specifying four parameters�se-

lection, granularity, XY axes, and display rules�based on

analyses of the interpretations of the musical piece structure

by the conductor and the performers.

2. Musical Piece Structure Model

In order to support the creation of performance plans,

which is a goal of HyperScore, it is important to express

simply and concisely the parts of the musical piece structure

related to the creation of performance plans. Thus, the

requirements for achieving the above task are analyzed and

then a musical piece structure for flexibly expressing the

structure of the piece is proposed.

2.1. Analyses of musical piece structure

The musical structure, including elements such as the

musical form, harmonic structure, and so on, is extracted

by musical analysis (the entire logical structure of the piece

is called the overall musical structure and is differentiated

from individual musical structures). This musical structure

gives musicological meaning to sequences of notes,

phrases, and the like (Fig. 2). Thus, the performance plan

for the sequence of notes is decided by this musicological

meaning. [1].

The authors interviewed conductors and performers

and surveyed the methods of analyzing a musical piece.

Based on the results, we found that the structures governing

the performance plan are of four kinds: giving musical

meaning to a sequence of notes from musical structures

recognized from the past; giving musical meaning in terms

of elementary musical structures; inclusion relationships of

musical piece elements; and giving relationships by musi-

cal structures.

(1) Giving musical meaning to a sequence of notes

based on the musical structure

For example, in determining a performance plan of a

certain phrase, the performance plan �this soprano phrase

is to be performed so that it clearly stands out, since it is the

beginning of the recurrence of the first theme (musical

meaning)� might be selected.

The performance plan thus is determined by the

structure through which the musical structure gives musical

meaning to a sequence of notes. For example, the type of

musical structure called a musical form directly imparts to

Fig. 2. Analysis of score.

40

a phrase the musical meaning of �recurrence of the first

theme.�

Although the performance plan in this example is not

a detailed plan but merely a plan to �perform so that it

clearly stands out,� in reality, it is incorporated into a more

detailed performance plan by the fact that a musical mean-

ing different from a multiple number of musical structures

is given to one sequence of notes.

(2) Giving meaning to a musical structure in terms of

elementary musical structures

Musical pieces having multiple parts, which are the

targets of HyperScore, frequently have very complicated

musical structures. For example, a symphony can have

several dozen voice parts and more than 100 phrases. Since

the more difficult the performance plan that can be derived,

the larger the number of musical structures, many musical

structures are analyzed in groups and multistage analyses

are performed to create performance plans (simple musical

structures which are results of structural analysis are re-

ferred to below as elemental musical structures).

Taking a simple example, Fig. 3 shows that a musical

structure with a compound ternary form can be decomposed

into elemental musical structures of ternary and binary

form. The performance plan chosen for phrase C1 is one

that combines requirements derived from the musical

meanings of both the third block of the ternary form and the

first block of the binary form.

By performing multistage analyses in this way, the

creation of a performance for a large musical piece is made

simpler by deriving performance plans from the musical

meanings given by the simpler elemental musical struc-

tures, even if the piece has a large number of musical

structures for which performance plans cannot easily be

derived directly.

When multistage analyses are performed, musical

meanings are attached to the original musical structures by

the elementary musical structures, in such a way that the

performance plans can be derived from the musical mean-

ing given to a sequence of notes by musical structures.

Giving musical meaning to a musical structure in terms of

elementary musical structures thus consists of building up

meaningful structures of larger size.

(3) Inclusion relationships of note sequences

Notes, motifs, phrases, and so on, differing in granu-

larity exist in a sequence of notes having musical meaning.

There is an inclusion relationship between a note sequence

of large granularity and a note sequence of small granular-

ity, and a note sequence of large granularity is composed of

note sequences of small granularities.

The musical structures giving musical meaning differ

with the granularity of note sequences. In Fig. 2, a musical

meaning is given to a phrase in terms of three kinds of

musical structure, that is, voice part formation, musical

form, and role. On the other hand, the musical meaning of,

for example, �the third note of harmony I� or �the beginning

note of the phrase� is given at the note level, of small

granularity, in terms of the musical structure of the position

inside the harmony or the position inside the phrase.

In addition, there are cases in which the determina-

tion of a performance plan of a note sequence is influenced

by the performance plans or musical meanings of the note

sequences in the inclusion relationships.

For example, when the meaning of �the third note of

harmony I� is given to a certain note in analyses at the note

level, since the harmony is determined as major or minor

by the musical step of the third note, performing this note

at the correct pitch is required. Thus, the possibility of this

pitch being unstable is eliminated in the performance plan

of the phrase containing this note.

Specifically, a performance plan is not simply deter-

mined by the musical meanings given by the musical struc-

tures, but must be chosen to agree well with the

performance plans or musical meanings of the note se-

quences in the inclusion relationships.

Thus, the structure of the inclusion relationships of

note sequences also influences the determination of a per-

formance plan for a note sequence.

(4) Linking relationships by musical structures

For example, phrase I at the upper left is given the

musical meaning of �soprano, first phrase of A, melody�

from the musical structure of the voice part formation,

musical form, and role, and the phrase below it is given the

meanings of �alto, first phrase of A, harmony.� In addition,

the performance plan of this phrase is chosen so as to

combine the volume and tempo with the melody easily

without allowing it to stand out, since its role is harmony.Fig. 3. Music structure and element music structure.

41

In this example, a performance plan which agrees

well with the performance plan of the note sequence of the

melody, with which it has a linkage relationship in terms of

the musical structure of the role, is selected.

Specifically, the performance plan of a note sequence

is determined not only by the musical meaning of the note

sequence itself, but also by the influences of the perform-

ance plan and the musical meaning of the note sequence

with which this sequence is linked in a structural relation-

ship. Thus, in determining the performance plan of a note

sequence, the structural relationship consisting of the link-

ing between note sequences plays a role.

2.2. Musical piece structure model

The musical structure model is designed on the basis

of a structured hypertext model. The structured hypertext

model has a capability of flexible expression and has al-

ready been used to express the logical structures of docu-

ments or multimedia codings.

The basic elements constituting the hypertext model

are links, expressing the relationships between information,

and nodes, expressing the real body of information. In

structured hypertext, the links or nodes are expanded and

have the structures of a list or a tree.

A purpose of the musical piece structural model is to

express the structure determining the performance plan of

a note sequence in a musical structure. Thus, expressing the

above four kinds of structures is a requirement of the

musical structure model.

The musical structure model is constructed from four

broad elements: musical piece elements, musical piece

logic structure, attribute links, and analysis structure, and it

expresses a frame for determining the performance plan of

a note sequence by these elements.

[Musical piece elements] In the musical piece struc-

ture model, the note sequences which serve as the units in

the creation of the performance plan are expressed by links

of inclusion relationships with the musical piece element

nodes. The musical piece element nodes express note se-

quences having musical meanings, such as phrases and

motifs. The links of inclusion relationships express the

inclusion relationships of note sequences differing in

granularity. In this way the musical piece elements form a

tree structure of musical piece element nodes connected by

the links of inclusion relationships.

[Musical piece theory structure] In the musical piece

structure model, the musical structures such as the musical

form are expressed by links expressing the hierarchical

relationships between musical attribute nodes and attribute

values. The musical attribute nodes express the musical

meanings given to the note sequences, and the hierarchical

relationship links express the relationships between the

musical meanings.

For example, Fig. 5 shows the logical structure of a

musical piece composed of four mixed voice parts. The four

mixed voice parts are composed of male voice parts and

female voice parts, the male voice parts are composed of

the tenor and base parts, and the female voice parts are

composed of the soprano and alto parts. In this example,

the musical meanings of the male and female voice parts or

of the four voice parts are expressed by the musical attribute

nodes, and the male voice parts are expressed by linking the

musical attribute nodes in the relationships of the tenor and

base compositions by hierarchical relationship links. Thus,

the musical structure has a tree structure consisting of

musical attribute nodes connected by the hierarchical rela-

tionship links.

[Attribute linking] In the musical piece structure

model, the relationships giving musical meaning are ex-

pressed by attribute linking links. Figure 6 shows an exam-

ple of linking a meaning to a note sequence from the

musical structure. The attribute links are drawn from the

musical attribute nodes in the musical structures of the

musical form, the voice part formation, and the role to the

musical element nodes expressing phrases. Figure 7 is an

example of linking meanings in multistage analysis. The

attribute-linking links are drawn from the musical attribute

nodes of the elementary musical structures of binary form

and ternary form to musical attribute nodes of the musical

structures of the compound ternary form.

Fig. 4. Example of musical element. Fig. 5. Logical structure (parts formation).

42

[Analysis structure] The musical piece structure of

the results of musical piece analyses may differ for the same

musical piece due to differences in the interpretations and

in schemes of analysis of conductors. In the musical piece

model, the differences in analyses by conductors are ex-

pressed by the analysis structure.

The analysis structure expresses which musical struc-

tures attach meaning to individual granularity levels. It also

indicates the granularity levels of note sequences analyzed

by the conductor, how musical structures are decomposed

into elementary logical structures in multistage analyses by

links expressing the relationships between nodes repre-

senting granularity, logical structure, and elementary logi-

cal structure, and the relationships between the musical

structures in multistage analyses. Figure 8 shows the ana-

lytical structure of a musical piece. This analysis is done at

two levels of granularity, phrases and notes; phrases are

analyzed from the three points of view of their role, form,

and part formation; and notes are analyzed in terms of the

two musical structures of chords and meter. Some musical

structures are decomposed into elementary musical struc-

tures by multistage analyses.

[Satisfying requirements] In the musical piece struc-

ture model, the four kinds of structures determining a

performance plan of a note sequence are expressed as

follows.

x Linking a musical meaning to a note sequence in

terms of a musical structure is expressed by draw-

ing attribute-linking links to the musical piece

elements from the musical attributes of the logic

structure.

x Giving a musical meaning to a musical structure

in terms of elementary musical structures is ex-

pressed by drawing attribute-linking links to the

musical attributes within the logical structure

from the musical attributes within the elementary

logic structure.

x The inclusion relationships of a note sequence are

expressed by the �part of � link.

x Linkage relationships of musical structures are

expressed by the logical structure and the attrib-

ute-linking links.

Fig. 6. Attribute link.

Fig. 7. Multilevel attribute linking. Fig. 8. Analysis structure.

43

3. Visualization

3.1. Requirements for visualization

It is desirable that the visualization mechanism of the

musical piece structure satisfy two requirements in order to

support the creation of a performance plan, which is the

purpose of HyperScore.

The first requirement is that visualization be done

corresponding to the interests of different people. The parts

of the musical piece structure of interest differ depending

on the roles of persons involved in their performance such

as the conductor or the performer.

For example, in order to decide when to send which

signal to which part, a conductor works with the constitu-

tion of the musical piece in which the main theme is relayed

between the voice parts. On the other hand, the performer

leads the performance during the main melody, but must be

in harmony with the main melody by adjusting both tempo

and volume at other times. Thus, a performer must note the

role of his voice part and know which voice part has the

main melody. Thus, as the roles of the people involved

differ, the parts on which they focus in the musical piece

structure differ.

In addition, even the same performer must pay atten-

tion to the attributes on the level of musical elements of

large granularity in order to grasp the outline of the piece

structure, but must also pay attention to the attributes of

musical piece elements of fine granularity in order to un-

derstand the details. Thus, the parts of the musical piece

structure receiving the main focus differ with the persons

involved.

The authors consider these differences in main focus

not only as differences in the parts of the score focused on,

but also as differences in the granularity of the main focus,

the logic structures that confer the attributes, and so on.

Thus, the visualization mechanism must be able to desig-

nate the logic structure and the granularity of the main focus

in accordance with different viewpoints in order to meet the

needs of different persons and cases.

The second requirement is that the visualization

mechanism visualizes the relationships with the environ-

ment in addition to the attributes of the musical elements

themselves. In determining a performance plan for the

musical elements, there is a need for a kind of context

information which gives not only the attributes of the

musical elements themselves, but also their relationships

with the surrounding musical elements, and the nature of

the musical elements surrounding the musical elements that

are the focus.

This requirement corresponds to the visualization of

the third and fourth structures determining the performance

plan of a note sequence discussed in Section 2.1. In the case

of the inclusion relationships of the third musical element,

the performance plan is influenced not only by the attributes

of the musical elements that are the focus but also by the

performance plans and attributes of the surrounding musi-

cal elements which are in inclusion relationships. In addi-

tion, in the case of a linkage relationship involving the

fourth musical structure, a relationship which easily com-

bines tempo, volume, and so on with the melody phrase is

selected as the performance plan of the harmony phrase;

this case is an example of the performance divisions being

influenced not only by the attributes of the musical ele-

ments themselves, but also by the attributes and perform-

ance plans of the surrounding musical elements.

Thus, context information is important in determin-

ing the performance of musical elements. The visualization

mechanism consequently needs to visualize not only the

attributes of the musical elements under focus, but also the

surrounding context information.

In addition to the above two requirements, since the

musician using the system is assumed not to have computer

expertise, HyperScore has been designed by expanding a

score system which is easily understood (and readily be-

comes familiar) by a user, without using metaphors far

removed from the metaphors of a score.

3.2. Visualization mechanism

In a database, a part of the logic data structure (con-

cept schema) modeling a target score singled out in the

specific viewpoint of a specific user is called a view. Cor-

respondingly a display visualizing a part of the musical

structure selected so as to correspond to the viewpoint of a

user is referred to as a view in this paper.

The visualization mechanism of HyperScore satisfies

the above two requirements by generating a view corre-

sponding to the viewpoint of the user with respect to the

musical structure by designating four parameters: the XY

axes, display rules, selection, and granularity.

x XY axes: Designate the musical logical structure

corresponding to the X axis and Y axis of the view

display. The arrangement of the musical elements

on the view display is determined by this. For

example, in an ordinary score, the Y axis shows

the voice part formation, and if the role is shown,

a view display assembled in stages of melody,

harmony, and rhythm is generated.

x Display rules: Designate the correspondence of a

display attribute and an attribute of a musical

element. The musical logic structure which is not

arranged on the XY axes can be visualized by this

means. For example, by linking correspondences

44

between the role of a phrase and the color of a note,

the role of a phrase can be visualized.

x Selection: Designates which musical elements are

to be displayed on a view display. For example, it

can be specified that only the musical elements

having the attribute of melody be displayed.

x Granularity: Designates the degree of detail of a

display. For example, if one wants to know the

outline of the entire piece, the phrase can be the

minimal unit of a display, while if one wants to see

the details of a part, the note can be the minimal

unit of a display.

The visualization mechanism designates the musical

element under focus by selection and granularity, visualizes

the attributes under focus in terms of the positions of the

musical elements on the X and Y display axes, and visual-

izes the musical logical structure which has not been allo-

cated to the X and Y axes by using the display attributes.

Thus, a view display combining the elements of the user

focus is generated, satisfying the first requirement.

With respect to showing the context information,

which is the second requirement, the arrangement of the

attributes of the musical elements on a view display is

shown by the designations of the X and Y axes, expressing

the positional relationships of the attributes in the musical

logical structure. In addition, the musical attributes of the

musical elements can be shown by display attributes such

as the tone of a note by means of a display rule. Specifically,

a view display generated is transmitted to the user by

visualization of the context information in terms of the

display attributes and the spatial arrangement in the view.

4. Implementation of Prototype System

A prototype of HyperScore implemented on the basis

of the musical structure model and the visualization mecha-

nism discussed above is described here.

4.1. Data model

In the data model of the prototype system, a musical

structure is expressed by two kinds of nodes and four kinds

of links.

The two kinds of nodes are the region and the domain.

x The region is a node expressing the real body of

information and the classification attribute values,

and giving the musical elements and the musical

attributes.

x The domain is a node expressing the viewpoint of

the analysis, and expresses the elementary logical

structure, the musical logical structure, and the

granularity level of a musical element in the analy-

sis structure.

In earlier structured hypertext, the nodes expressing

the real body of information and the nodes expressing the

classification attribute values were separate entities. How-

ever, a node of a musical attribute is the starting point of the

attribute linking between a musical meaning and a note

sequence in terms of musical structure, and is the destina-

tion of the attribute linking between a meaning and a

musical structure in terms of the elementary musical struc-

tures. Thus, the object of the starting point and the destina-

tion of the attribute linking in the data model of the

prototype system are expressed by the same class.

The four kinds of links are: �part of,� attribute link-

ing, �member of,� and construction.

x The �part of � link expresses the hierarchical rela-

tionships between the musical attributes in the

musical logical structure, and the inclusion rela-

tionships between musical element nodes in a

musical structure and the musical attributes.

x The attribute linking is a link representing a rela-

tionship between the real body of information and

the attribute values, and expressing the meaning-

ful linking relationships.

x The construction link is a link between domains

expressing the analysis structure.

x The �member of� link expresses the relationships

of domains and the regions belonging to the do-

mains. For example, such a link connects the

domain of voice part formation and the region of

each voice part.

The data model and the classes of this prototype

system can be expressed as in Fig. 9.

4.2. System outline

The prototype system was implemented in Objective-

C on NextStep. In addition, the score system Notation [2]

on NextStep was used to delineate the score.

Fig. 9. Scheme of musical structure model.

45

The prototype system consisted of three modules: a

GUI system, a view management system, and a data man-

agement system as shown in Fig. 10.

The GUI system receives the parameters for generat-

ing a view display from the user and produces the view

display.

Figure 11 is an example of the screen of the GUI

system. Window 1 is the window displaying the musical

logic structure. The musical logic structure is shown in tree

form for each window, and the selection and granularity can

be specified. The node showing the attribute under focus is

selected and the musical element having this attribute is

presented on the view display. In addition, the granularity

is controlled by selecting either a node close to the root or

a leaf of the musical logic structure. For example, if the

child nodes of main melody, secondary melody, and counter

melody are selected without the node �melody� being

selected in the musical logic structure of the �role,� the role

of the melody can be displayed in detail.

Window 2 is the window designating the XY axes

and the display rules. The attributes selected in window 1

Fig. 10. System configuration.

Fig. 11. An example of a view (1).

Table 1. Display attributes

46

are displayed for each musical logical structure. If a display

attribute is designated by selecting an attribute among

these, the display rule relating the entity attribute and the

display attribute is established. In the prototype, the display

attributes in Table 1 can be designated.

Windows 3 and 4 are the view displays generated,

with window 3 shown at phrase granularity and window 4

shown at note granularity.

Figure 12 is an expansion of the lower right view

display window of Fig. 11 and shows different sizes of the

frame lines and the notes depending on the roles of the

phrases.

The view management system assembles a query for

the extraction of musical elements corresponding to the

input parameters by selection and granularity and assigns a

search to the data management system. It then delivers the

presentation commands assembled from the search results,

together with the XY axes and the display rules, to the GUI

system.

The data management system is a module performing

management of the musical structure and the score infor-

mation based on the proposed model. This is a simple

database satisfying search requests from the view manage-

ment system and the input and output into and out of the

musical structure file.

5. Past Studies

5.1. Performance creation support

INTERPRET [3] and Muse [4] are past efforts to

support performance creation. INTERPRET is a tutor sys-

tem which conversationally instructs beginners in the meth-

ods of performing a musical piece. The performing methods

or performance plans produced by INTERPRET are based

on musical structures. However, these musical structures

are the melody grouping structure* [1], the voice parts of

partial chord structures determining the grouping structure

are single voices, and only simple musical structures com-

pared with the complicated musical structures of the mul-

tiple voice musical pieces are dealt with. In addition, in the

function giving the musical structures and performance

Fig. 12. An example of a view (2).

*The grouping structure corresponds to the structure combining the inclu-

sion relationships of the musical piece elements and the musical forms of

this paper.

47

plans to a user, graphical displays of musical staves or piano

rolls are used as supplements to the external information on

the score, and no visualization of the musical structures is

performed.

Muse is an electronic score display for conductors

and performers of an orchestra, which presents a score

incorporated into a part of the display board. Since Muse

converts a score into part scores, it is considered to have an

internal structure consisting of voice parts, but the musical

scores for supporting performance creation are not treated

thoroughly.

As regards studies associated with the expression of

the musical structures other than support for performance

creation, 17 systems for treating individual logic structures

such as musical structures or chord structures are intro-

duced in Ref. 5. However, none of these systems treat

multiple logical structures. There are musical languages

such as SMDL [6] and MODE [7] that have functions

treating multiple logical structures. SMDL is a markup

language for music based on HyTime [8]. Although SMDL

is designed to treat multiple logical structures, no system

actually treating multiple logical structures has been re-

ported. MODE is a Smalltalk class library and tool group

for music and can create tools dealing with multiple logical

structures by using this class library. However, currently,

only tools dealing with one logical structure have been

reported.

5.2. Hypertext model

The basis of a hypertext model is a node link model

which directly connects two nodes with a one-to-one link.

The musical structure model of this paper, the frame-related

axial model [9, 10], the set-to-set model [11], and others

use hypertext models of expanded structure so as to assign

attributes to nodes and express the relationships between

the nodes in terms of the relationships between the attribute

values. These expanded models are characterized by the

ability to express multiple links in a one-to-one link model

in terms of the relationships between single attribute values

by abstracting the links by the attribute values in one step.

For example, in the frame-related axial model, the relation-

ships between the nodes are expressed in terms of order

relationships of attribute values called relationship axes

(Fig. 13). In addition, in the set-to-set model, the relation-

ships between the attribute values are expressed in a tree

structure called a facet (Fig. 14).

Among the three kinds of attribute-linking relation-

ships expressed by the musical structure model, the first

attribute-linking relationship for musical elements in terms

of musical logic structures expresses the musical elements

as nodes, the attributes as attributes, and the musical logical

structures as facets by the set-to-set model.

However, the second attribute linking for the musical

logic structures in terms of the element logical structures

cannot be easily performed by the earlier models. In the

earlier models, the side assigning attributes and the side

assigned attributes are expressed by separate kinds of

nodes. However, although the nodes expressing the attrib-

utes among the musical logic structures assign attributes to

the musical elements, conversely, the attributes are assigned

from the musical logic structures. Thus, in this paper,

regions are introduced as nodes which can assign and can

be assigned attributes, and the second attribute-linking re-

lationships can be expressed by expressing attributes in

terms of regions.

In addition, the third inclusion relationships between

the musical elements are expressed by the �part of � link

between the musical elements in the musical structure

model, and the �is a� and �part of � links can express

inclusion relationships between nodes in the set-to-set

model. However, in the set-to-set model, both documents

and subdocuments are attribute-linked from the same facet

and thus are not differentiated in terms of attribute linking.

On the other hand, in the musical structures, the musical

Fig. 13. Frame�axis model.

Fig. 14. Set-to-set model.

48

elements are differentiated from the musical logic struc-

tures which are attribute-linked by granularity. In order to

express this distinction, domains exist for the granularity

and the musical logic structures or element logic structures,

and the relationships of the regions are expressed by �mem-

ber of � links and a musical element of a particular granu-

larity is attribute-linked in terms of the musical logical

structure that is expressed by the construction link.

In musical analyses, there are quite a number of

attributes having the same names but different musical

logical structures, and expressing the relationships of the

regions to the musical logic structures in terms of the

domains also prevents these attributes with the same names

from being transferred differently from a conductor to a

performer. For example, for the tenor voice part, a tenor is

in the high register of a male chorus but in the middle

register of a mixed four-part chorus, and these factors

influence the selection of the performance plan differently.

However, different transfers of the same names occur fre-

quently. In the musical structure model, such confusion

does not easily occur, since the domains clearly express the

subordination relationships of the regions and the musical

logic structures.

5.3. Visualization

Here, the existing models and the model of this paper

are compared from the viewpoints of context information

and the designation of the user viewpoint regarding the

visualization of a structured hypertext.

Although the set-to-set model does not have a visu-

alizing function, it can extract only those notes having

specific attribute values from a node set and express the user

viewpoint by these extraction conditions. In addition, it

allows traverse operations, by which the set of extracted

nodes is gradually changed by altering part of the extracting

conditions. For example, there are four kinds of traverse

operations recognizing viewpoints: the hierarchical zoom

in and zoom out, which change the degree of detail of a

viewpoint by changing the attribute values of the conditions

to a lower level or a higher level value in the facet, and the

zoom in and zoom out, which narrow or broaden the view-

point by adding or eliminating a facet of conditions.

In the frame�axis model, a visualization mechanism

called the hyperchart, by which the nodes are placed on

graphs or table charts using the order information of the

relationship axes, has been proposed. The user viewpoint is

designated by selecting the relationship axes. Thus, a hy-

perchart* in which the selected relationship axes are placed

in correspondence with the XY axes is generated.

The visualization mechanism of this paper produces

a view display by designating a viewpoint in terms of four

parameters: the XY axes, display rules, selection, and

granularity.

The three models compared from the point of view

of the designation of the user viewpoint show the following

three differences.

Number of axes: Only three sets of relationship axes

can be visualized by the hyperchart, even if three-dimen-

sional display charts are used. However, an arbitrary num-

ber of facets can be used for conditions in the set-to-set

model. Since the visualization mechanism described in this

paper can visualize musical logical structures which cannot

be visualized by the XY axes by allocating them to display

attributes by means of display rules, it can visualize more

musical logical structures.

Degree of detail: There is no function for varying the

degree of detail in the hyperchart, and a part cannot be

magnified to receive special attention while the whole

hypertext structure is being looked at. However, the degree

of detail of a viewpoint can be designated by the hierarchi-

cal zoom in and zoom out operations in the set-to-set model.

The visualization mechanism described in this paper can

designate the degree of detail by means of the granularity

parameter.

Complexity of display: In the hyperchart, all of the

nodes loaded on the relationship axes designated by the

attribute values end up being displayed. Thus, a screen can

easily become complicated, and the parts of interest can be

differentiated only to a small extent. However, since the

attribute values of the nodes to be extracted are designated

by the set-to-set model, the nodes which do not have the

designated attribute values are not displayed. Since the

visualization mechanism of this paper also designates the

attributes of the musical elements to be displayed, the

musical elements which do not have the designated attrib-

utes are not displayed, making it difficult for a screen to

become complicated.

We next discuss the context information facility,

which gives information on the other nodes that surround

the node of interest.

Although the importance of the context information

is indicated in the set-to-set model, special visualization is

not performed for the set of nodes extracted by conditions.

Since the relationships between the nodes and the

attribute values of the nodes are expressed on a chart in the

hyperchart representation, the context information is ex-

pressed by distance relationships or positional relationships

between the node of interest and the surrounding nodes,

making visual understanding possible.

The visualization mechanism of this paper is the same

as that of the hyperchart, and it expresses the context

information designated by the XY axes in terms of distance

relationships or the positional relationships between the

*A hyperchart performing visualization by statistically processing attrib-

ute values as application charts has also been proposed.

49

node of interest and the surrounding nodes, making visual

understanding possible. In addition, the attributes of the

nodes can be expressed not only by the positions but also

by display attributes by appropriate specification of the

display rules.

6. Evaluations

In order to evaluate the prototype system, music

specialists including conductors and instructors were re-

quested to trial-use it. We obtained the following results.

x To users who have knowledge of musical struc-

tures, having studied musicology, the prototype

provides a good supporting function for the work

of establishing performance plans through an un-

derstanding of the musical structures.

x Even for people without knowledge of musicol-

ogy, the prototype is an effective supporting tool

for learning the methods of establishing perform-

ance plans or musical analyses.

x The prototype would be more effective if, for

example, a confirmation function capable of pro-

ducing the actual sound of a phrase were provided

in addition to visualizing.

In addition, not only conductors, but also composers

and performers had favorable responses to the prototype in

terms of the performance of demonstrations, and they gave

especially high evaluations to the function of emphasis-

displaying musical structures on a score and the function for

specifying the display rules of the visualization mechanism.

Thus, the following items should be improved and

evaluated in the future.

x In addition to the visualization mechanism, the

function for showing musical structures should be

strengthened by adding an audio mechanism by

allocating attributes to tone and volume.

x Since the visualization mechanism of the present

prototype system deals with only numbers 1, 3,

and 4 among the four kinds of structures determin-

ing the performance plan of a sequence of notes

discussed in Section 2.1, the linking of a meaning

to a musical structure from the elementary musical

structures by the multistep analyses of type num-

ber 2 should be dealt with.

In addition, only a tree structure is handled as the

topology of the musical elements and the musical logic

structures in the musical structure model. However, when

the pitch of the final note of a note sequence and the last

note of the next sequence of notes is the same, there is a

technique by which both notes are combined to bring out

the feeling of continuity of the flow of music. The cases in

which there are overlaps cannot be expressed by the musical

structure model in its current state. However, it is known

that these cases can be handled [1] by linking the score in

which the notes are separated before applying overlaps with

the score after the overlaps have been applied. Thus, the

musical structure model is expected to be expanded so that

overlaps can be handled. Since in the visualization, the

musician user is assumed to a nonexpert in computer utili-

zation, its expansion is designed such that it is based on a

score system which is easily mastered by the user without

using one removed from the musical score metaphor. How-

ever, applications of advanced visualization techniques,

separated from the traditional musical expressions by five-

line scores and the like, are planned as in Ref. 13, and will

not be confined in scope to expanding a score system.

7. Conclusions

HyperScore, which visualizes musical structures, is

intended as a supporting tool in creating the performance

of a musical piece of multiple voice parts. First, a musical

structure model which has been expanded from a structured

hypertext model and a visualization mechanism is pro-

posed, then the implementation of a prototype system is

discussed.

The musical structure model simply and cleanly ex-

presses the musical semantic structure associated with the

creation of a performance plan. This model is constructed

from: tree structures expressing the hierarchical structures

of the musical elements such as the phrases, motifs, and

notes; the tree structures expressing the musical logic struc-

tures of the musical form, voice part formation, and so on;

and the attribute-linking links expressing the relationships

of the above structures. The visualization mechanism dis-

plays complicated musical structures in easily understood

form by generating various views of the musical semantic

structure from different perspectives. The view is generated

by the visualization mechanism with four parameters: XY

axes, display rules, selection, and granularity. The view

illustrates the musical semantics of series of notes clearly

by their position on the view display and their display

attributes.

The logical structure of a score of multiple parts

treated in this paper has a time axis, and multiple voice parts

exist simultaneously in the structure. These features are

similar to multimedia content with a time axis, where

multiple media are present simultaneously. On the other

hand, HyperScore can treat a score on the logical structure

level. In order to make a computer into a tool for handling

multimedia, it is important that the multimedia be treated

on the logical structure level, as in the outline editing of

50

word processors. Thus, a study applying the handling of

multiple voice part pieces in terms of logical structure as

described in this study to the handling of multimedia on the

logical structural level is being planned.

REFERENCES

1. Lerdahl F, Jackendoff R. A generative theory of tonal

music. MIT Press; 1983.

2. Diener G. Ttrees: A tool for the compositional envi-

ronment. In: Pope S (editor). The well-tempered ob-

ject. MIT Press; 1991. p 157�168.

3. Backer M. Design of an intelligent tutoring system

for musical structure and interpretation. In: Balaban

M, et al. (editors). Understanding music with AI. MIT

Press; 1992. p 466�489.

4. Graeffe C, Wahila D, Maguire J, Dasna O. Muse

digital music stand for symphony musicians. ACM

Interact 1996;III:27�34.

5. Hewlett W. Musical information. In: Hewlett W, Sel-

frideg-Field E (editors). Computing in musicology,

Vol. 8. CCARH, California; 1992. p 61�134.

6. ISO. Information technology�Standard music de-

scription language (SMDL). ISO DIS 10743, 1995.

7. Pope S. Introduction to MODE: The musical object

development environment. In: Pope S (editor). The

well-tempered object. MIT Press; 1991. p 157�168.

8. ISO. Information technology hypermedia/time-

based structuring language (HyTime), 1st ed. ISO

10744, 1994.

9. Masuda Y, Ishitobi Y, Ueda M. Frame�axis model for

automatic information organizing and spatial naviga-

tion. Proc ECHT�94, Edinburgh, p 146�157.

10. Ueda M, Masuda Y, Ishitobi Y. CastingNet: Hyper-

media for organizing a visually browsing informa-

tion. Computer Software 1995;12:56�70.

11. Hara Y, Kasahara Y. A set-to-set linking strategy for

hypertext systems. Proc COIS90, Massachusetts, p

131�135.

12. Rumbaugh J. Object-oriented modeling and design.

Prentice�Hall; 1991.

13. Hiraga R, Igarashi S, Matsuura Y. Integrated perform-

ance visualization system. J Inf Process Soc

1997;38:2391�2397.

AUTHORS (from left to right)

Yusuke Ariyoshi graduated from the Department of Information Engineering, Faculty of Engineering Science, Osaka

University, in 1990 and was in the doctoral program until 1995, when he joined NEC. He currently works in the Human Media

Research Laboratories of that company. As a student, he studied multimedia systems and musical information processing. His

current research interests include information filtering.

Shinji Shimojo (member) graduated from the Department of Information Engineering, Faculty of Engineering Science,

Osaka University, in 1981 and completed the doctoral program in 1986. He then joined Osaka University as a research associate

(Faculty of Engineering Science), and subsequently was appointed a lecturer, associate professor, and professor in the

Computation Center in 1989, 1991, and 1998, respectively. His research interests include performance evaluations of LAN

access systems, performance evaluations of discrete processing systems, and discrete operating systems.

Hideo Miyahara (member) graduated from the Department of Communications Engineering, Faculty of Engineering,

Osaka University, in 1967 and completed the doctoral program in 1972. He became an associate professor on the Faculty of

Engineering Science, Osaka University, in 1980. He is currently a professor in the Department of Engineering Science Research.

His research areas include high-speed networks and multimedia. He was the recipient of a 1990 Paper Award, and is an IEEE

Fellow.

51

A musical structure model and its visual mechanism for hypertext score systems

Documents

Transcript of A musical structure model and its visual mechanism for hypertext score systems