<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>The Music Note Ontology</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Andrea Poltronieri</string-name>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>FICLIT, University of Bologna</institution>
          ,
          <addr-line>Bologna</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>LILEC, University of Bologna</institution>
          ,
          <addr-line>Bologna</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>In this paper we propose the Music Note Ontology, an ontology for modelling music notes and their realisation. The ontology addresses the relation between a note represented in a symbolic representation system and its realisation, i.e. a musical performance. This work therefore aims to solve the modelling and representation issues that arise when analysing the relationships between abstract symbolic features and the corresponding physical features of an audio signal. The ontology is composed of three different Ontology Design Patterns (ODPs), which model the structure of the score (Score Part Pattern), the note in the symbolic notation (Music Note Pattern) and its realisation (Musical Object Pattern).</p>
      </abstract>
      <kwd-group>
        <kwd>Ontology Design Patterns</kwd>
        <kwd>Computational Musicology</kwd>
        <kwd>Computer Music</kwd>
        <kwd>Music Information Retrieval</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        A music note is defined as "the marks or signs by which music is put on paper. Hence the word is used for the sounds represented by the notes" [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. However, musical notation provides only general information on how to play a certain note. In the context of a performance, these indications are enriched by a large amount of information coming, for example, from the musician's sensitivity or the conductor's instructions. Historically, music notation has been a crucial innovation that allowed the study of music and hence the rise of musicology. Music scores were initially introduced for the primary purpose of recording music, giving other musicians the possibility of playing the same piece in turn, thus reproducing it. By definition, however, musical notation entails expressive constraints on the description of musical content.
      </p>
      <p>Copyright © 2021 for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0). This project has received funding from the European Union's Horizon 2020 research and innovation programme under grant agreement No 101004746. Corresponding author: andrea.poltronieri2@unibo.it</p>
      <p>
        Representing music involves a number of issues that are closely related to the complexity of the musical content. This complexity can be found in every system of music representation, from music scores to their various forms of digital encoding, usually referred to as symbolic representation systems. As Cook observed, music scores symbolise rather than represent music: people do not play musical rhythms exactly as written, and often do not play the pitches as written either, because the notation is only an approximation [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. To underline this concept, it is useful to draw a distinction between a score and a musical object [
        <xref ref-type="bibr" rid="ref26">26</xref>
        ]. While the former can be thought of as a set of instructions that a musician or a computer system uses to realise a piece of music, a musical object is the result of this realisation process. In other words, musical notation precedes realisation or interpretation and only partially defines the musical object [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ].
      </p>
      <p>
        Music notation is not just a mechanical transformation of performance
information. Performance nuances are lost when moving from performance to
notation, and symbolic structure is partly lost in the translation from notation
to performance [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. Moreover, in music notation some features of musical
content are completely overlooked. For example, timbre information is generally
ignored and is at best provided through generic information about the
instrument that is supposed to play the part. Many of these problems have not been
overcome by the numerous systems of symbolic representation proposed in recent
decades. Music representation systems (MSR) have been evaluated by Wiggins
et al. [
        <xref ref-type="bibr" rid="ref26">26</xref>
        ] using a system based on two orthogonal dimensions. These two
dimensions have been named as expressive completeness and structural generality.
Expressive completeness is described as the extent to which the original musical
content may be retrieved and/or recreated from its representation [
        <xref ref-type="bibr" rid="ref25">25</xref>
        ].
Structural generality, instead, refers to the range of high-level structures that can
be represented and manipulated [
        <xref ref-type="bibr" rid="ref26">26</xref>
        ]. For example, raw audio is very expressive
since it contains a great deal of information about a certain music performance;
but a MIDI representation contains much less information, e.g. timbral features
are not represented in this format. Conversely, when evaluating structural generality, raw audio performs poorly, due to the difficulty of extracting structured information (e.g., tempo, chords, notes, etc.) from such a format.
      </p>
      <p>
        Another problem in representing music is linked to the twofold nature of
musical content, which contains information that can be reduced to mathematical
functions, but also information related to the emotional spectrum and a wide
range of psychoacoustic nuances. In fact, music is distinguished by the presence
of many relationships that can be treated mathematically, including rhythm and
harmony. However, elements such as tension, expectancy, and emotion are less
prone to mathematical treatment [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ].
      </p>
      <p>
        The limits of symbolic representations are further accentuated when the performance is considered in relation to the concept of interpretation. Each composition admits several performances, and each performance corresponds to a different interpretation of the piece. These interpretations can also vary greatly from one another from the agogic (note accentuation, [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]), timbral and dynamic viewpoints.
      </p>
      <p>Therefore, this work aims to represent musical notes both symbolically and as
a realisation of the note itself. The alignment between these two representation
systems can be relevant on several grounds. For example, it can allow musical
analysis using a structured representation (i.e. a symbolic representation) while
simultaneously taking into account all the information that is only contained in
the signal (e.g. timbre information). This would allow a different level of analysis, taking into account information that is usually overlooked in music scores and symbolic representations. On the other hand, this type of representation would allow structured analysis of the music signal, by encoding the signal information in strict relation to the music score and the multiple hierarchies that the music score entails.</p>
      <p>Furthermore, this type of representation allows the analysis of different realisations of the same note, also providing information on how a score is performed differently in different performances.</p>
    </sec>
    <sec id="sec-2">
      <title>Physical Features and Perceptual Features</title>
      <p>The features that need to be taken into account when representing music can be reduced to four main categories: tempo, pitch, timbre and dynamics. All of these concepts, however, refer to music perception, and symbolic representations only capture abstractions of these features. In contrast, a musical performance, as analysed in recorded form (i.e. a signal representation), contains information that refers to physical characteristics. Through the analysis of these characteristics, it is in turn possible to abstract the perceptual characteristics aroused by a particular sound. However, this work does not aim to analyse the perceptual aspects of music, but rather to formalise a representation that expresses a musical note from the points of view of both scores and musical objects.</p>
      <p>
        As far as time is concerned, the main distinction is that between the quantised time of a symbolic representation and real time, which is expressed in seconds. A symbolic representation describes temporal information using predetermined durations for each note (e.g., quarter notes, eighth notes, sixteenth notes, etc.). These durations are performed in relation to a time signature and a tempo, the latter indicated either by a tempo marking (e.g., slow, moderate, allegro, vivace) or by a metronome mark, which specifies the tempo in beats per minute (bpm). Generally, symbolic notations (e.g. MusicXML) use the same expedients as classical musical notation. However, some representation systems, such as the MIDI standard, rely on real time [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ].
      </p>
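      <p>As a concrete illustration of the quantised/real-time distinction, the following minimal Python sketch (not part of the ontology; the function and parameter names are ours) converts a symbolic duration into seconds given a metronome mark:</p>

```python
# Illustrative sketch: mapping quantised symbolic durations to real time.
# note_value and beat_unit are fractions of a whole note
# (quarter = 0.25, eighth = 0.125, ...); bpm is the metronome mark.

def duration_in_seconds(note_value: float, bpm: float, beat_unit: float = 0.25) -> float:
    seconds_per_beat = 60.0 / bpm
    return (note_value / beat_unit) * seconds_per_beat

# A quarter note at 120 bpm lasts half a second,
# and a whole note at 120 bpm lasts two seconds.
print(duration_in_seconds(0.25, 120))  # 0.5
print(duration_in_seconds(1.0, 120))   # 2.0
```

      <p>A MIDI-like representation stores the right-hand side of this mapping directly, which is why the symbolic duration cannot be recovered from it without assuming a tempo and a beat unit.</p>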
      <p>
        When considering frequency, the representation issue is more challenging. A sound, such as a played musical note, is usually associated with a pitch. However, although pitch is related to frequency [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ], it is a
psychological percept that does not map straightforwardly onto physical properties of
sound [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]. The American National Standards Institute (ANSI) defines pitch as the "auditory attribute of sound according to which sounds can be ordered on a scale from low to high" [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. In fact, in many languages pitch is described using terms with a spatial connotation, such as "high" and "low" [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ].
However, the spatial arrangement of sounds has been shown to be influenced by the listener's musical training [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ] and their ethnic group [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. The relation
between pitch and frequency is evident in the case of pure tones [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ]. For example, a sinusoid with a frequency of 440 Hz corresponds to the pitch A4. However, real sounds are not composed of a single pure tone with a unique and well-defined frequency. Playing a single note on an instrument may result in a complex sound that contains a mixture of different frequencies changing over time. Intuitively, such a musical tone can be described as a superposition of pure tones or sinusoids, each with its own frequency of vibration, amplitude, and phase. A partial is any one of the sinusoids by which a musical tone is described
[
        <xref ref-type="bibr" rid="ref17">17</xref>
        ]. The frequency of the lowest partial is defined as the fundamental frequency. Most instruments produce harmonic sounds, i.e. sounds composed of frequencies close to integer multiples of the fundamental (harmonic partials). However, some instruments, such as xylophones, bells, and gongs, produce inharmonic sounds, composed of frequencies that are not multiples of the fundamental. As a result, the pitch perception of these sounds is distorted for a listener who is unfamiliar with these instruments [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]. Several
approaches have been proposed to determine the perceived pitch of these sounds,
based both on the periodicity of the frequencies that compose the sound [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ] and
on spectral clues [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. However, recent research suggests that pitch perception
involves learning to recognise a sound timbre over a range of frequencies, and then
associating changes in frequency with visuospatial and kinesthetic dimensions of
an instrument [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ].
      </p>
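      <p>The pure-tone correspondence mentioned above can be sketched as follows; this is the standard twelve-tone equal-temperament mapping (A4 = 440 Hz, MIDI note 69), not something defined by the ontology, and the function names are ours:</p>

```python
# Illustrative sketch of the pitch/frequency correspondence for pure tones,
# using twelve-tone equal temperament with A4 = 440 Hz (MIDI note 69).

def midi_to_frequency(midi_pitch: int) -> float:
    return 440.0 * 2.0 ** ((midi_pitch - 69) / 12)

def harmonic_partials(f0: float, n: int) -> list:
    """First n partials of an ideally harmonic tone: integer multiples of f0."""
    return [f0 * k for k in range(1, n + 1)]

print(midi_to_frequency(69))              # 440.0 (A4)
print(round(midi_to_frequency(60), 2))    # 261.63 (C4)
print(harmonic_partials(440.0, 3))        # [440.0, 880.0, 1320.0]
```

      <p>Real instrumental tones deviate from the harmonic_partials idealisation; inharmonic instruments such as bells have partials that are not integer multiples of the fundamental, which is precisely why their perceived pitch can be ambiguous.</p>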
      <p>
        On the other hand, timbre is a sound property that is even more difficult to grasp. ANSI defines timbre as the "attribute of auditory sensation, in terms of which a subject can judge that two sounds similarly presented and having the same loudness and pitch are dissimilar" [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. However, even today, the definition of timbre is much debated. For example, it has been defined as a "misleadingly simple and vague word encompassing a very complex set of auditory attributes, as well as a plethora of psychological and musical issues" [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]. The main problem in defining timbre is that there is no precise correspondence with one or
more physical aspects. Timbre is also challenging to abstract, as it is usually
described using adjectives such as light, dark, bright, rough, violin-like, etc.
Numerous studies have shown that timbre depends on a large number of factors
and physical characteristics. Some research has tried to represent the perception
of timbre using multidimensional scaling (e.g. [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]). Other studies in this area
focus on the grouping of timbre into a \Timbre Space" [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]. While those studies
represented timbre in terms of perceptual dimensions, others have represented
timbre in terms of physical dimensions, such as the set of parameters needed for
a particular synthesis algorithm [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. These physical characteristics concern the time axis (e.g. attack time and variations in vibrato and amplitude), spectral features (e.g. the spectral centroid), and harmonic features (e.g. the odd-to-even ratio and inharmonicity). An approximation of these features is provided by a model called ADSR, which consists of analysing the envelope of the sound wave and measuring its attack (A), decay (D), sustain (S) and release (R). The envelope can be defined as a smooth curve outlining the extremes in the amplitude of a waveform. The ADSR model, however, is a strong simplification and only yields a meaningful approximation for the amplitude envelopes of tones generated by specific instruments [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ].
      </p>
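      <p>The ADSR model described above can be sketched as a piecewise-linear envelope; this is a toy illustration of the model under our own simplifying assumptions, not an implementation taken from the ontology:</p>

```python
# Piecewise-linear sketch of the ADSR envelope model.
# attack, decay and release are durations in seconds; sustain is a level
# in [0, 1]; note_length is how long the note is held before release.

def adsr_amplitude(t, attack, decay, sustain, release, note_length):
    if t < 0:
        return 0.0
    if t < attack:                      # A: ramp from silence to the peak
        return t / attack
    if t < attack + decay:              # D: fall from the peak to the sustain level
        return 1.0 - (1.0 - sustain) * (t - attack) / decay
    if t < note_length:                 # S: hold the sustain level
        return sustain
    if t < note_length + release:       # R: fade from sustain back to silence
        return sustain * (1.0 - (t - note_length) / release)
    return 0.0

print(adsr_amplitude(0.1, 0.1, 0.2, 0.6, 0.3, 1.0))  # 1.0 (peak, end of attack)
print(adsr_amplitude(0.5, 0.1, 0.2, 0.6, 0.3, 1.0))  # 0.6 (sustain)
print(adsr_amplitude(1.5, 0.1, 0.2, 0.6, 0.3, 1.0))  # 0.0 (fully released)
```

      <p>Real envelopes are not straight lines, which is one reason the model only approximates tones from specific instruments.</p>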
      <p>
        The same distinction drawn between frequency and pitch can be made for
loudness and sound intensity. Loudness is a perceptual property whereby sounds
are ordered on a scale from quiet to loud. Sound intensity is generally measured in decibels (dB) and expresses the sound power (in watts) relative to the area over which the sound propagates (in square metres) [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].
Loudness, as a perceptual property, is by its nature subjective and is closely related to sound intensity. However, loudness also depends on other sound characteristics, such as duration or frequency [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ]. Also, the duration of a sound influences perception, since the human auditory system averages the effect of sound intensity over an interval of up to a second.
      </p>
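      <p>The physical counterpart of loudness can be sketched with the standard sound intensity level formula; the reference intensity I0 = 1e-12 W/m² (the conventional threshold of hearing) is an assumption of this sketch, not a value taken from the ontology:</p>

```python
import math

# Sound intensity level in decibels, relative to the conventional
# threshold of hearing I0 = 1e-12 W/m^2 (an assumed reference).

I0 = 1e-12  # W/m^2

def intensity_level_db(intensity: float) -> float:
    return 10.0 * math.log10(intensity / I0)

print(intensity_level_db(1e-12))         # 0.0 (threshold of hearing)
print(round(intensity_level_db(1e-6)))   # 60  (roughly conversational speech)
```

      <p>Equal intensity does not imply equal loudness: as stated above, perceived loudness also depends on frequency and duration.</p>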
      <p>Formalising the aforementioned perceptual concepts is beyond the scope of this paper. Instead, the purpose of this contribution is to define (some of) the attributes that can provide helpful information to describe sound material. The importance of this analysis lies in highlighting the inner complexity of the sound material, underlining the connections that can be drawn between physical aspects and perceptual components, as well as the relationships among these aspects.</p>
    </sec>
    <sec id="sec-3">
      <title>Related Work</title>
      <p>
        Several Semantic Web ontologies have been proposed for modelling musical data.
The Music Theory Ontology (MTO) [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ] aims at modelling music theoretical
concepts. This ontology focuses on the rudiments that are necessary to understand and analyse music. However, it is limited to describing note attributes (dynamics, duration, articulation, etc.) at the granularity of a note set. This limitation considerably reduces the expressiveness of the model, especially in relation to polyphonic and multi-voice music.
      </p>
      <p>
        The Music Score Ontology (Music OWL) [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] aims to represent concepts similar to those described in the Music Theory Ontology. However, it focuses on music notation, and its classes are all aimed at representing music in notated form, hence related to music sheets.
      </p>
      <p>
        The Music Notation Ontology [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] is an ontology of music notation content, focused on the core "semantic" information present in a score and subject to the analytic process. Moreover, it models some high-level analytic concepts (e.g. dissonances) and associates them with fragments of the score.
      </p>
      <p>However, all these ontologies only consider the characteristics of the musical material from the point of view of either a symbolic representation or a music score. The aim of this paper is instead to propose a model that is as general as possible and that can therefore represent musical notation expressed in different formats, considering scores and symbolic notations at the same time. This would allow interoperability between different notation formats and the standardisation of musical notation in a format that aims to be as expressive as possible.</p>
      <p>Moreover, to the best of our knowledge there are no works that relate the musical note (understood as part of the symbolic notation) to the corresponding musical object (understood as the realisation of a musical note). In addition, none of the proposed ontologies seem to model the physical information related to the audio signal.</p>
      <p>Therefore, this paper proposes an ontology that can represent a note both
from a symbolic and a performance point of view. To do this, it is necessary
to take into account both the symbolic characteristics (the signs and attributes
typical of musical notation), and the physical characteristics of the note as
reproduced by a musical instrument. However, this work is not intended to go into
the perceptual characteristics of music.</p>
      <p>
        The problem of modelling information with respect to its realisation has already been analysed in the past. For example, the Ontology Design Pattern (ODP) Information Realization (http://www.ontologydesignpatterns.org/cp/owl/informationrealization.owl), extracted from the DOLCE+DnS Ultralite ontology (http://ontologydesignpatterns.org/wiki/Ontology:DOLCE+DnS_Ultralite) [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ], helps ontology designers to model this type of scenario. This pattern allows designers to model information objects and their realizations, making it possible to reason about physical objects and the information they realize while keeping the two distinguished.
      </p>
      <p>
        The same kind of relationship can be found in the FRBR (https://www.ifla.org/publications/functional-requirements-for-bibliographic-records) [
        <xref ref-type="bibr" rid="ref24">24</xref>
        ] conceptual entity-relationship model, which groups entities into four different levels (work, expression, manifestation and item). The relationship between expression, defined as the "specific intellectual or artistic form that a work takes each time it is realized", and manifestation, defined as "the physical embodiment of an expression of a work", can be traced back to the modelling problem discussed in this paper.
      </p>
      <p>However, these examples do not solve the modelling problems raised in the previous sections. In particular, the relationship between the approximate features represented by the symbolic notation and their realisation remains a challenge in the field of ontology design. To address it, this research proposes a new ontology to represent symbolic musical notation and its realisation, considering the relationships between the abstraction of musical features expressed in the symbolic notation and the physical characteristics of the audio signal. Furthermore, this paper follows a pattern-based approach, proposing an ontology composed of modular and reusable elements for the representation of both audio and symbolic musical content.</p>
    </sec>
    <sec id="sec-4">
      <title>The Music Note Ontology</title>
      <p>The Music Note Ontology (see Figure 1) models a musical note both as a constituent element of the score and as a musical object, i.e. as a realisation of the event represented by a score or symbolic notation. The OWL file of the ontology can be found at https://purl.org/andreapoltronieri/notepattern. To do this, the proposed ontology represents both the elements and the typical hierarchies of a score, describing all its attributes. In addition, the ontology models the realisation of a note, describing its physical characteristics in a given performance.</p>
      <p>The proposed ontology is composed of three distinct Ontology Design Patterns, which are presented in this paper and have been submitted to the ODP portal at the following links:</p>
      <p>The Score Part Pattern: http://ontologydesignpatterns.org/wiki/Submissions:Scorepart</p>
      <p>The Musical Object Pattern: http://ontologydesignpatterns.org/wiki/Submissions:Musicalobject</p>
      <p>The Music Note Pattern: http://ontologydesignpatterns.org/wiki/Submissions:Notepattern</p>
      <table-wrap id="tab1">
        <label>Table 1</label>
        <caption>
          <p>A selection of competency questions addressed by the Music Note Ontology.</p>
        </caption>
        <table>
          <thead>
            <tr><th>ID</th><th>Competency Question</th></tr>
          </thead>
          <tbody>
            <tr><td>CQ1</td><td>What is the name of a note?</td></tr>
            <tr><td>CQ2</td><td>What part of the score does a note belong to?</td></tr>
            <tr><td>CQ3</td><td>What are the dynamic indications referring to a note in the score?</td></tr>
            <tr><td>CQ4</td><td>What is the fundamental frequency of a note?</td></tr>
            <tr><td>CQ5</td><td>What are the different frequencies that make up the spectrum of a note?</td></tr>
            <tr><td>CQ6</td><td>What is the duration in seconds of a note, in a given performance?</td></tr>
            <tr><td>CQ7</td><td>How is the envelope of a note shaped?</td></tr>
          </tbody>
        </table>
      </table-wrap>
      <p>The note as represented in the score is described by the Music Note Pattern (in green in the figure), which in turn imports the Score Part Pattern (in orange in the figure) and the Musical Object Pattern (in yellow in the figure). The former import describes the relationships between the different components of a score, while the latter models the musical object from the point of view of its physical features.</p>
      <p>A selection of competency questions that the ontology must be able to answer is listed in Table 1. The complete list of competency questions is available in the repository of the project (https://github.com/andreamust/music_note_pattern/).</p>
      <p>Some of the classes and properties of the ontology have also been aligned with currently available ontologies; the alignments are available in a separate file (purl.org/andreapoltronieri/notepattern-aligns). Alignments with other ontologies were mainly made on the Score Part Pattern and the Music Note Pattern. Although some of the available ontologies (see Section 3) define some of the classes and properties present in these patterns, we nevertheless needed to re-model the score and symbolic elements: one objective of this work is to abstract from any specific representation system, combining the attributes of the musical score with other elements of the MIDI representation (as the most widely used symbolic representation) and, to the best of our knowledge, no available ontologies have these characteristics.</p>
      <p>The hierarchy (i.e. the mereological structure) of a music score is described by the Score Part Pattern (https://purl.org/andreapoltronieri/scorepart). The Part class describes the instrumental part of a score, which refers to a specific staff of the musical notation that is associated with an instrument. The instrument playing the part is modelled by the class Instrument, which is associated by means of a datatype property (hasMidiProgram) with the MIDI program assigned to that instrument. Each part of the score is also divided into sections (class Section), similar music fragments (class HomogeneousFragment) and voices (class Voice). A voice is defined as one of several parts that can be performed within the same staff (e.g. alto and soprano). A section is instead a part of the music score delimited by repetition signs. A fragment, on the other hand, is a grouping of notes that share the same metric, tempo and clef, described by the object properties hasMetric, hasTempo and hasClef.</p>
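      <p>A hypothetical instance of this hierarchy can be sketched as plain nested data; the class and property names follow the text above, while the concrete values (a violin part, a single fragment, etc.) are invented for illustration:</p>

```python
# Hypothetical instance of the Score Part Pattern hierarchy as nested data.
# Class/property names mirror the pattern; the concrete values are invented.

score_part = {
    "Part": "Violin I",
    "Instrument": {"name": "Violin", "hasMidiProgram": 40},  # GM program, zero-based
    "Section": [{
        "HomogeneousFragment": [{
            "hasMetric": "4/4",       # a fragment shares one metric...
            "hasTempo": "Allegro",    # ...one tempo...
            "hasClef": "treble",      # ...and one clef
            "Voice": ["soprano", "alto"],
        }],
    }],
}

fragment = score_part["Section"][0]["HomogeneousFragment"][0]
print(fragment["hasClef"])  # treble
```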
      <sec id="sec-4-1">
        <title>The Musical Object Pattern</title>
        <p>The Musical Object Pattern (https://purl.org/andreapoltronieri/musicalobject) models the physical features of a musical note object, i.e. the execution of a note. Specifically, this pattern models the physical characteristics that can be extracted from the sound wave produced by an instrument playing a musical note. The MusicalObject class is connected to four classes that describe these physical characteristics, namely duration, sound intensity, frequency and envelope. The MusicObjectDuration class expresses the duration in seconds of the musical object, by means of the datatype property hasDurationInSeconds. In the same way, the sound intensity is modelled via the SoundIntensity class. Frequency is modelled by means of the class Frequency and its sub-classes FundamentalFrequency and PartialFrequency. For each expressed frequency, the magnitude of the frequency is also indicated using the hasFrequencyMagnitude datatype property. Finally, the Envelope class is connected to four datatype properties that describe the envelope of the waveform according to the ADSR model, namely hasAttack, hasSustain, hasDecay and hasRelease.</p>
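        <p>The four physical descriptors can be mirrored in a small data structure. The following Python sketch is a hypothetical plain-object rendering of the pattern: the field names echo the ontology's terms, and the note values are invented:</p>

```python
from dataclasses import dataclass, field
from typing import Optional

# Hypothetical mirror of the Musical Object Pattern: one note execution
# described by duration, sound intensity, frequency content and envelope.

@dataclass
class Envelope:                          # cf. hasAttack/hasDecay/hasSustain/hasRelease
    hasAttack: float
    hasDecay: float
    hasSustain: float
    hasRelease: float

@dataclass
class MusicalObject:
    hasDurationInSeconds: float
    soundIntensityDb: float              # cf. SoundIntensity
    fundamentalFrequencyHz: float        # cf. FundamentalFrequency
    partials: list = field(default_factory=list)  # (Hz, magnitude), cf. hasFrequencyMagnitude
    envelope: Optional[Envelope] = None

a4 = MusicalObject(
    hasDurationInSeconds=0.5,
    soundIntensityDb=60.0,
    fundamentalFrequencyHz=440.0,
    partials=[(880.0, 0.5), (1320.0, 0.25)],
    envelope=Envelope(0.05, 0.1, 0.7, 0.2),
)
print(a4.fundamentalFrequencyHz)  # 440.0
```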
      </sec>
      <sec id="sec-4-2">
        <title>The Music Note Pattern</title>
        <p>The Music Note Pattern (https://purl.org/andreapoltronieri/notepattern) models the symbolic note (as represented in a score or in most symbolic representation systems) and its attributes. Moreover, this ODP aims to abstract over different music representation systems, which means that the pattern can represent music originally encoded in other formats. To do so, the proposed pattern describes each symbolic note both by means of standard music score notation and by using the MIDI reference for each of the note's attributes. This would allow conversion between representation systems.</p>
        <p>SymbolicNote is the central class, representing the note as it appears in the score. This class has seven subclasses, one for each of the seven notes of the tempered system (primitive notes), plus 35 subclasses defined as combinations of primitives and accidentals. The class Accidental (which in turn has five subclasses: Flat, Sharp, Natural, DoubleFlat, and DoubleSharp) allows the representation of altered notes. This makes it possible to represent every note of the Western notation system, taking into account its function with respect to the tonality of the piece (e.g. differentiating between the notes A# and Bb). Notice that such differentiation has an important cognitive function for the music reader (distinguishing enharmonics according to the tonality), a function that is mostly irrelevant for a computer reading a MIDI file, but very relevant for an AI learning to generate music from notation. Furthermore, the SymbolicNote class is linked to several classes and datatype properties describing the note's attributes. The class Position defines the position of the note both in relation to the score (datatype property hasToMeasure) and within the measure (datatype property hasPositionInMeasure). The class NoteDuration describes the symbolic duration of the note, as expressed in the music score. The NoteDynamic class, instead, describes the dynamics of the note with reference both to musical notation (datatype property hasLiteralDynamic, e.g. crescendo, vibrato, etc.) and to the MIDI standard (datatype property hasMidiVelocity). Similarly, the NotePitch class defines both the octave in which the note is located (datatype property isDefinedByOctave) and the MIDI pitch (datatype property hasMidiPitch). In addition, the SymbolicNote class is linked to the other two patterns that make up the Music Note Ontology: the SymbolicNote belongs to a Section rather than to a Voice, while it has its realisation in a MusicalObject. Finally, the relationships between the attributes of the SymbolicNote and those of the MusicalObject are expressed.</p>
        <p>The NotePitch of a symbolic note is represented as the abstraction of the
musical object's frequency (expressed by the object property
hasFrequencyAbstraction), while NoteDynamic and NoteDuration are described
as the abstraction of SoundIntensity and MusicObjectDuration, respectively
(object properties hasDynamicAbstraction and hasDurationAbstraction).</p>
        <p>In order to test the ontology, a Knowledge Base (KB, https://purl.org/andreapoltronieri/notepatterndata) containing a single note and a single musical object was created. SPARQL queries were also created for each of the competency questions defined in Table 1. The KB can be tested against the SPARQL queries, in order to verify the expressiveness of the ontology with respect to the competency questions. The complete list of SPARQL queries is available in the repository of the project.</p>
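        <p>The idea behind this test setup can be illustrated with a toy example; the triples and property names below are invented stand-ins for the actual KB, and the matcher is a minimal pattern match, not a SPARQL engine:</p>

```python
# Toy illustration of answering a competency question (CQ4: the fundamental
# frequency of a note) over a triple-shaped KB. All names are hypothetical.

kb = {
    ("note1", "hasRealisation", "obj1"),
    ("obj1", "hasFundamentalFrequency", "freq1"),
    ("freq1", "hasFrequencyValue", 440.0),
}

def match(pattern):
    """All triples matching an (s, p, o) pattern; None acts as a variable."""
    return [t for t in kb
            if all(q is None or q == v for q, v in zip(pattern, t))]

# CQ4: note -> realisation -> fundamental frequency -> value
obj = match(("note1", "hasRealisation", None))[0][2]
freq = match((obj, "hasFundamentalFrequency", None))[0][2]
print(match((freq, "hasFrequencyValue", None))[0][2])  # 440.0
```

        <p>A SPARQL engine evaluates exactly this kind of basic graph pattern, with variables shared across triple patterns instead of chained lookups.</p>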
        <p>
          The ontology can also be used to enrich the description of music content that
is already annotated using other ontological models. For example, the classes
and properties describing the structure of musical notation can be aligned to the
widely used Music Ontology [
          <xref ref-type="bibr" rid="ref20">20</xref>
          ]. Similarly, the same classes can be aligned with
the aforementioned Music OWL and Music Score Ontology.
        </p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Conclusions and Future Perspectives</title>
      <p>In this paper we have proposed the Music Note Ontology, an ontology for modelling musical content both from a symbolic point of view and in terms of its realisation. To this end, three different Ontology Design Patterns have been designed, modelling the internal relations of the score, the note from the symbolic point of view, and the note as a musical object. The relationships between the different musical notations have also been considered, in particular between the symbolic abstraction and the features that can be extracted from the audio signal of a performance.</p>
      <p>In our future work we aim at modelling the perception of the different
components of the audio signal, describing the different perceptual levels at which it is
possible to experience music. We also aim to formalise the musical perception of
different features from a subjective point of view, e.g. in individuals with different
musical skills or from different musical cultures. Furthermore, since the model
proposed at the moment can only express notes related to the traditional
notation system (and thus related to the tempered system of the Western musical
tradition), we plan to expand the ontology with notes from other temperaments
and musical traditions, as well as unpitched notes and microtonal variations.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1. <string-name><surname>Antovic</surname>, <given-names>M.</given-names></string-name>: <article-title>Musical metaphors in Serbian and Romani children: An empirical study</article-title>. <source>Metaphor and Symbol</source> <volume>24</volume>(<issue>3</issue>), <fpage>184</fpage>–<lpage>202</lpage> (Jul <year>2009</year>). https://doi.org/10.1080/10926480903028136
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2. MIDI Manufacturers Association: <article-title>The complete MIDI 1.0 detailed specification: Incorporating all recommended practices</article-title> (<year>1996</year>), https://books.google.it/books?id=cPpcPQAACAAJ
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3. <string-name><surname>Brown</surname>, <given-names>C.</given-names></string-name>: <article-title>Historical performance, metronome marks and tempo in Beethoven's symphonies</article-title>. <source>Early Music</source> <volume>XIX</volume>(<issue>2</issue>), <fpage>247</fpage>–<lpage>260</lpage> (05 <year>1991</year>). https://doi.org/10.1093/earlyj/XIX.2.247
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4. <string-name><surname>Bruneau</surname>, <given-names>M.</given-names></string-name>, <string-name><surname>Scelo</surname>, <given-names>T.</given-names></string-name>, <string-name><surname>d'Acoustique</surname>, <given-names>S.</given-names></string-name>: Fundamentals of Acoustics. ISTE, Wiley (<year>2013</year>), https://books.google.it/books?id=1_y6gKuGSb0C
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5. <string-name><surname>Carlyon</surname>, <given-names>R.P.</given-names></string-name>, <string-name><surname>Shackleton</surname>, <given-names>T.M.</given-names></string-name>: <article-title>Comparing the fundamental frequencies of resolved and unresolved harmonics: Evidence for two pitch mechanisms?</article-title> <source>The Journal of the Acoustical Society of America</source> <volume>95</volume>(<issue>6</issue>), <fpage>3541</fpage>–<lpage>3554</lpage> (<year>1994</year>). https://doi.org/10.1121/1.409971
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6. <string-name><surname>Cherfi</surname>, <given-names>S.S.s.</given-names></string-name>, <string-name><surname>Guillotel</surname>, <given-names>C.</given-names></string-name>, <string-name><surname>Hamdi</surname>, <given-names>F.</given-names></string-name>, <string-name><surname>Rigaux</surname>, <given-names>P.</given-names></string-name>, <string-name><surname>Travers</surname>, <given-names>N.</given-names></string-name>: <article-title>Ontology-based annotation of music scores</article-title>. <source>In: Proceedings of the Knowledge Capture Conference (K-CAP 2017)</source>, Association for Computing Machinery, New York, NY, USA (<year>2017</year>). https://doi.org/10.1145/3148011.3148038
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7. <string-name><surname>Cook</surname>, <given-names>N.</given-names></string-name>: <article-title>Towards the complete musicologist</article-title>. <source>In: Proceedings of the 6th International Conference on Music Information Retrieval</source> (<year>2006</year>)
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8. <string-name><surname>Dannenberg</surname>, <given-names>R.B.</given-names></string-name>: <article-title>Music representation issues, techniques, and systems</article-title>. <source>Computer Music Journal</source> <volume>17</volume>(<issue>3</issue>), <fpage>20</fpage>–<lpage>30</lpage> (<year>1993</year>), http://www.jstor.org/stable/3680940
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9. <string-name><surname>Grey</surname>, <given-names>J.M.</given-names></string-name>: <article-title>Multidimensional perceptual scaling of musical timbres</article-title>. <source>The Journal of the Acoustical Society of America</source> <volume>61</volume>(<issue>5</issue>), <fpage>1270</fpage>–<lpage>1277</lpage> (<year>1977</year>). https://doi.org/10.1121/1.381428
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10. <string-name><surname>Grove</surname>, <given-names>G.</given-names></string-name>, <string-name><surname>Fuller-Maitland</surname>, <given-names>J.</given-names></string-name>: <source>Grove's Dictionary of Music and Musicians</source>. No. v. 4 in Grove's Dictionary of Music and Musicians, Macmillan (<year>1908</year>), https://books.google.it/books?id=FBsPAAAAYAAJ
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11. American National Standards Institute, <string-name><surname>Sonn</surname>, <given-names>M.</given-names></string-name>, Acoustical Society of America: <article-title>American National Standard Psychoacoustical Terminology</article-title>. ANSI, American National Standards Institute (<year>1973</year>), https://books.google.it/books?id=5Fo0GQAACAAJ
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12. <string-name><surname>Jones</surname>, <given-names>J.</given-names></string-name>, <string-name><surname>de Siqueira Braga</surname>, <given-names>D.</given-names></string-name>, <string-name><surname>Tertuliano</surname>, <given-names>K.</given-names></string-name>, <string-name><surname>Kauppinen</surname>, <given-names>T.</given-names></string-name>: <article-title>MusicOWL: The music score ontology</article-title>. <source>In: Proceedings of the International Conference on Web Intelligence</source>. pp. <fpage>1222</fpage>–<lpage>1229</lpage>. WI '17, Association for Computing Machinery, New York, NY, USA (<year>2017</year>). https://doi.org/10.1145/3106426.3110325
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13. <string-name><surname>Mascardi</surname>, <given-names>V.</given-names></string-name>, <string-name><surname>Cordì</surname>, <given-names>V.</given-names></string-name>, <string-name><surname>Rosso</surname>, <given-names>P.</given-names></string-name>: <article-title>A comparison of upper ontologies</article-title>. <source>In: WOA</source>. pp. <fpage>55</fpage>–<lpage>64</lpage> (01 <year>2007</year>)
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14. <string-name><surname>McAdams</surname>, <given-names>S.</given-names></string-name>, <string-name><surname>Giordano</surname>, <given-names>B.L.</given-names></string-name>: <article-title>The perception of musical timbre</article-title>, pp. <fpage>113</fpage>–<lpage>124</lpage>. Oxford library of psychology, Oxford University Press, New York, NY, US (<year>2016</year>)
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15. <string-name><surname>McLachlan</surname>, <given-names>N.M.</given-names></string-name>: <article-title>Timbre, Pitch, and Music</article-title>. Oxford University Press (Jun <year>2016</year>). https://doi.org/10.1093/oxfordhb/9780199935345.013.44
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16. <string-name><surname>Moore</surname>, <given-names>B.C.J.</given-names></string-name>: <source>An introduction to the psychology of hearing</source>, 5th ed., Academic Press, San Diego, CA, US (<year>2003</year>)
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17. <string-name><surname>Müller</surname>, <given-names>M.</given-names></string-name>: Fundamentals of Music Processing: Audio, Analysis, Algorithms, Applications. Springer Publishing Company, Incorporated, 1st edn. (<year>2015</year>)
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18. <string-name><surname>Nattiez</surname>, <given-names>J.J.</given-names></string-name>, <string-name><surname>Dunsby</surname>, <given-names>J.M.</given-names></string-name>: <article-title>Fondements d'une sémiologie de la musique</article-title>. <source>Perspectives of New Music</source> <volume>15</volume>(<issue>2</issue>), <fpage>226</fpage>–<lpage>233</lpage> (Spring–Summer <year>1977</year>). https://doi.org/10.2307/832821, http://www.jstor.org/stable/832821
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19. <string-name><surname>Rashid</surname>, <given-names>S.M.</given-names></string-name>, <string-name><surname>De Roure</surname>, <given-names>D.</given-names></string-name>, <string-name><surname>McGuinness</surname>, <given-names>D.L.</given-names></string-name>: <article-title>A music theory ontology</article-title>. <source>In: Proceedings of the 1st International Workshop on Semantic Applications for Audio and Music</source>. pp. <fpage>6</fpage>–<lpage>14</lpage>. SAAM '18, Association for Computing Machinery, New York, NY, USA (<year>2018</year>). https://doi.org/10.1145/3243907.3243913
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20. <string-name><surname>Raimond</surname>, <given-names>Y.</given-names></string-name>, <string-name><surname>Abdallah</surname>, <given-names>S.</given-names></string-name>, <string-name><surname>Sandler</surname>, <given-names>M.</given-names></string-name>, <string-name><surname>Giasson</surname>, <given-names>F.</given-names></string-name>: <article-title>The music ontology</article-title>. <source>In: Proceedings of the 8th International Conference on Music Information Retrieval (ISMIR 2007)</source>. pp. <fpage>417</fpage>–<lpage>422</lpage>. Vienna, Austria (Sep <year>2007</year>)
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21. <string-name><surname>Rusconi</surname>, <given-names>E.</given-names></string-name>, <string-name><surname>Kwan</surname>, <given-names>B.</given-names></string-name>, <string-name><surname>Giordano</surname>, <given-names>B.</given-names></string-name>, <string-name><surname>Umiltà</surname>, <given-names>C.</given-names></string-name>, <string-name><surname>Butterworth</surname>, <given-names>B.</given-names></string-name>: <article-title>Spatial representation of pitch height: the SMARC effect</article-title>. <source>Cognition</source> <volume>99</volume>(<issue>2</issue>), <fpage>113</fpage>–<lpage>129</lpage> (Mar <year>2006</year>). https://doi.org/10.1016/j.cognition.2005.01.004
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          22. <string-name><surname>Schouten</surname>, <given-names>J.</given-names></string-name>: <article-title>The Perception of Subjective Tones</article-title>. Separaat / Laboratoria N.V. Philips Gloeilampenfabrieken (<year>1938</year>), https://books.google.it/books?id=d9vjzQEACAAJ
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          23. <string-name><surname>Stewart</surname>, <given-names>L.</given-names></string-name>, <string-name><surname>Walsh</surname>, <given-names>V.</given-names></string-name>, <string-name><surname>Frith</surname>, <given-names>U.</given-names></string-name>: <article-title>Reading music modifies spatial mapping in pianists</article-title>. <source>Perception &amp; Psychophysics</source> <volume>66</volume>(<issue>2</issue>), <fpage>183</fpage>–<lpage>195</lpage> (Feb <year>2004</year>). https://doi.org/10.3758/bf03194871
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          24. <string-name><surname>Weiss</surname>, <given-names>P.J.</given-names></string-name>, <string-name><surname>Shadle</surname>, <given-names>S.</given-names></string-name>: <article-title>FRBR in the real world</article-title>. <source>The Serials Librarian</source> <volume>52</volume>(<issue>1-2</issue>), <fpage>93</fpage>–<lpage>104</lpage> (<year>2007</year>). https://doi.org/10.1300/J123v52n01_09
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          25. <string-name><surname>Wiggins</surname>, <given-names>G.</given-names></string-name>: <article-title>Computer-representation of music in the research environment</article-title>. <source>Modern Methods for Musicology: Prospects, Proposals, and Realities</source>, pp. <fpage>7</fpage>–<lpage>22</lpage> (<year>2009</year>), https://doi.org/10.4324/9781315595894
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          26. <string-name><surname>Wiggins</surname>, <given-names>G.</given-names></string-name>, <string-name><surname>Miranda</surname>, <given-names>E.</given-names></string-name>, <string-name><surname>Smaill</surname>, <given-names>A.</given-names></string-name>, <string-name><surname>Harris</surname>, <given-names>M.</given-names></string-name>: <article-title>A framework for the evaluation of music representation systems</article-title>. <source>Computer Music Journal</source> <volume>17</volume>(<issue>3</issue>), <fpage>31</fpage>–<lpage>42</lpage> (<year>1993</year>), http://www.jstor.org/stable/3680941
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>