<<

Carnatic : A Computational Perspective

Hema A Murthy Department of Computer Science and Engineering IIT Madras [email protected]

e-mail: [email protected]

December 13 2013 MIR Indian Music Preliminaries Tonic Gamaka¯s in Cent filterbanks Identifying the strokes of the Pitch Extraction Conclusions

Outline

MIR and Carnatic Music Carnatic Music Concert Preliminaries Tonic Melodic cues Processing the Drone Gamaka¯s in Carnatic Music Cent filterbanks Mridangam stroke transcription Modes of the mridangam Transcription Pitch Extraction Conclusions

Carnatic Music: A Computational Perspective MIR Indian Music Preliminaries Tonic Gamaka¯s in Carnatic Music Cent filterbanks Identifying the strokes of the mridangam Pitch Extraction Conclusions

Music Information Retrieval and Carnatic Music

• Indian – rich repertoires, many traditions, many genres • An oral tradition • Well established teaching and learning practices • Hardly archived and studied scientifically • is rich in Manodharma – improvisation • Difficult to analyse and represent using ideas from Western Music • Objective: Enhance experience through MIR

Carnatic Music: A Computational Perspective MIR Indian Music Preliminaries Tonic Gamaka¯s in Carnatic Music Cent filterbanks Identifying the strokes of the mridangam Pitch Extraction Conclusions

Structure of a Carnatic Music Concert: A listener’s perspective

• A concert is made up a sequence of items. • Items correspond to different forms. • Each form has a specific characteristic. • ¯ s are seldom repeated (except in thematic concerts) • One or more pieces in a concert are taken up for elaboration • : • an alaapana, a composition (, , ), niraval, svaraprasthara, solo percussion • Raagam Taanam Pallavi (RTP): • RTP: an alaapana, a taanam, a composition (pallavi only), svaraprasthara, solo percussion. • Pallavi is rendered at different speeds with niraval. • Svaraprasthara may include multiple melodies. • Rhythmic cycles chosen – complex (e.g. Adi taalam (tisra nadai)) • Other types: padham, jaavali, viruttam, slokam, , The Main item and RTP are generally the hallmarks of a concert.

Carnatic Music: A Computational Perspective MIR Indian Music Preliminaries Tonic Gamaka¯s in Carnatic Music Cent filterbanks Identifying the strokes of the mridangam Pitch Extraction Conclusions

Primary Aspects of Indian Classical Music from a Computational Perspective I

• Tonic – the base sur chosen by the to render the music. • Each musician has his/her own tonic • Bombay Jayashree: 220 Hz, T M : 140 Hz, MDR: 110 Hz. • Raga¯ or melody • Gamaka¯s – the inflection of notes • Gamaka¯s are associated with phrases of raga¯ s1 • Exploration of Al¯ apana¯ through the relevant phrases • Phrases are rendered in different tempos • Phrases are derived from compositions – especially from the trinity • Talas • Strong tradition of rhythm • Carnatic Music – kalai, nadai, jaati. • Mrudangam stroke analysis • Segmentation of tani • Can we transcribe the same? • Other Aspects 1: The concert is more like a conversation between the artist and the audience.

Carnatic Music: A Computational Perspective MIR Indian Music Preliminaries Tonic Gamaka¯s in Carnatic Music Cent filterbanks Identifying the strokes of the mridangam Pitch Extraction Conclusions

Primary Aspects of Indian Classical Music from a Computational Perspective II

• The appreciation by the audience is more a “here and now”2 • A concert is replete with applauses – can we use these to segment and archive them in terms of pieces? • Other Aspects 2: Since motivic analysis is based on pitch – and most algorithms fall short – explore new algorithms for pitch based on phase.

1 T M Krishna and Vignesh Ishwar, “Carnatic Music: , Gamaka, Motif and Raga Identity”, 2nd CompMusic Workshop, Istanbul, Turkey 2 M V N Murthy, “Applause and Aesthetic Experience,”http://compmusic.upf.edu/zh-hans/node/151

Carnatic Music: A Computational Perspective MIR Indian Music Preliminaries Tonic Gamaka¯s in Carnatic Music Cent filterbanks Identifying the strokes of the mridangam Pitch Extraction Conclusions

Tonic I

3. • Tonic – A fundamental concept of Indian classical music • Tonic – Pitch chosen by the performer to serve as reference • The svara Sa in the middle octave range is the tonic • Drone is played to establish tonic – /Tambura • Accompanying instruments also tune to the tonic • Melodies defined relative to tonic

Carnatic Music: A Computational Perspective MIR Indian Music Preliminaries Tonic Gamaka¯s in Carnatic Music Cent filterbanks Identifying the strokes of the mridangam Pitch Extraction Conclusions

Tonic II

320 320

280 280

240 240

200 200

160 160 Frequency Hz Frequency Hz 120 120

80 80

40 40

0 0 0 1000 2000 3000 4000 5000 6000 0 1000 2000 3000 4000 5000 6000 Frames Frames

Tonic 1 Tonic 2 Drone 1 Drone 2 Alapana

3 Ashwin Bellur, “Automatic identification of tonic in Indian classical music,” MS Thesis, IIT Madras, 2013

Carnatic Music: A Computational Perspective MIR Indian Music Preliminaries Tonic Gamaka¯s in Carnatic Music Cent filterbanks Identifying the strokes of the mridangam Pitch Extraction Conclusions

Approaches to identify tonic

• Music provides various cues to the listener about identity of the tonic • Cues can be divided into two broad classes

1. Melodic characteristics of the music 1 2. Tuning of the drone2

1 S Arthi H G and T V Sreenivas. Shadja, swara identification and raga verification in alapana using stochastic models. WASPAA 2011 pages 29-32, 2011. 2 Salamon, J., S. Gulati, and X. Serra (2012). A multipitch approach to tonic identification in indian classical music. In Proc. of ISMIR, 157163

Carnatic Music: A Computational Perspective MIR Indian Music Preliminaries Tonic Gamaka¯s in Carnatic Music Cent filterbanks Identifying the strokes of the mridangam Pitch Extraction Conclusions

Cue 1 - Melodic characteristics of the music

• Mono pitch extracted from the audio – prominent pitch • Pitch range: 40 to 700 Hz, window size = 133ms • Histograms as primary representation • Typical Histograms for Hindustani and Carnatic music (bin width 1 Hz)

6000 4000

5000 (b) 3000 4000

3000 2000

2000 1000 1000 Number of Instances (Carnatic Item) Number of Instances (Hindustani Item) 0 0 50 100 150 200 250 300 350 400 50 100 150 200 250 300 350 400 Frequency (Hz) Frequency (Hz)

Figure: Carnatic Pitch Histogram Figure: Hindustani Pitch Histogram

Carnatic Music: A Computational Perspective MIR Indian Music Preliminaries Tonic Gamaka¯s in Carnatic Music Cent filterbanks Identifying the strokes of the mridangam Pitch Extraction Conclusions

Pitch Histograms based processing

1. A peak indicating note Sa always present, not necessarily the tallest peak 2. Drone and percussion ensure a peak at Sa 3. Histogram envelope is almost continuous due to gamakas. ( Example ) 4. Fixed ratio between peaks representing svara Sa and Pa 5. Less inflected nature of Sa and Pa 6. Characteristics more prominent in Carnatic music

Carnatic Music: A Computational Perspective MIR Indian Music Preliminaries Tonic Gamaka¯s in Carnatic Music Cent filterbanks Identifying the strokes of the mridangam Pitch Extraction Conclusions

Cue 2 - Drone

• Determine tuning of the drone to identify tonic • Drone omnipresent in Indian classical music • Strings of the tambura tuned to indicate svara Sa • Tuning of one of the strings varies depending on the raga being performed • Attempt to develop fast tonic identification techniques with minimal data

Figure: Spectogram of an excerpt of Carnatic music

Example 1 Example 2 Example 3

Carnatic Music: A Computational Perspective MIR Indian Music Preliminaries Tonic Gamaka¯s in Carnatic Music Cent filterbanks Identifying the strokes of the mridangam Pitch Extraction Conclusions

Determining tuning of the drone

• Drone omnipresent in the background • Drone extracted in low energy regions • Lead vocal frequencies predominantly occupy middle and upper octaves • Drone frequently registers pitch values at the lower octave Sadja/Sa

300 (a) 200

100 Frequency Hz 0 0 500 1000 1500 2000 2500 3000 3500 4000 Frame Number 1 lower sadja (b) middle 0.5 sadja

0 50 100 150 200 250 300 Frequency Hz

Carnatic Music: A Computational Perspective MIR Indian Music Preliminaries Tonic Gamaka¯s in Carnatic Music Cent filterbanks Identifying the strokes of the mridangam Pitch Extraction Conclusions

Drone Prominent Frames

• Pick frames with drone as the prominent source • A host of low level audio descriptors were employed • Pitch estimated using a selected bag of frames using signal processing, dictionary learning methods (Non Negative Matrix Factorisation (NMF)). Results: • Performance almost as high as 90% on 1.5min of data when signal processing cue is used. • Performance 98% with drone on 1.5mins of data.

Carnatic Music: A Computational Perspective MIR Indian Music Preliminaries Tonic Gamaka¯s in Carnatic Music Cent filterbanks Identifying the strokes of the mridangam Pitch Extraction Conclusions

Gamaka¯s in Carnatic Music4

• Written forms of a raga¯ • – sequence of notes in ascending order (generally starts from the tonic Sa). • – sequence of notes in descending order (generally starts from the tonic Sa˙ ). • Vocal/Instrumental form of a raga¯ • Replete with Gamaka¯s • Although specific placeholders – notes can meander quite signficantly about the placeholder. • The “note uttered” and “note sung” need not have a one-to-one correspondence

4 Vignesh Ishwar, Shrey Dutta, Ashwin Bellur and Hema A Murthy, “Motif spotting in an Alapana in Carnatic music,” ISMIR 2013.

Carnatic Music: A Computational Perspective MIR Indian Music Preliminaries Tonic Gamaka¯s in Carnatic Music Cent filterbanks Identifying the strokes of the mridangam Pitch Extraction Conclusions

Classification of Gamaka¯s (SSP – Subbarama Dikshitar, Dasavidha Gamaka¯ –as told to us by the experts)

• A finite number of meandering patterns are defined • kampitam, jaaru, vali, spuritham, ... • The Gamaka¯s are stitched together/one gamaka¯ is overlayed on the other • Other than gamaka¯s, “brikhas” are also used. • Standard set of melodic phrases are observed in every piece.

Carnatic Music: A Computational Perspective MIR Indian Music Preliminaries Tonic Gamaka¯s in Carnatic Music Cent filterbanks Identifying the strokes of the mridangam Pitch Extraction Conclusions

Characteristic Pitch and Spectrogram for some of the Gamaka¯s

Some example Gamaka¯s created by Vignesh Ishwar – tOdi

Kampitham:

Jaaru:

Odukkal:

Nokku:

Orikkai:

Carnatic Music: A Computational Perspective MIR Indian Music Preliminaries Tonic Gamaka¯s in Carnatic Music Cent filterbanks Identifying the strokes of the mridangam Pitch Extraction Conclusions

Gamaka¯s in a given al¯ apana¯

Vignesh tOdi Al¯ apana¯ :

KVN Al¯ apana¯ :

Carnatic Music: A Computational Perspective MIR Indian Music Preliminaries Tonic Gamaka¯s in Carnatic Music Cent filterbanks Identifying the strokes of the mridangam Pitch Extraction Conclusions

Summary of pitch analysis on complete al¯ apana¯ s

• The musician does not seem to always stick to the grammar of the gamaka¯. • Some definite phrases do exist • Most keen listeners identify raga¯ s without difficulty. • A set of Time-Frequency (T-F) motifs? • T-F representation of pitch? • Mapping from raga¯ to the language as defined by required. • Derive motifs for raga¯ s using “machine learning?”

Carnatic Music: A Computational Perspective MIR Indian Music Preliminaries Tonic Gamaka¯s in Carnatic Music Cent filterbanks Identifying the strokes of the mridangam Pitch Extraction Conclusions

Histograms of notes for different raga¯ sI

Shankarabharana Pitch Histogram 1500 Sa

1000

P G3 500 M1 N3 Sa R2 D2

0 −1500 −1000 −500 0 500 1000 1500 2000 2500 Frequency in Cents Kalyani Pitch Histogram 4000 Sa 3000 N3

P 2000 Sa G3 R2 D2 1000 M2

0 −1500 −1000 −500 0 500 1000 1500 2000 2500 Frequency in Cents

Figure: Pitch Histograms of Kalyani and Sankaraabharana

Carnatic Music: A Computational Perspective MIR Indian Music Preliminaries Tonic Gamaka¯s in Carnatic Music Cent filterbanks Identifying the strokes of the mridangam Pitch Extraction Conclusions

Histograms of notes for different raga¯ s II

• Significant of frequencies around every peak that is also frequent. • The two raga¯ s have only one note that is different – but their histograms are very different. • Pitch contour is seamless and continuous.

Carnatic Music: A Computational Perspective MIR Indian Music Preliminaries Tonic Gamaka¯s in Carnatic Music Cent filterbanks Identifying the strokes of the mridangam Pitch Extraction Conclusions

Motifs in a raga¯

Pitch Contour of an Alapana 2000

1500

1000

500

0 Frequency in Cents

−500 0 1 2 3 4 Time in Minutes Motif1 Motif2 Motif3 1400 1400 1300

1200 1200 1200 1100

1000 1000 1000 900

800 800 800 Frequency in Cents Frequency in Cents Frequency in Cents 700

600 600 600 0 0.5 1 0 0.5 1 0 0.5 1 Time in Seconds Time in Seconds Time in Seconds

Figure: a) Motifs Interspersed in an Alapana; b) Magnified Motif

• The motif is repeated in different parts of the Al¯ apana¯ • There are a number of motifs for each raga¯ • Question: Motif as a query, can we locate it in an Al¯ apana¯ ? • Al¯ apana¯ : Long, erroneous pitch contour, computationally intensive. Saddle points in tandem with Rough LCS used to locate motifs. Shrey Dutta will discuss this work.

Carnatic Music: A Computational Perspective MIR Indian Music Preliminaries Tonic Gamaka¯s in Carnatic Music Cent filterbanks Identifying the strokes of the mridangam Pitch Extraction Conclusions

A new feature for analysis of Indian Music I

Tonic shift – causes a linear shift in cent scale. A possible solution: cent filterbank banks • Analysis of a concert in Indian Music depends on tonic. • Motifs can span more than one octave. • Cochlea – can be modeled by a bank of constant Q filters. • For Indian music: Normalise by tonic – place filters uniformly on the cent scale: Cent Filterbanks

Carnatic Music: A Computational Perspective MIR Indian Music Preliminaries Tonic Gamaka¯s in Carnatic Music Cent filterbanks Identifying the strokes of the mridangam Pitch Extraction Conclusions

Cent Filterbanks

40 30 40 34 35 28 35 32 30 26 30 30 25 24 25 28 26 20 22 20 24 15 20 15 22 10 18 10 20 5 16 5 18 0 14 0 16 -5 12 -5 14 -20 0 20 40 60 80 100 120 140 -20 0 20 40 60 80 100 120 140 160 180

40 30 40 30 35 28 35 28 30 26 30 26 25 24 25 24 20 20 22 15 22 15 20 10 20 10 18 5 18 5 16 0 16 0 14 -5 14 -5 12 -20 0 20 40 60 80 100 120 140 160 180 200 -20 0 20 40 60 80 100 120

40 34 35 32 30 30 28 25 26 20 24 15 22 10 20 18 5 16 0 14 -5 12 -20 0 20 40 60 80 100 120 140

Carnatic Music: A Computational Perspective MIR Indian Music Preliminaries Tonic Gamaka¯s in Carnatic Music Cent filterbanks Identifying the strokes of the mridangam Pitch Extraction Conclusions

Extraction of Cent filterbank features5

'(;,-573

%7,-,36/(808

#(3304. <04+5<

!08*7,9, "5:70,7 &7(48-573

,49 -029,7 )(418

$5. ,49 -029,7 )(41 ,4,7.= ;(2:,8

!08*7,9, 5804, &7(48-573

,6897(2 5,--0*0,498

Carnatic Music: A Computational Perspective MIR Indian Music Preliminaries Tonic Gamaka¯s in Carnatic Music Cent filterbanks Identifying the strokes of the mridangam Pitch Extraction Conclusions

Cent Filterbanks

• Cent Filterbanks are similar to CQT (Constant Q transform) filters with a difference – they are tonic normalised. • CQT is equivalent to Cent Filterbanks with a tonic of 2. • Padi Sarala will talk more about cent filterbanks and their applications to segmentation motif recognition 6, stroke recognition7 and archival 8

6 unpublished 7 Akshay Ananthapadmanabhan, Juan Bello, Raghava Krishnan and Hema A Murthy, “Tonic independent stroke transcription of the mridangam, AES, 2014 8 Padi Sarala and Hema A Murthy, “Inter and intra segmentation of Carnatic music recordings for archival,” ISMIR 2013.

Carnatic Music: A Computational Perspective MIR Indian Music Preliminaries Tonic Gamaka¯s in Carnatic Music Cent filterbanks Identifying the strokes of the mridangam Pitch Extraction Conclusions

Repeating the experiments of C V Raman9.

• Strokes were played carefully • The number of strokes is 8 • According to Raman there are 5 different modes – resonances • Therefore 5 basis vectors were estimated in NMF

9 Akshay Ananthapadmanabhan, Ashwin Bellur and Hema A Murthy, “Modal analysis and transcription of strokes of the mridangam using non-negative matrix factorisation,” ICASSP 2013, Vancouver,Canada

Carnatic Music: A Computational Perspective MIR Indian Music Preliminaries Tonic Gamaka¯s in Carnatic Music Cent filterbanks Identifying the strokes of the mridangam Pitch Extraction Conclusions

Modes of Mridangam and Activations for different strokes

FirstTone 40

20

0 0 20 40 60 80 100 120 SecondTone 40

20

0 0 20 40 60 80 100 120 ThirdTone 200

100

0 0 20 40 60 80 100 120 FourthTone 100

50

0 0 20 40 60 80 100 120 FifthTone 200 Tones 100

0 0 20 40 60 80 100 120 Figure: Modal Tones Figure: Strokes and their modes

Carnatic Music: A Computational Perspective MIR Indian Music Preliminaries Tonic Gamaka¯s in Carnatic Music Cent filterbanks Identifying the strokes of the mridangam Pitch Extraction Conclusions

Transcription of strokes

• Training of HMMs using activations • Testing on untranscribed data – use NMF activations to segment the strokes • Performance evaluation • An accuracy of about 87-89% was obtained on a controlled experiment. • Current work: • Tonic independence using CQT and cent filterbanks along with HMMs (unpublished) – very good results – Sarala/Akshay will talk about this. • Dr. Umayalpuram Sivaraman is working with us on transcribing data for his tanis – he has defined a number of new strokes – a task by itself. • Other upcoming artists are helping us transcribe various other artists.

Carnatic Music: A Computational Perspective MIR Indian Music Preliminaries Tonic Gamaka¯s in Carnatic Music Cent filterbanks Identifying the strokes of the mridangam Pitch Extraction Conclusions

Pitch extraction using modified group delay functions10

• A group delay based approach to pitch extraction. • Flattened magnitude spectrum is a sinusoid. • Estimate the frequency of the sinusoid using modified group delay functions. • Results are encouraging – Rajeev Rajan will give details.

10 Rajeev Rajan and Hema A Murthy, “Group delay based melody monopitch extraction from music,” ICASSP 2013.

Carnatic Music: A Computational Perspective MIR Indian Music Preliminaries Tonic Gamaka¯s in Carnatic Music Cent filterbanks Identifying the strokes of the mridangam Pitch Extraction Conclusions

Conclusions

• Different Aspects of Carnatic Music Analysis addressed • Tonic identification: Group delay based, NMF • Motif discovery: Rough Longest Common Subsequence • Segmentation of recordings of Carnatic Music • Transcription of mridangam • Future Work • Motif discovery using , oneliners of • Does an Alapana¯ in CM have a rhythm? • Processing music uploaded into Music Brainz – convert many of the research ideas to enhance MIR • Multipitch pitch tracking using phase. • Cognitive signal processing and music – signal processing and machine learning used in tandem.

Carnatic Music: A Computational Perspective