Page 5 - Rouyer_Fromentin_2008
P. 5

Rouyer et al.: Wavelet analysis of multiple time series           15




                                                                          n−1
              decomposition is more general than the eigenvalue  DL ,L = ) ∑ atan[ ( L t ()− L t ())−(  k  (  k  (  1))]
                                                                  (
                                                                                  k
                                                                                        k
                                                                    k
                                                                      k
                                                                                                       j
                                                                                               i
              decomposition used for principal component analysis,  i  j          i     j     L t +11)− Lt +
                                                                          t=1
              but the 2 decompositions are nevertheless related. A                                          (8)
                                                                                                        k
              singular value decomposition performed on the     with n being the length of the vectors, and L i (t) and
                                                                 k
              covariance matrix between 2 similar wavelet spectra  L j (t) being the kth pair of leading patterns for W i and
              would yield the same results as an eigenvalue de-  W j . This metric compares 2 vectors by measuring the
              composition. The number of non-zero singular values  angle between each pair of corresponding segments,
              (k) of  ΓΓ is  inferior or equal to the number of fre-  defined by the consecutive points of the 2 vectors;
              quencies analysed and each one is associated with a  2 parallel vectors will thus lead to a null distance
              pair of singular vectors (frequency patterns), that are  (Fig. 2). This metric could be interpreted as a robust
              respectively associated with each spectrum.       version of the correlation between the derivatives of
                The leading patterns show how respective fre-   the leading patterns and/or the leading vectors.
              quency patterns evolve in time, and are obtained by  The sum of angles obtained is a comparable metric
              projecting each wavelet spectrum onto its respective  between each pair of leading patterns and singular
                                     k
                              k
              singular vectors. L i (t) and L j (t) are respectively the kth  vectors. The distance was then computed as the
              leading patterns for W i and W j , and are computed as  weighted mean of the distance for each of the k pairs of
              follows:                                          singular vectors and leading patterns retained (the
                                   ƒ =F                         weights being equal to the amount of covariance
                               ()
                              k
                             Lt = ∑ U  k  × W (ƒ ,t )           explained by each axis). For the comparison of the
                                           i
                              i
                                   ƒ =1
                                                   and        (6)  wavelet spectra  i and  j, we compute the distance
                                   ƒ =F                         DT(i,j) according to the following formula:
                               ()
                              k
                             Lt = ∑ V  k  × W (ƒ ,t )                         k=K
                                           j
                              j
                                                                                            )
                                                                                       (
                                   ƒ =1                                       ∑  w ×( D L ,L + D(UV  j k ))
                                                                                           k
                                                                                                  k
                                                                                         k
                                                                                                   ,
                                                                                  k
                                                                                         i
                                                                                                  i
                                                                                           j
                                                                        (
                                                                           )
              with F the maximum frequency common to both spec-       DT i, j =  k=1                        (9)
                                                                                        k=K
              tra. It is then possible to reconstruct the initial wavelet                ∑  w k
              spectra with a given number, N, of leading patterns by                    k k=1
              the following relationships:                      with w k being the weights, set equal to the amount of
                                                                covariance explained by each axis. The distances,
                                    k=N
                                         k
                               W i N  = ∑  U ×  L k i           DT(i,j), were then used to fill a distance matrix suitable
                                    k=1                         for cluster analysis (Fig. 1). The larger the amount of
                                                   and        (7)
                                    k=N                         covariance, the larger the number of axes retained; a
                                        k
                              W j N  = ∑  V × L k j             large number of axes will enable to take into account
                                    k=1                         more detailed common time–frequency features be-
                These correspond to the product of a matrix formed  tween the 2 spectra.
              with N singular vectors and another one formed with  All the computations were done using R version 2.4
              the N corresponding leading patterns. The reconstruc-  (R Development Core Team [2006] R: a language and
              tion for the kth axis is therefore determined by the kth  environment for statistical computing). This is avail-
              singular vector and the  kth leading pattern, and the  able online at: www.R-project.org.
              larger  k is, the less important is the common
              covariance explained. A reconstruction corre-
              sponds to a filtered representation of the spec-
              trum; the less important N is, the more impor-
              tant is the filter.
                Computing the distance between the
              wavelet spectra: The distance between 2 wa-
              velet spectra was measured by comparing the
              leading patterns and the singular vectors ob-
              tained by the MCA over a given number of
              axes (that correspond to a fixed percentage of
              the total covariance). As the relationships be-  Fig. 2. Computation of the distance index used between the pair of
                                                                                                 k
                                                                                            k
              tween the 2 singular vectors and between the 2  kth leading patterns (or singular vectors), L i and L j (see Eq. 8). The
                                                           absolute difference between the pair of vertical dashed lines for
              leading patterns were not linear, they could not
                                                           every pair of segments of the 2 series is computed and the angle a,
              be compared using a simple correlation. We
                                                           between each pair of segments is obtained by taking the atan of the
              thus computed the following distance (D) mea-  absolute differences. Summing over the segments yields the total
              sure adapted from Keogh & Pazzani (1998):                   angle between the 2 series
   1   2   3   4   5   6   7   8   9   10