Metrics for describing dyadic movement: a review

Joo, Rocio; Etienne, Marie-Pierre; Bez, Nicolas; Mahévas, Stéphanie

doi:10.1186/s40462-018-0144-2

Review
Open access
Published: 27 December 2018

Metrics for describing dyadic movement: a review

Rocio Joo ORCID: orcid.org/0000-0003-0319-4210^1,2,
Marie-Pierre Etienne³,
Nicolas Bez⁴ &
…
Stéphanie Mahévas²

Movement Ecology volume 6, Article number: 26 (2018) Cite this article

4172 Accesses
19 Citations
25 Altmetric
Metrics details

Abstract

In movement ecology, the few works that have taken collective behaviour into account are data-driven and rely on simplistic theoretical assumptions, relying in metrics that may or may not be measuring what is intended. In the present paper, we focus on pairwise joint-movement behaviour, where individuals move together during at least a segment of their path. We investigate the adequacy of twelve metrics introduced in previous works for assessing joint movement by analysing their theoretical properties and confronting them with contrasting case scenarios. Two criteria are taken into account for review of those metrics: 1) practical use, and 2) dependence on parameters and underlying assumptions. When analysing the similarities between the metrics as defined, we show how some of them can be expressed using general mathematical forms. In addition, we evaluate the ability of each metric to assess specific aspects of joint-movement behaviour: proximity (closeness in space-time) and coordination (synchrony) in direction and speed. We found that some metrics are better suited to assess proximity and others are more sensitive to coordination. To help readers choose metrics, we elaborate a graphical representation of the metrics in the coordination and proximity space based on our results, and give a few examples of proximity and coordination focus in different movement studies.

Introduction

Collective behaviour has been the object of study of many disciplines, such as behavioural ecology, psychology, sports, medicine, physics and computer sciences [7, 13, 19, 56, 57]. In multiple contexts, individuals – in a very wide sense of the word – adapt their behaviour as a function of their interaction with others. In movement ecology, where movement is regarded as an expression of behaviour [43], collective behaviour should be considered as a key element given that collective dynamics and individual movement are intricately intertwined [7]. Accordingly, mechanistic movement models should account for these dynamics. The vast majority of movement models neglect this aspect, with a few exceptions (e.g., [29, 44, 47, 53]). The consequence has been that the forms that these dynamics take in the few existing works rely on very simple theoretical assumptions.

Collective behaviour can be produced at large group scales (flocks, colonies, schools) but also at small group scales (triads, dyads). Regardless of the actual group scale, global patterns of collective behaviour originate from local interactions among neighbouring members [11], so analysing dyad interaction as a first step is a pertinent choice. Concerning dyadic interaction, here we focus on what we call ‘joint movement’, where two individuals move together during the total duration or a partial segment of their paths. Dyadic movement behaviour has been mostly studied in a data-driven approach, using several metrics to quantify it. In movement ecology, few works have applied and compared some of these metrics [38, 41]. However, their theoretical properties, and thus the similarities and differences in their construction and in what they actually assess, have not been thoroughly analysed yet.

This manuscript reviews a series of metrics used to assess pairwise joint-movement and proposes some modifications when appropriate (Table 1). Two criteria are taken into account for the review of these metrics: practical use and dependence on parameters; they are evaluated through both a theoretical (conceptual) as well as a practical approach. Metrics found in the literature essentially measured two aspects of joint movement: proximity and coordination. Proximity refers to closeness in space-time, as in how spatially close simultaneous fixes (individual locations recorded) are in a dyad; a point pattern perspective. The notion of proximity is thus subjective, since a judgement on proximity involves a threshold in distance whether local or global, or the definition of a reference zone (where encounters may be observed). Coordination, on the other hand, refers to synchrony in movement, which can be assessed through measures of similarity or correlation in movement patterns such as speed or direction. There might be a thin line between proximity and coordination, and some metrics may be associated with both at some degree, as we show through the description of their theoretical properties and the practical analysis of case scenarios.

Table 1 Metrics for measuring dyad joint movement

Full size table

The manuscript is thus organized as follows. We first describe the criteria used to evaluate the metrics as indices of dyadic joint movement. We then present the different metrics and their theoretical properties with special attention to their dependence towards parameters. Next, we define case scenarios to evaluate the practical properties of the metrics. In the last section, we discuss the overall suitability of the metrics for assessing joint movement in ecology and give some practical guidelines for their use.

Evaluation criteria

We categorized the desirable properties of metrics for assessing dyadic joint movement into three criteria: practical use, considered the most important one; dependence on parameters; and computational cost:

C1
Practical use [50, 52, 58]: 1) A metric is useful if it is interpretable and reflects a marked property of collective behaviour. 2) It should also be sensitive to changes in patterns of joint movement (e.g. higher values for high joint movement and lower values for independence in movement). 3) Being able to attain the theoretical range of values would also be important, as not doing so makes it harder to interpret empirical values. C1 is therefore a three dimensional criterion comprising interpretation, sensitivity and attainable range. Attainable range is covered in the theoretical properties section; we highlight the difficulties or implausibility to attain minimum and maximum values for the metrics when this is true. How to interpret each metric is also explained in this section; evidently, a metric without an attainable range is difficult to interpret. Sensitivity is addressed in the case-scenario section.
C2
Dependence on parameters: A metric that depends on few parameters and hypotheses is more robust and generic than one that strongly relies on many parameters and hypotheses, since the former can produce more easily comparable results and interpretations. In addition, an ideal metric can be defined in such a way that the user can easily see how a change in the values of the parameters or in the components related to movement assumptions conditions the metric derivations and interpretations. In the next section, we describe the assumptions underlying each metric and the parameters needed to be fixed by the user. This description will allow distinguishing user-tractable parameter-dependent metrics from those that are not.

Definition and theoretical properties of the metrics

In the following subsections the metrics are defined and their theoretical properties are described. A summary is proposed in Table 1. Considering two individuals named A and B, the position of A (resp. B) at time t is denoted by $X_{t}^{A}$ (resp. $X_{t}^{B}$). The distance between A at time t₁ and B at time t₂ will be referred to as $d_{t_{1},t_{2}}^{A,B}$. When the distance between two individuals is regarded at simultaneous time, this will be shortened to $d_{t}^{A,B}$. Whenever possible, metrics introduced by different authors but that are actually very similar in their definition, are grouped under a unified name and a general definition.

Proximity index (Prox)

The proximity index (Prox in [5]) is defined as the proportion of simultaneous pairs of fixes within a distance below an ad hoc threshold (Fig. 1). Other metrics in the literature are actually analogous to Prox: the coefficient of association (Ca) [12] and the I_AB index [4]. Denoting by T the number of pairs of fixes in the dyad, we propose a unified version of those metrics using a kernel K (formula 1):

$$ Prox_{K,\delta} = \frac{1}{T} \sum\limits_{t=1}^{T} K_{\delta}\left(X_{t}^{A}, X_{t}^{B}\right), $$

(1)

where δ is a distance threshold parameter.

Choosing $K_{\delta }(x,y) =\mathbbm {1}_{\{ \| x-y \| < \delta \}}$ ($\mathbbm {1}_{\{\}}$ represents the indicator function) as a kernel leads to the Prox metric in [5], denoted by $Prox_{\mathbbm {1},\delta }$ henceforward. Instead, choosing K_δ(x,y)= exp(−∥x−y∥²/(2δ²)) gives the I_AB index. Regarding Ca, for simultaneous fixes, its definition becomes exactly the same as $Prox_{\mathbbm {1},\delta }$ (using Ca’s adaptation to wildlife telemetry data shown in [38]).

Most of the proximity-related metrics are based on symmetric kernels and depend only on the distance between A and B; therefore, the formula notation (1) can be simplified as:

$$ Prox_{K,\delta} = \frac{1}{T} \sum\limits_{t=1}^{T} K_{\delta}\left(X_{t}^{A}, X_{t}^{B}\right) = \frac{1}{T} K_{\delta}^{+}. $$

(2)

If the distance between two individuals is below the threshold δ during their whole tracks, $Prox_{\mathbbm {1},\delta }$ will be 1 (and 0 in the opposite case). $Prox_{\mathbbm {1},\delta }$ might be interpreted as the proportion of time the two individuals spent together. This interpretation is, of course, threshold dependent. The I_AB index provides a smoother measure of the average proximity between two individuals along the trajectory. Proximity is thus dependent on the choice of a δ parameter and of a kernel function. Graphical examples illustrating the differences in $K_{\delta }(x,y) =\mathbbm {1}_{\{ \| x-y \| < \delta \}}$ and K_δ(x,y)= exp(−∥x−y∥²/(2δ²)) are in Additional file 1.

Coefficient of Sociality (Cs)

The Coefficient of Sociality (Cs) [26] compares the mean (Euclidean) distance between simultaneous pairs of fixes (D_O) against the mean distance between all permutations of all fixes (D_E).

$$ Cs = \frac{D_{E} - D_{O}}{D_{E} + D_{O}} = 1 - 2 \frac{D_{O}}{D_{E} + D_{O}}, $$

(3)

where

$$D_{O} = \left(\sum\limits_{t=1}^{T} d_{t}^{A,B} \right)/T, $$

and

$$D_{E} = \left(\sum\limits_{t_{1}=1}^{T}\sum\limits_{t_{2}=1}^{T}d_{t_{1},t_{2}}^{A,B}\right)/T^{2}. $$

Kenward et al. [26] stated that Cs belongs to [−1,1], and it has been used as a symmetrical index since. Nevertheless, that is not true. Cs equals 1 if and only if D_O=0 and D_E≠0, which occurs only when the two individuals always share the exact same locations. However, Cs equals −1, if and only if D_E=0 and D_O≠0, which is impossible (it could asymptotically approach to 1 for very large series when DO approaches infinity). Cs equals 0 when D_O=D_E.

If all simultaneous fixes are very proximal but not in the same locations, Cs would approach 1 (how close to 1 would depend on the value of D_E as illustrated in the right hand side of Eq. 3). Moreover, only if D_E<D_O, Cs can take a negative value. For Cs to take a largely negative value, the difference in the numerator should be very large compared to the sum in the denominator; in Additional file 2 we show how implausible that situation is and how sensitive it is to the length of the series. The latter makes Cs from dyads of different length difficult to compare, because their real range of definition would differ. This fact is neither evoked in the work that introduced the metric [26] nor in the ones that evaluated this and other metrics [38, 41], despite the fact that in those works no value lower than −0.1 was obtained.

Indeed, [26] assumed that the permutation of all fixes is a way to represent locations of independent individuals. While this is questionable, some modified versions, as the one proposed by [62], use correlated random walks as null models and simulated independent trajectories under these models to replace D_E by a more realistic reference value. Thus, a generalized version of Cs would be:

$$ Cs = \frac{ D_{chance} - D_{O}}{D_{chance} + D_{O}}, $$

(4)

where D_chance is defined through a user-chosen movement model for independent trajectories.

The Half-weight Association Index (HAI)

The Half-weight Association Index (HAI) proposed by [10] measures the proportions of fixes where individuals are close to each other (within a user-defined threshold). By that definition, HAI is exactly the same as $Prox_{\mathbbm {1},\delta }$. However, HAI was popularized by [2] in another form that did not consider all fixes for the computation of the metric, but used counts with respect to a reference area (called overlapping zone in the original paper):

$$ HAI = \frac{K_{\delta}^{+}}{K_{\delta}^{+} + \frac{1}{2}(n_{A0}+n_{0B})} $$

(5)

where n_AB (resp n_A0; n_0B; n₀₀) is the number of simultaneous occurrences of A and B in the reference area S_AB (resp. simultaneous presence of A and absence of B; simultaneous absence of A and presence of B; simultaneous absence of A and absence of B), and where $K_{\delta }^{+}$ is computed over the reference area.

It is worth noticing that the HAI adaptation proposed by [2] does not correctly account for spatial joint movement, as would do a $Prox_{\mathbbm {1},\delta }$ version constraint to the reference area; i.e. the denominator should be equal to n_AB + n_A0 + n_0B, which is the total number of simultaneous fixes where at least one individual is in the reference area.

The dependence to the definition of an overlapping zone or reference area is discussed in the following subsection dedicated to L_ixnT, which also relies on the definition of a static reference area.

If the individuals remain together (i.e. in the reference area and closer than δ) all the time, HAI is close to 1, and 0 in the opposite case. An example of the computation of HAI under [2]’s definition is given in Fig. 2.

Coefficient of Interaction (L _ixn and L _ixn T)

Minta [42] proposed a Coefficient of Interaction (L_ixn) that assesses how simultaneous are the use and avoidance of a reference area S_AB by two individuals:

$$ L_{ixn} = \ln{\left(\frac{n_{AB}/p_{AB} + n_{00}/p_{00}}{n_{A0}/p_{A0} + n_{0B}/p_{0B} }\right)}, $$

(6)

where p_AB is the probability, under some reference null model, of finding A and B simultaneously in S_AB (the same interpretation as for n when a subscript is 0; see HAI subsection). Attraction between individuals would cause greater simultaneous use of S_AB than its solitary use, which would give positive values of L_ixn. Conversely, avoidance would translate into negative values of L_ixn, since use of S_AB would be mostly solitary. A logistic transformation of the metric (L_ixnT) produces values between 0 (avoidance) and 1 (attraction), making the interpretation easier:

$$ L_{ixn}T = logistic(L_{ixn})= \frac{1}{1+e^{-L_{ixn}}}. $$

(7)

Minta [42] proposed two different approaches for computing the associated probabilities conditionally to the fact that the reference area is known (see examples in Fig. 2 and Table in Additional file 3). In both cases, the probabilities are estimated under the assumptions of independence in movement among the individuals and of uniform utilization of the space. Indeed this latter assumption can be relaxed and p_AB can be derived from any kind of utilization distribution (see for instance [20] for the estimation of utilization distribution).

HAI and L_ixnT (thus L_ixn as well) rely heavily on a static reference area – either known or estimated – and on the probabilities of presence within this reference area. The static reference area could be defined, for instance, as the intersection of the respective home ranges of A and B. However, there are many approaches for estimating home ranges, each one relying on particular assumptions about the spatial behaviour of the studied populations [9]. Thus, S_AB is not a simple tuning parameter. The way it is defined may completely modify the output. If the reference area is equal to the whole area of movement of the two individuals, then both the numerator and the denominator in the logarithm are equal to infinity and L_ixnT cannot be derived. That problem could arise for extremely mobile individuals, such as tuna, turtles and seabirds [8], or fishing vessels [6], and avoiding it would require the computation of multiple dynamic reference areas. Therefore, L_ixnT may be better used for specific cases where the definition of the reference area relies on a deep knowledge of the spatial behaviour of the populations.

Joint Potential Path Area (jPPA)

Long et al. [39] computed the relative size of the potential encounter area at each time step of two individuals’ tracks. Assuming a speed limit ϕ, the potential locations visited between two consecutive fixes define an ellipse (Additional file 4). Then, the potential encounter area corresponds to the intersection between the ellipses of the two individuals (at simultaneous time steps; see Fig. 3). The overall potential meeting area is given by the spatial union of all those potential encounter areas. This area is then normalized by the surface of the spatial union of all the computed ellipses to produce the joint Potential Path Area (jPPA) metric ranging from 0 to 1 (see formula in Table 1). jPPA values close to 0 indicate no potential spatio-temporal overlap, while values close to 1 indicate a strong spatio-temporal match.

Several issues can be discussed here. First, no movement model is assumed and therefore the method confers the same probabilities of presence to every subspace within the ellipse regions. This is clearly unrealistic as individuals are more likely to occupy the central part of the ellipse because they cannot always move at ϕ, i.e. maximal speed. Second, the computation of the ellipses relies strongly on the ϕ parameter. If ϕ is unrealistically small, it would be impossible to obtain the observed displacements and the ellipses could not be computed. By contrast, if ϕ is too large, the ellipses would occupy such a large area that the intersected areas would also be very large (hence a large jPPA value). Alternatively, [36] proposed a dynamic computation of ϕ as a function of the activity performed by the individual at each fix. Within this approach, additional information or knowledge (i.e. other data sources or models) would be required for the computation of ϕ.

Cross sampled entropy (CSE and CSEM)

Cross sampled entropy (CSE) [51] comes from the time series analysis literature and is used for comparing pairs of motion variables [3; 18, e.g.]. It evaluates the similarity between the dynamical changes registered in two series of any given movement measure. Here we present a simplification of the CSE for simultaneous fixes and position series. A segment of track A would be said to be m-similar to a segment of track B if the distance between paired fixes from A and B remain below a certain threshold during m consecutive time steps. If we define N_m as the number of m-similar segments within the series, then CSE can be defined as (the negative natural logarithm of) the ratio of N_m+1 over N_m and might be understood as (the negative natural logarithm of) the probability for an m-similar segment to also be (m+1)-similar. Formally, CSE is defined as:

$$ \begin{aligned} CSE_{\delta}(m) &= -\ln\left\{\frac{ \sum\nolimits_{t=1}^{T-m} \mathbbm{1}{\left\{ \left(\max_{k \in [0,m]} \left| X_{t+k}^{A} - X_{t+k}^{B} \right| \right) < \delta \right\} } }{\sum\nolimits_{t=1}^{T-m} \mathbbm{1}{\left\{ \left(\max_{k \in [0,m-1]} \left| X_{t+k}^{A} - X_{t+k}^{B} \right| \right) < \delta \right\} }}\right\}\\ &=-\ln\frac{N_{m+1}}{N_{m}}, \end{aligned} $$

(8)

A large value of CSE corresponds to greater asynchrony between the two series, while a small value corresponds to greater synchrony.

CSE relies on an ad hoc choice of both m and δ. In practice, it is expected that the movement series of A and B will not be constantly synchronous and that, for a large value of m, N_m could be equal to 0, in which case CSE would tend to ∞. Therefore, the largest value of m such that N_m>0, i.e. the length of the longest similar segment, could be an alternative indicator of similarity between the series (do not confuse with the longest common subsequence LCSS; see [60]). We propose to use this measure (standardized by T−1 to get a value between 0 and 1) as an alternative index of joint movement (formula 9), which we denote by CSEM. An example of a dyad and the computation of its CSEs and CSEM is shown in Fig. 4.

$$ CSEM = \frac{\max \left\{m; N_{m} >0\right\} }{T-1}, $$

(9)

with the convention that max{∅}=0.

Correlations (r _V)

Pearson and Spearman correlations between variables such as longitude, latitude, distance, velocity, acceleration and turning angles from pairs of tracks, have been used as measures of synchrony in several studies (e.g. [16],). Correlations are easy to interpret. Pearson correlation coefficients (Table 1) assess linear correlations, while Spearman correlation coefficients based on ranks statistics capture any functional correlation. The correlation in a given V variable between dyads is denoted by r_V.

Dynamic Interaction (DI, D I _d and D I _θ)

Long and Nelson [37] argued that it is necessary to separate movement patterns into direction and displacement (i.e. distance between consecutive fixes or step length), instead of computing a correlation of locations [55] which may carry a mixed effect of both components. To measure interaction in displacement, at each time step, the displacements of simultaneous fixes are compared (formula 10).

$$ g_{t}^{\beta} = 1- \left(\frac{\left| d_{t,t+1}^{A} - d_{t,t+1}^{B} \right|}{d_{t,t+1}^{A} + d_{t,t+1}^{B}}\right)^{\beta} $$

(10)

where β is a scaling parameter meant to give more or less weight to similarity in displacement when accounting for dynamic interaction. As β increases, $g_{t}^{\beta }$ is less sensitive to larger differences in displacement. Its default value is 1. When $d_{t,t+1}^{A} = d_{t,t+1}^{B}, g_{t}^{\beta }=1$; and when the difference in displacement between A and B at time t is large, $g_{t}^{\beta }$ approaches zero. For $g_{t}^{\beta }$ to be 0, one (and only one) of the individuals in the dyad should not move; for a sum of $g_{t}^{\beta }$ to be equal to zero, at every time t one of the two individuals should not move.

Interaction in direction is measured by

$$ f_{t} = \cos\left(\theta_{t,t+1}^{A} - \theta_{t,t+1}^{B}\right) $$

(11)

where θ_t,t+1 is the direction of an individual between time t and t+1. f_t is equal to 1 when movement segments have the same orientation, 0 when they are perpendicular and −1 when they go in opposite directions.

Long and Nelson [37] proposed 3 indices of dynamic interaction: 1) DI_d, dynamic interaction in displacement (average of all $g_{t}^{\beta }$); 2) DI_θ, dynamic interaction in direction (average of all f_t); and 3) DI, overall dynamic interaction, defined as the average of $g_{t}^{\beta } \times f_{t}$ (Table 1). DI_d ranges from 0 to 1, DI_θ from -1 to 1, and DI from -1 (opposing movement) to 1 (cohesive movement). Figure 5 shows an example of the three indices.

Conclusions on the theoretical properties of the metrics

Practical use (C1): While each metric concerns a concrete aspect of joint-movement behaviour, some of them, such as Cs and DI, are harder to interpret. DI mixes up the coordination in displacement and direction. When DI is close to 1, it is certainly explained by high values in both components. When it is close to −1, it is an indication of overall high displacement coordination but in opposite directions. With values around zero, however, it is impossible to know if it is because of displacement or direction or both. For Cs, because obtaining values close to −1 is extremely rare, values around zero and, more particularly, slightly negative values are difficult to interpret. In addition, the maximum attainable value depends on the length of the series, which is likely to vary from dyad to dyad (Additional file 2).

Dependence on parameters (C2): Almost every metric depends on the ad hoc definition of a parameter or component, as summarized in Table 1. This is consistent with the fact that, since there is no consensus on the definition of behaviour [34], and much less on that of collective behaviour, its study depends heavily on the definition that the researcher gives to it. It should be noted that behind each choice of a parameter value, there is also an underlying assumption (e.g. that a distance below a δ value means proximity); the difference is that parameters can be tuned, and a variety of values can be easily tested. HAI and L_ixnT make a critical assumption of a static reference area, and its definition, which may be tricky for highly mobile individuals, is a key issue for the computation of both metrics. On the other hand, r_V and DI_θ are the only metrics that do not depend on parameter tuning or assumptions for its derivation; except for the assumptions of correlations being linear, or of linear movement between two successive positions when deriving directions, respectively.

Exploration of metrics through case scenarios

In this section we used schematic, simple and contrasting case scenarios to evaluate the ability of the metrics to assess joint movement, in terms of proximity and coordination.

To build the case scenarios, we considered three levels of dyad proximity (high, medium and low); coordination was decomposed into two aspects, direction (same, independent and opposite) and speed (same or different). Eighteen case scenarios were thus built, with one example of dyad per scenario (Fig. 6; metrics in Additional file 5). The dyads for each case scenario were deliberately composed of a small number of fixes (∼10 simultaneous fixes, as in [37],) to facilitate interpretation of the metric values and the graphical representation of the arbitrarily constructed tracks (online access to tracks in github repository; see Availability of data and materials section). To assess the sensitivity of the metrics to changes in patterns of proximity and coordination, the case scenarios were grouped according to the categories in Table 2.

Table 2 Case scenarios

Full size table

Due to the simplicity for its interpretation, Prox was defined as $Prox_{\mathbbm {1},\delta }$. Three distance thresholds $Prox_{\mathbbm {1},\delta }$ of 1, 2 and 3 distance units were used for Prox, HAI and CSEM, thus denoted for instance Prox₁, Prox₂ and Prox₃. For Cs, the original definition (Eq. 3) was used. jPPA, ϕ was arbitrarily fixed to 10. Regarding dynamic interaction, β was fixed to 1. The v variables for Pearson correlations (Table 1) were longitude (r_Lon), latitude (r_Lat) and speed (r_Speed). An average of correlations in longitude and latitude, denoted by r_Lonlat, was also computed. Boxplots of each metric were derived for each proximity and coordination category (Figs. 7, 8 and 9).

The values taken by Prox, jPPA, CSEM and, to a lesser degree, Cs, showed sensitivity to the level of proximity (Fig. 7). Conversely, no association was revealed between the proximity scenarios and the metrics based on correlation, dynamic interaction and reference area occupation.

Changes in direction were reflected in values taken by correlation metrics on location (r_Lonlat,r_Lon and r_Lat) and two dynamic interaction metrics, DI and DI_θ (Fig. 8). Cs took lower values in scenarios of opposite direction, but independent and same direction scenarios reflected no distinction for this metric. High correlation in speed was found for scenarios of opposite and same direction, while a large variability was found when direction was independent. r_speed showed differences when direction was independent between dyads, but no distinction was caught by the metric between same and opposite direction scenarios. The other metrics did not show distinguishable patterns related to changes in direction coordination.

Concerning coordination in speed, the most sensitive metric was DI_d, which measures similarity in the distances covered by individuals at simultaneous fixes (Fig. 9). r_Speed took a wide range of values when speed was not coordinated, while it was equal to 1 when perfectly coordinated. DI_d is more sensitive to changes in the values of speed (similar to step length because of the regular step units) than r_speed which characterizes variations in the same sense (correlation), rather than correspondence in values. HAI and L_ixnT showed slight differences in their ranges of values with changes in speed-coordination scenarios. When analysing combined categories of proximity and speed-coordination, and proximity and direction-coordination, less distinctive patterns were found, probably due to the higher number of categories, each containing fewer observations (Figure in Additional file 6).

Overall, Prox, jPPA, CSEM, r_Lonlat,r_Speed,DI_d,DI_θ and DI were highly sensitive to changes in patterns of either proximity or coordination. For proximity scenarios, the variance of some metrics for each category was also sensitive to the δ chosen; i.e. for larger δ, the variance of Prox and CSEM decreased in high proximity, while it increased for low proximity cases. This pattern does not hold for HAI, probably due to the strong dependence of this metric on the arbitrary choice of the reference area. Cs showed a slight sensitivity to changes in direction and proximity scenarios, although the values taken for each type of case scenario did not show a clear separation.

Synthesis of metric analysis

Table 3 summarizes the theoretical and case-scenario analyses. Most metrics reflected marked properties of dyadic joint movement, evidenced both theoretically and through the case scenario assessment. Exceptions were Cs, HAI and L_ixnT. Cs was sensitive to the null model for the distance expected by chance (D_chance; formula 4), it did not attain its whole range of definition, turned out to be asymmetric and dependent on the length of the series (Additional file 2), and was less sensitive than the other metrics to changes in patterns of joint movement. Perhaps a change in the null model for D_chance could improve Cs’s power to assess joint movement, though the new null model should be justified. HAI and L_ixnT, dependent on the reference area definition, were even less sensitive to changes in joint movement patterns. This supports our earlier statement that L_ixnT and HAI should only be used when a reference area exists and is known. Alternatively, Prox works as a simpler metric and is highly sensitive to changes in proximity. The only drawback of Prox is the need to choose a distance threshold parameter, eventually based on prior knowledge of the spatial dynamics of the population. Otherwise, a set of values can be tested, as shown here. jPPA presents the advantage of not requiring the knowledge of a reference area, but still relies on assumptions related to equal probability of presence in an ellipse, which strongly depends on a ϕ parameter whose tuning is not obvious.

Table 3 Evaluation of the two criteria for each metric

Full size table

CSEM evaluates the similarity between the dynamical changes in movement patterns within a δ bandwidth, and, because of that, was expected to be more sensitive to changes in proximity than in coordination. It should be further assessed if using other variables for deriving CSEM (i.e. using [51] generic definition) could make it more sensitive to coordination than proximity. As with Prox, it is in the hands of the user to tune the threshold parameter. Because we were using locations as the analysed series (so the dynamical changes assessed were in fact changes in distance), we used exactly the same threshold values as for Prox. By contrast, correlations in location (r_Lon,r_Lat,r_Lonlat) did show sensitivity to changes in coordination, as expected. The same occurred with DI_θ and DI. Correlation in speed was sensitive to changes in both coordination components, showing high variance when there was no coordination (independent direction or speed). DI_d, on the other hand, was only sensitive to changes in speed. Because the time-step was regular, identical speed was equivalent to identical covered distance (at simultaneous fixes), which explained how in those scenarios DI_d was equal to 1. While DI behaved more similarly to DI_θ, its definition makes it impossible to separate the effects of coordination in displacement and in azimuth, which makes the interpretation of the metric more difficult than interpreting DI_d and DI_θ independently.

We also analysed the computational cost associated to these metrics. We simulated 50000 dyads with trajectories following a Brownian motion, each one composed of 100 fixes. Using a parallelization procedure, we found low CPU times for all metrics (<1 s) except jPPA (∼68 s). CPU time for jPPA and CSEM increased when we increased the number of fixes to 1000, to ∼161 and ∼94 s, respectively. It should be noted that for jPPA, the areas of intersection and union of the ellipses were approximated by grid cells, so for smaller cell sizes (i.e. more accurate jPPA estimation), the computational cost would increase. Researchers with long series of trajectories and a large amount of dyads should take this into consideration (results for the computational cost and more details on its calculation are in Additional file 7).

Although this review is directed at trajectory data (i.e. time series of locations that allow for movement path reconstruction) and the metrics presented here were defined for simultaneous fixes at regular time steps, technically speaking, some of these metrics could be computed only based on the identification of individuals simultaneously observed in a certain area (e.g. L_ixnT). These cases, which may be extremely sensitive to the spatial accuracy and the time intervals between observed fixes, are out of the scope of this review. For the case scenarios built to illustrate the metrics, we assumed that the granularity was correct, i.e. that the temporal and spatial resolution of the data were coherent in respect to the dyadic behavioural patterns under scope. Likewise, for practical uses of the metrics, researchers should 1) make sure that the spatiotemporal data that they are analysing allow reconstructing the movement paths of a dyad and 2) that the sampled (discretised) version of these paths are characterized by locations estimated with high precision, and that the time steps are small enough so that movement between two points could be assumed to be linear, so that the derivation of distances, speed and turning angles could be reliable. Further discussions on the importance of scale and granularity in the analysis of movement patterns can be found in [14, 30, 31].

We expected to obtain a binary classification of the metrics into proximity and coordination, based on the theoretical and case scenario evaluations. This was not so straightforward and we ended up instead with a 3-dimensional space representation (Fig. 10). Prox and CSEM are the most proximity-like indices. jPPA would be the third one due to its sensitivity to changes in proximity in the case scenario evaluation. Cs would be somewhere between Prox and direction coordination because it showed certain sensitivity to both HAI and L_ixnT are almost at the origin but slightly related to speed coordination. Theoretically, both metrics should account for proximity, since when two individuals are together in the same area, they are expected to be at a relative proximity; in practice, this was not reflected in sensitivity to proximity from HAI and L_ixnT. Still, HAI is represented in the graphic slightly above L_ixnT since its formulation specifically accounts for proximity in solitary use of the reference area. They are both graphically represented in association with the speed coordination axis because of the case scenario results which reflected that being in the same area only simultaneously requires some degree of synchrony. DI_d was the most sensitive metric to speed coordination, followed by r_Speed. DI_θ and r_Lonlat are the most strongly linked to direction coordination, seconded by DI, which is also related to speed coordination. A principal component analysis (PCA) using the values obtained for the case scenarios gave very similar results to those in Fig. 10 (Additional file 8), but this schematic representation is more complete because: 1) the theoretical and case-scenario assessment were both taken into account; 2) the PCA was performed without L_ixnT and HAI that had missing values for case scenarios with no common reference area (data imputation as in [25] was not appropriate for this case).

Figure 10 and Table 3 could be used as guidelines to choose the right metrics depending on the user’s case study. For instance, in an African lion joint-movement study [4], proximity was the focus of the study; in that case, the I_AB (Prox) metric was used. For similar studies several proximity-related metrics could be chosen; the choice would depend on the assumptions that the researcher is willing to make. In other cases, researchers may want to assess collective behaviour in tagged animals (e.g. birds or marine mammals) that do not remain proximal during their foraging/migration trips. Then, the collective behaviour component that could be evaluated would be coordination. Whether it is in direction or speed would depend on the researcher’s hypotheses. Coordination, or synchrony, has already been observed in some animal species such as northern elephant seals [17, e.g.] and bottlenecked sea turtles (e.g. [46]), among others. The use of the metrics presented here would allow a quantification of the pairwise behavioural patterns observed, a first step towards a quantitative analysis of the factors explaining those behaviours (e.g. physiological traits, personality or environmental conditions). The metrics presented here are applicable to any organism with tracking data (not necessarily georeferenced).

If the aim is to evaluate all three joint-movement dimensions, we advice to consider for each dimension at least one metric that is highly sensitive to it, rather than a metric that is weakly related to two or three. The complementarity of the metrics (i.e. multivariate approach) has not been studied here, and should be the focus of a future study.

Further perspectives on collective behaviour

The assessment of a ‘lagged-follower’ behaviour, where one individual would follow the other, was out of the scope of this work and should be addressed in the future. The study of this type of interactions is rather challenging, since the lag in the following behaviour is probably not static, and could vary between tracks and also within tracks. A few works use entropy-based measures similar to CSE (transfer entropy [54] or a causation entropy [40]), to measure how much the movement dynamics of an individual (called the source individual, or the leader) influences the transition probabilities in the movement dynamics of another individual [45, 61]. Some other works have focused on this type of interaction regarding it as a delay between trajectories and transforming the problem into one of similarity between trajectories, where one is delayed from the other [22, 27]. Metrics based on the Fréchet distance [1, 21] or the Edit distance [33] are common choices for measuring those similarities in computer science studies. In terms of computational cost, assessing following behaviour should be much more expensive than assessing joint movement.

This study focused on dyadic joint movement. The next step would be to identify metrics to characterize collective behaviour with more than two individuals. A pragmatical approach to investigate this more complex issue could be to identify, within large groups of individuals, the ones that move together for each given segment of trajectories (as dyads, triads or larger groups), and to study those dynamics. A similar procedure could then be used to spot following behaviour and leadership. Movement could be then regarded as spatio-temporal sequences of joint, following, hybrid and independence movement with one or more partners. Dhanjal-Adams et al. [15] present a Hidden Markov modelling approach to identify joint-movement states using metrics of direction and amplitude of flight synchronization in long-distance migratory birds (and assuming proximity between individuals). A similar approach could be used to identify more stages of collective behaviour, using several metrics as observed variables in the movement process.

Finally, a robust assessment of the different patterns of collective behaviour (e.g. proximal joint movement, coordination movement, follower movement) at multiple scales would provide realistic inputs for including group dynamic into movement models, which until now have relied on strong assumptions on collective behaviour in the few cases where it was taken into account [23, 29, 44, 47, 53], mostly due to the lack of understanding of collective motion.

Conclusions

The increasing availability of telemetry data for movement studies allow exploring patterns of collective movement. Here we reviewed metrics for assessing dyadic joint movement. We showed that some of the metrics were more suited for assessing proximity, others for coordination in direction or speed, and some others were not very sensitive to any of those aspects of joint movement. The results shown in this review offer guidelines to readers for choosing the metrics depending on which aspect of joint movement they would like to either describe or incorporate into movement models.

This study also contributes to highlighting the movement assumptions behind each metric as well as the parameters that need tuning. Users need to be able to decide whether these assumptions are realistic for their case studies, and to understand the consequences of their choice of parametrization. An accurate interpretation of movement patterns (here dyadic movement) relies on understanding the tools – in this case, metrics – used for obtaining those patterns.

Though the present work only concerns dyadic movement, further studies should concern the identification of larger groups moving together, where the size of the group would change in time, and metrics that would account for more than two individuals.

Abbreviations

C1:: Practical use criterion
C2:: Dependence on parameters criterion
Ca:: Coefficient of association
Cs:: Coefficient of sociality
CSE:: Cross sampled entropy
CSEM:: Standardized cross sampled entropy
DI:: Dynamic interaction index
D I _θ :: Dynamic interaction in direction
D I _d :: Dynamic interaction in displacement
HAI:: Half-weight association index
jPPA:: Joint potential path area
L _ixn :: Coefficient of interaction
L _ixn T :: Transformed coefficient of interaction
Prox:: Proximity index
r _V :: Correlation (on variable V)
S _AB :: Reference area

References

Aronov B, Har-peled S, Knauer C, Wang Y, Wenk C. Frechet Distance for Curves, Revisited. Algorithms - ESA 206. 2006;:52–63. 1504.07685. Accessed 03 2019.
Atwood TC, Weeks HP. Spatial home-range overlap and temporal interaction in eastern coyotes: the influence of pair types and fragmentation. Can J Zool. 2003; 81(9):1589–97. https://doi.org/10.1139/z03-144.
Article Google Scholar
Barnabe L, Volossovitch A, Duarte R, Ferreira AP, Davids K. Age-related effects of practice experience on collective behaviours of football players in small-sided games. Hum Mov Sci. 2016; 48:74–81. https://doi.org/10.1016/j.humov.2016.04.007.
Article Google Scholar
Benhamou S, Valeix M, Chamaillé-Jammes S, Macdonald DW, Loveridge AJ. Movement-based analysis of interactions in African lions. Anim Behav. 2014; 90:171–80. https://doi.org/10.1016/j.anbehav.2014.01.030.
Article Google Scholar
Bertrand MR, DeNicola AJ, Beissinger SR, Swihart RK. Effects of parturition on home ranges and social afficiliations of female white-tailed deer. J Wildl Manag. 1996; 60(4):899–909.
Article Google Scholar
Bertrand S, Diaz E, Lengaigne M. Patterns in the spatial distribution of Peruvian anchovy (Engraulis ringens) revealed by spatially explicit fishing data. Prog Oceanogr. 2008; 79(2-4):379–89. https://doi.org/10.1016/j.pocean.2008.10.009.
Article Google Scholar
Biro D, Sasaki T, Portugal SJ. Bringing a Time – Depth Perspective to Collective Animal Behaviour. Trends Ecol Evol. 2016; 31(7):550–62. https://doi.org/10.1016/j.tree.2016.03.018.
Article Google Scholar
Block BA, Jonsen ID, Jorgensen SJ, Winship AJ, Shaffer SA, Bograd SJ, Hazen EL, Foley DG, Breed GA, Harrison A-L, Ganong JE, Swithenbank A, Castleton M, Dewar H, Mate BR, Shillinger GL, Schaefer KM, Benson SR, Weise MJ, Henry RW, Costa DP. Tracking apex marine predator movements in a dynamic ocean. Nature. 2011; 475:86–90. https://doi.org/10.1038/nature10082.
Article CAS Google Scholar
Börger L, Dalziel BD, Fryxell JM. Are there general mechanisms of animal home range bevhaviour? A review and prospects for future research,. Ecol Lett. 2008; 11(6):637–50. https://doi.org/10.1111/j.1461-0248.2008.01182.x.
Article Google Scholar
Brotherton PN, Pemberton JM, Komers PE, Malarky G. Genetic and behavioural evidence of monogamy in a mammal, Kirk’s dik-dik (Madoqua kirkii). Proc Biol Sci R Soc. 1997; 264(1382):675–81. https://doi.org/10.1098/rspb.1997.0096.
Article CAS Google Scholar
Camazine S, Deneubourg J-L, Franks NR, Sneyd J, Bonabeau E, Theraula G. Self-organization in Biological Systems, vol 7. United States of America: Princeton University Press; 2003.
Google Scholar
Cole LC. The Measurement of Interspecific Association. Ecology. 1949; 30(4):411–24.
Article Google Scholar
Conradt L, List C. Group decisions in humans and animals: A survey. Philos Trans R Soc B Biol Sci. 2009; 364(1518):719–42. https://doi.org/10.1098/rstb.2008.0276.
Article Google Scholar
De Solla SR, Bonduriansky R, Brooks RJ. Eliminating autocorrelation reduces biological relevance of home range estimates. J Anim Ecol. 1999; 68(2):221–34. https://doi.org/10.1046/j.1365-2656.1999.00279.x.
Article Google Scholar
Dhanjal-Adams KL, Bauer S, Emmenegger T, Hahn S, Lisovski S, Liechti F. Spatiotemporal Group Dynamics in a Long-Distance Migratory Bird. Curr Biol. 2018; 28(17):2824–303. https://doi.org/10.1016/j.cub.2018.06.054. Accessed 14 Nov 2018.
Article CAS Google Scholar
Dodge S, Weibel R, Forootan E. Revealing the physics of movement: Comparing the similarity of movement characteristics of different types of moving objects. Comput Environ Urban Syst. 2009; 33(6):419–34. https://doi.org/10.1016/j.compenvurbsys.2009.07.008.
Article Google Scholar
Duarte CM, Riker P, Srinivasan M, Robinson PW, Gallo-Reynoso JP, Costa DP. Sonification of Animal Tracks as an Alternative Representation of Multi-Dimensional Data: A Northern Elephant Seal Example. Frontiers Mar Sci. 2018; 5. https://doi.org/10.3389/fmars.2018.00128. Accessed 10 June 2018.
Duarte R, Araújo D, Correia V, Davids K, Marques P, Richardson MJ. Competing together: Assessing the dynamics of team-team and player-team synchrony in professional association football. Hum Mov Sci. 2013; 32(4):555–66. https://doi.org/10.1016/j.humov.2013.01.011.
Article Google Scholar
Duranton C, Gaunet F. Behavioural synchronization from an ethological perspective: overview of its adaptive value. Adapt Behav. 2016. https://doi.org/10.1177/1059712316644966.
Fleming CH, Fagan WF, Mueller T, Olson KA, Leimgruber P, Calabrese JM. Estimating where and how animals travel: an optimal framework for path reconstruction from autocorrelated tracking data. Ecology. 2016; 97(3):576–82.
CAS PubMed Google Scholar
Frechet M. Sur L’Ecart de Deux Courbes et Sur Les Courbes Limites. Trans Am Math Soc. 1905; 6(4):435–49.
Article Google Scholar
Giuggioli L, McKetterick TJ, Holderied M. Delayed Response and Biosonar Perception Explain Movement Coordination in Trawling Bats. PLoS Comput Biol. 2015; 11(3):1–21. https://doi.org/10.1371/journal.pcbi.1004089.
Article Google Scholar
Haydon DT, Morales JM, Yott A, Jenkins DA, Rosatte R, Fryxell JM. Socially informed random walks: incorporating group dynamics into models of population spread and growth. Proc Biol Sci R Soc. 2008; 275(1638):1101–9. https://doi.org/10.1098/rspb.2007.1688.
Article Google Scholar
Johnson A. Polyclip: Polygon Clipping. 2015. Ported to R by Adrian Baddeley and Brian Ripley. R package version 1.3-2. http://CRAN.R-project.org/package=polyclip.
Josse J, Husson F, Pagès J. Gestion des données manquantes en Analyse en Composantes Principales. J Soc Fr Stat. 2009; 150(2):28–51.
Google Scholar
Kenward RE, Marcström V, Karlbom M. Post-nestling behaviour in goshawks, Accipiter gentilis: II. Sex differences in sociality and nest-switching. 1993. https://doi.org/10.1006/anbe.1993.1199. http://dx.doi.org/10.1006/anbe.1993.1199%5Cnhttp://linkinghub.elsevier.com/retrieve/pii/S0003347283711991.
Konzack M, McKetterick T, Ophelders T, Buchin M, Giuggioli L, Long J, Nelson T, Westenberg MA, Buchin K. Visual analytics of delays and interaction in movement data. Int J Geogr Inf Sci. 2017; 31(2):320–45. https://doi.org/10.1080/13658816.2016.1199806.
Article Google Scholar
Kuang K, Napolitano F. Pbmcapply: Tracking the Progress of Mc*pply with Progress Bar. 2018. R package version 1.3.0. https://CRAN.R-project.org/package=pbmcapply.
Langrock R, Hopcraft JGC, Blackwell PG, Goodall V, King R, Niu M, Patterson TA, Pedersen MW, Skarin A, Schick RS. Modelling group dynamic animal movement. Methods Ecol Evol. 2014; 5(2):190–9. https://doi.org/10.1111/2041-210X.12155. arXiv:1308.5850v1.
Article Google Scholar
Laube P, Purves RS. How fast is a cow? Cross-Scale Analysis of Movement Data. Trans GIS. 2011; 15(3):401–18. https://doi.org/10.1111/j.1467-9671.2011.01256.x.
Article Google Scholar
Laube P, Dennis T, Forer P, Walker M. Movement beyond the snapshot - Dynamic analysis of geospatial lifelines. Comput Environ Urban Syst. 2007; 31(5):481–501. https://doi.org/10.1016/j.compenvurbsys.2007.08.002.
Article Google Scholar
Lê S, Josse J, Husson F. FactoMineR: A package for multivariate analysis. J Stat Softw. 2008; 25(1):1–18. https://doi.org/10.18637/jss.v025.i01.
Article Google Scholar
Levenshtein VI. Binary codes capable of correcting deletions, insertions, and reversals. Sov Phys Dokl. 1966; 10(8):707–10. https://doi.org/citeulike-article-id:311174. arXiv:1011.1669v3.
Google Scholar
Levitis DA, Lidicker WZ, Freund G. Behavioural biologists don’t agree on what constitutes behaviour,. Anim Behav. 2009; 78(1):103–10. https://doi.org/10.1016/j.anbehav.2009.03.018.
Article Google Scholar
Long J. wildlifeDI: Calculate Indices of Dynamic Interaction for Wildlife Telemetry Data. 2014. R package version 0.2. https://CRAN.R-project.org/package=wildlifeDI.
Long J, Nelson T. Home range and habitat analysis using dynamic time geography. J Wildl Manag. 2015; 79(3):481–90. https://doi.org/10.1002/jwmg.845.
Article Google Scholar
Long JA, Nelson TA. Measuring Dynamic Interaction in Movement Data. Trans GIS. 2013; 17(1):62–77. https://doi.org/10.1111/j.1467-9671.2012.01353.x.
Article Google Scholar
Long JA, Nelson TA, Webb SL, Gee KL. A critical examination of indices of dynamic interaction for wildlife telemetry studies. J Anim Ecol. 2014; 83:1216–33. https://doi.org/10.1111/1365-2656.12198.
Article Google Scholar
Long JA, Webb SL, Nelson TA, Gee KL. Mapping areas of spatial-temporal overlap from wildlife tracking data. Mov Ecol. 2015; 3(1):38. https://doi.org/10.1186/s40462-015-0064-3.
Article Google Scholar
Lord WM, Sun J, Ouellette NT, Bollt EM. Inference of Causal Information Flow in Collective Animal Behavior. IEEE Trans Mol Biol Multi-Scale Commun. 2016; 2(1):107–16. https://doi.org/10.1109/TMBMC.2016.2632099. Accessed 21 Nov 2018.
Article Google Scholar
Miller JA. Using Spatially Explicit Simulated Data to Analyze Animal Interactions: A Case Study with Brown Hyenas in Northern Botswana. Trans GIS. 2012; 16(3):271–91. https://doi.org/10.1111/j.1467-9671.2012.01323.x.
Article Google Scholar
Minta SC. Tests of Spatial and Temporal Interaction Among Animals. Ecol Appl. 1992; 2(2):178–88.
Article Google Scholar
Nathan R, Getz WM, Revilla E, Holyoak M, Kadmon R, Saltz D, Smouse PE. A movement ecology paradigm for unifying organismal movement research. PNAS. 2008; 105(49):19052–9.
Article CAS Google Scholar
Niu M, Blackwell PG, Skarin A. Modeling interdependent animal movement in continuous time. Biometrics. 2016; 72(2):315–24. https://doi.org/10.1111/biom.12454.
Article Google Scholar
Orange N, Abaid N. A transfer entropy analysis of leader-follower interactions in flying bats. Eur Phys J Spec Top. 2015; 224(17):3279–93. https://doi.org/10.1140/epjst/e2015-50235-9. Accessed 14 Nov 2018.
Article Google Scholar
Plot V, de Thoisy B, Blanc S, Kelle L, Lavergne A, Roger-Bérubet H, Tremblay Y, Fossette S, Georges J-Y. Reproductive synchrony in a recovering bottlenecked sea turtle population. J Anim Ecol. 2012; 81(2):341–51. https://doi.org/10.1111/j.1365-2656.2011.01915.x.
Article Google Scholar
Potts JR, Mokross K, Lewis Ma. A unifying framework for quantifying the nature of animal interactions. J R Soc Interface. 2014; 11(96):20140333. https://doi.org/10.1098/rsif.2014.0333. 1402.1802.
Article Google Scholar
R Core Team. R: A Language and Environment for Statistical Computing. Vienna: R Foundation for Statistical Computing; 2015. R Foundation for Statistical Computing. https://www.R-project.org/.
Google Scholar
Ribeiro Jr PJ, Diggle PJ. geoR: Analysis of Geostatistical Data. 2015. R package version 1.7-5.1. http://CRAN.R-project.org/package=geoR.
Rice JC, Rochet MJ. A framework for selecting a suite of indicators for fisheries management. ICES J Mar Sci. 2005; 62(3):516–27. https://doi.org/10.1016/j.icesjms.2005.01.003.
Article Google Scholar
Richman JS, Moorman JR. Physiological time-series analysis using approximate entropy and sample entropy,. Am J Physiol Heart Circ Physiol. 2000; 278(6):2039–49.
Article Google Scholar
Rochet M-J, Trenkel VM. Which community indicators can measure the impact of fishing? A review and proposals. Can J Fish Aquat Sci. 2003; 60(1):86–99. https://doi.org/10.1139/f02-164.
Article Google Scholar
Russell JC, Hanks EM, Haran M. Dynamic Models of Animal Movement with Spatial Point Process Interactions. J Agric Biol Environ Stat. 2016; 21(1):22–40. https://doi.org/10.1007/s13253-015-0219-0. Accessed 07 June 2018.
Article Google Scholar
Schreiber T. Measuring Information Transfer. Phys Rev Lett. 2000; 85(2):461–4. https://doi.org/10.1103/PhysRevLett.85.461. Accessed 19 Nov 2018.
Article CAS Google Scholar
Shirabe T. Correlation Analysis of Discrete Motions In: Raubal M, Miller HJ, Frank AU, Goodchild MF, editors. Geographic Information Science. Volume 4197 of the Series Lecture Notes in Computer Science. Münster: Springer-Verlag: 2006. p. 370–82.
Google Scholar
Sumpter DJT, Mann RP, Perna A. The modelling cycle for collective animal behaviour. Interface Focus. 2012; 2:764–73. https://doi.org/10.1098/rsfs.2012.0031.
Article Google Scholar
Travassos B, Davids K, Araujo D, Esteves PT. Performance analysis in team sports: Advances from an Ecological Dynamics approach. Int J Perform Anal Sport. 2013; 13(1):83–95.
Article Google Scholar
Van Strien AJ, Soldaat LL, Gregory RD. Desirable mathematical properties of indicators for biodiversity change. Ecol Indic. 2012; 14(1):202–8. https://doi.org/10.1016/j.ecolind.2011.07.007.
Article Google Scholar
VanDerWal J, Falconi L, Januchowski S, Shoo L, Storlie C. SDMTools: Species Distribution Modelling Tools: Tools for Processing Data Associated with Species Distribution Modelling Exercises. 2014. R package version 1.1-221. https://CRAN.R-project.org/package=SDMTools.
Vlachos M, Kollios G, Gunopulos D. Discovering similar multidimensional trajectories. In: Proceedings 18th International Conference on Data Engineering. Washington, DC: IEEE Computer Society: 2002. p. 673–684. https://doi.org/10.1109/ICDE.2002.994784.
Google Scholar
Wang XR, Miller JM, Lizier JT, Prokopenko M, Rossi LF. Quantifying and Tracing Information Cascades in Swarms. PLoS ONE. 2012; 7(7):40084. https://doi.org/10.1371/journal.pone.0040084. Accessed 14 Nov 2018.
Article Google Scholar
White PCL, Harris S. Encounters between Red Foxes (Vulpes vulpes): Implications for Territory Maintenance, Social Cohesion and Dispersion. J Anim Ecol. 1994; 63(2):315–27.
Article Google Scholar
Wong J. Pdist: Partitioned Distance Function. 2013. R package version 1.2. http://CRAN.R-project.org/package=pdist.

Download references

Acknowledgements

The authors would like to warmly thank Mathieu Basille for constructive comments to the manuscript, Angela Blanchard for proofreading and Criscely Lujan for help with github.

Funding

This work has received funding from French region Pays de la Loire and the research project COSELMAR, the French research network PathTIS and the European Union’s Horizon 2020 research and innovation program under grant agreement No 633680, Discardless.

Availability of data and materials

Codes with an example, and the dyad tracks arbitrarily created for the case scenarios are accessible from: https://github.com/rociojoo/MetricsDyadJM/

All analyses were performed in R [48]. Distances between fixes were computed using the pdist package [63]. For jPPA calculations, the ellipses were computed as in [35] and intersection and union areas were approximated by gridding the space via packages polyclip [24] and geoR [49]. For L_ixnT and HAI, SDMTools [59] was used to identify points in and out of the reference area. Parallel calculations to simulate and account for CPU time were done using packages parallel and pbmcapply [28]. The PCA in Additional file 8 were performed with the FactoMineR package [32].

Author information

Authors and Affiliations

Department of Wildlife Ecology and Conservation, Fort Lauderdale Research and Education Center, University of Florida, 3205 College Avenue, Davie, Florida, 33314, USA
Rocio Joo
IFREMER, Ecologie et Modèles pour l’Halieutique, BP 21105, Nantes Cedex 03, 44311, France
Rocio Joo & Stéphanie Mahévas
Univ Rennes, Agrocampus Ouest, CNRS, IRMAR - UMR 6625, F-35000, Rennes, France
Marie-Pierre Etienne
MARBEC, IRD, Ifremer, CNRS, Univ Montpellier, Sète, France
Nicolas Bez

Authors

Rocio Joo
View author publications
You can also search for this author in PubMed Google Scholar
Marie-Pierre Etienne
View author publications
You can also search for this author in PubMed Google Scholar
Nicolas Bez
View author publications
You can also search for this author in PubMed Google Scholar
Stéphanie Mahévas
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

RJ conceived of the research idea and performed the case scenario evaluation. The review of theoretical properties of the metrics was mostly performed by RJ and MPE, as well as the computational cost assessment. RJ leaded the manuscript writing, but all authors actively participated in discussions and editing. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Rocio Joo.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

None of the authors have any competing interests in the manuscript.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Additional files

Additional file 1

Graphical examples of two kernel functions for Proximity metrics. (PDF 89 kb)

Additional file 2

Cs1 requirements to take large negative values. (PDF 359 kb)

Additional file 3

Lixn: Table for computing probabilities. (PDF 119 kb)

Additional file 4

How to define the ellipse of the potential path area. (PDF 62 kb)

Additional file 5

Metrics derived for each case scenario. (PDF 63 kb)

Additional file 6

Summary figures for proximity-speed and proximity-coordination scenarios. (PDF 59 kb)

Additional file 7

Computational cost of each metric. (PDF 91 kb)

Additional file 8

Principal component analysis of the metrics for the case scenarios. (PDF 190 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Cite this article

Joo, R., Etienne, MP., Bez, N. et al. Metrics for describing dyadic movement: a review. Mov Ecol 6, 26 (2018). https://doi.org/10.1186/s40462-018-0144-2

Download citation

Received: 17 August 2018
Accepted: 02 December 2018
Published: 27 December 2018
DOI: https://doi.org/10.1186/s40462-018-0144-2

Metrics for describing dyadic movement: a review

Abstract

Introduction

Evaluation criteria

Definition and theoretical properties of the metrics

Proximity index (Prox)

Coefficient of Sociality (Cs)

The Half-weight Association Index (HAI)

Coefficient of Interaction (L ixn and L ixn T)

Joint Potential Path Area (jPPA)

Cross sampled entropy (CSE and CSEM)

Correlations (r V)

Dynamic Interaction (DI, D I d and D I θ)

Conclusions on the theoretical properties of the metrics

Exploration of metrics through case scenarios

Synthesis of metric analysis

Further perspectives on collective behaviour

Conclusions

Abbreviations

References

Acknowledgements

Funding

Availability of data and materials

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Publisher’s Note

Additional files

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Movement Ecology

Contact us

Coefficient of Interaction (L _ixn and L _ixn T)

Correlations (r _V)

Dynamic Interaction (DI, D I _d and D I _θ)