 Methodology article
 Open Access
A continuous-time state-space model for rapid quality control of Argos locations from animal-borne tags
Movement Ecology volume 8, Article number: 31 (2020)
Abstract
Background
State-space models are important tools for quality control and analysis of error-prone animal movement data. The near real-time (within 24 h) capability of the Argos satellite system can aid dynamic ocean management of human activities by informing when animals enter wind farms, shipping lanes, and other intensive use zones. This capability also facilitates the use of ocean observations from animal-borne sensors in operational ocean forecasting models. Such near real-time data provision requires rapid, reliable quality control to deal with error-prone Argos locations.
Methods
We formulate a continuous-time state-space model to filter the three types of Argos location data (Least-Squares, Kalman filter, and Kalman smoother), accounting for irregular timing of observations. Our model is deliberately simple to ensure speed and reliability for automated, near real-time quality control of Argos location data. We validate the model by fitting to Argos locations collected from 61 individuals across 7 marine vertebrates and compare model-estimated locations to contemporaneous GPS locations. We then test assumptions that Argos Kalman filter/smoother error ellipses are unbiased, and that Argos Kalman smoother location accuracy cannot be improved by subsequent state-space modelling.
Results
Estimation accuracy varied among species, with Root Mean Squared Errors usually <5 km, and these decreased with increasing data sampling rate and precision of Argos locations. Including a model parameter to inflate Argos error ellipse sizes in the north-south direction resulted in more accurate location estimates. Finally, in some cases the model appreciably improved the accuracy of the Argos Kalman smoother locations, which should not be possible if the smoother is using all available information.
Conclusions
Our model provides quality-controlled locations from Argos Least-Squares or Kalman filter data with accuracy similar to or marginally better than Argos Kalman smoother data that are only available via fee-based reprocessing. Simplicity and ease of use make the model suitable both for automated quality control of near real-time Argos data and for manual use by researchers working with historical Argos data.
Background
State-space models have emerged as important tools both for quality control and ecological analysis of error-prone animal movement data [1–5]. Analysis of these data with discrete-time models is simple in principle, breaking down animal movement into a series of discrete steps that occur on some fixed time interval (e.g., [1, 6]). Yet animal movement is a process that unfolds continuously through time, usually without clear breaks that could delineate discrete steps. We merely measure the movements from locations obtained over discrete, often irregular intervals in time. In this sense, a continuous-time model can more naturally handle temporally irregular observations while mimicking the true underlying continuous movement process(es) [2, 7].
In the marine realm, air-breathing animal locations are typically measured by satellite-linked electronic tags at irregular time intervals dictated by a combination of satellite availability and an animal’s surface behaviour. The Argos satellite telemetry system is one of the most common platforms used to track animals at sea, with over 40,000 individuals tracked since 2007 (S. Baudel, pers. comm.). In this system, transmissions from electronic tags are received by one of several polar-orbiting satellites as they pass overhead, and the Doppler shift in transmission frequency along with other information is used to geolocate the tags [8]. The polar orbits of Argos satellites result in denser coverage and potentially higher temporal resolution data closer to the poles than at the equator. From inception in 1978 to 2011, CLS (Collecte Localisation Satellites) used a Least-Squares algorithm to geolocate the tag transmissions. This approach does not quantify location uncertainty but rather provides location quality classes based on information including the number of transmissions received [8].
State-space models developed for Argos Least-Squares locations have relied on independent, ground-truth data (e.g., [9]) to quantify location uncertainty for each of the location quality classes [1, 2]. However, independently quantified uncertainties, based on a single or small number of data sets, are unlikely to be appropriate for all species in all locations. For example, modifications to assumed Least-Squares error variances can influence the accuracy of locations predicted by different state-space models [10].
In 2011, CLS replaced their Least-Squares algorithm with a state-space model, based on a multiple model Kalman filter algorithm, to estimate locations and their uncertainty [11]. This approach provides more location estimates, each with a corresponding estimated error ellipse, and with greater accuracy compared to the original Least-Squares method. These locations are provided in near real-time, here defined as within 24 h of occurrence. A fixed-interval Kalman smoother is also provided by CLS as an extra service to further improve location accuracy from the original Kalman filter-based location estimates [12]. Whereas the Kalman filter employs a one-step recursion to estimate locations based only on the current and previous observations, the Kalman smoother uses a two-pass approach, first employing the Kalman filter and then employing a backwards smooth of the data [13]. In this sense, the Kalman smoother uses information from the entire animal track to estimate locations and their uncertainty. This results in more accurate location estimates than the Kalman filter alone [12]. Such smoother-based location estimates are theoretically optimal given the available data, and it should not be possible to improve on them if uncertainty is characterised and propagated accurately (e.g., [14]). Currently, CLS does not provide Kalman smoother-based locations in near real-time; they can only be obtained with reprocessing, for an additional fee, after a tag deployment ends.
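The distinction between the forward (filter) and backward (smoother) passes can be illustrated with a minimal sketch. This is not the CLS multiple-model algorithm; it assumes a one-dimensional random walk observed with Gaussian noise, and the function names and the variances `q` and `r` are hypothetical choices made for illustration only.

```python
import random


def kalman_filter(ys, q, r, m0=0.0, p0=1e6):
    """Forward pass for a 1-D random walk (process variance q) observed with noise variance r."""
    m, p = m0, p0
    means, variances, pred_vars = [], [], []
    for y in ys:
        p_pred = p + q                 # predict: variance grows by the process noise
        gain = p_pred / (p_pred + r)   # Kalman gain
        m = m + gain * (y - m)         # update using only the current observation
        p = (1.0 - gain) * p_pred
        means.append(m)
        variances.append(p)
        pred_vars.append(p_pred)
    return means, variances, pred_vars


def rts_smoother(means, variances, pred_vars):
    """Backward (Rauch-Tung-Striebel) pass: revise filtered estimates using later observations."""
    s_means, s_vars = means[:], variances[:]
    for k in range(len(means) - 2, -1, -1):
        c = variances[k] / pred_vars[k + 1]   # smoother gain
        # For a random walk, the predicted mean at k+1 equals the filtered mean at k.
        s_means[k] = means[k] + c * (s_means[k + 1] - means[k])
        s_vars[k] = variances[k] + c * c * (s_vars[k + 1] - pred_vars[k + 1])
    return s_means, s_vars


# Example: simulate a noisy random walk, then filter and smooth it.
rng = random.Random(42)
truth, obs = 0.0, []
for _ in range(100):
    truth += rng.gauss(0.0, 1.0)
    obs.append(truth + rng.gauss(0.0, 2.0))
f_means, f_vars, p_vars = kalman_filter(obs, q=1.0, r=4.0)
s_means, s_vars = rts_smoother(f_means, f_vars, p_vars)
```

In this sketch the smoothed variances never exceed the filtered ones, and the two coincide only at the final observation, where no later data exist to inform the backward pass.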
Traditional use of animal tracking data has required neither near real-time data provision nor rapid modelling tools for quality control or ecological analysis. However, real-time management of at-risk species’ mortality from interactions with human activities such as offshore wind farms, fisheries and shipping increasingly relies on animal telemetry data [15–17]. Dynamic ocean management applied at high spatial and temporal resolutions can increase the efficiency and efficacy of measures to reduce mortality [18], placing an onus on rapidly available, high-resolution data. Similarly, the utility of animal-borne sensors for ocean observing [19, 20] as part of the Global Ocean Observing System has spurred coordinated animal telemetry programs, such as the Australian Integrated Marine Observing System’s Animal Tracking Facility (IMOS ATF) and the U.S. Integrated Ocean Observing System’s Animal Telemetry Network (IOOS ATN). These programs aim to provide near real-time ocean measurements via the World Meteorological Organization’s Global Telecommunication System for assimilation in operational ocean and atmospheric forecast models. In all these cases, near real-time telemetry data provision requires rapid and therefore automated, reliable quality control processes, including for the error-prone Argos location data that are essential for understanding animal movements and distribution, and for providing geospatial context to ocean measurements.
Here we present a continuous-time state-space model for rapid filtering of any Argos location data. This model is now used as part of the IMOS ATF’s quality control/quality assurance process for animal-borne ocean observations. To facilitate fast automation, we trade off realism (the ability to explain complex movement processes) for reliability by using a simple continuous-time random walk on velocity with a single variance parameter. We evaluate the model by: 1) comparing fits to all three Argos location types from the same individuals; 2) assessing accuracy of model-estimated locations against contemporaneous GPS locations; 3) assessing how a model assumption about Argos error ellipses influences estimation accuracy; 4) comparing the accuracy of modelled and unmodelled Kalman smoother locations.
Methods
A continuous-time state-space model for animal telemetry data
We model animal movement as a continuous-time random walk on velocity v_{t} in two coordinate axes:
\[ v_{t} = v_{t-\Delta} + \Sigma_{\Delta} \quad (1) \]
where Δ is the time increment and Σ_{Δ} is a zero-mean, bivariate Gaussian random variable with variance 2DΔ. The parameter D is a 1-d diffusion coefficient accounting for variability in velocity, which increases with the time interval Δ. Noting that locations x are the summed velocities, given a starting location, the following equation describes a simple process model subject to variable time increments:
\[ x_{i} = x_{i-1} + v_{i}\Delta_{i} \quad (2) \]
where the subscript i indexes time t_{i}, x_{i} is the true location of the animal at time t_{i} and v_{i}Δ_{i} is the displacement (velocity × elapsed time) between x_{i−1} and x_{i}. To simplify the model, we assume that the velocity random walk variances 2DΔ_{i} are equal on the two axes, but they could also be assumed to vary independently [2]. Correlation in movements arises from allowing the locations to be the sum of the velocities.
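The process model described above can be simulated in a few lines. The following is a minimal sketch rather than the foieGras implementation: the function name, the seed handling, and the units are assumptions made for illustration, and the same D is applied on both axes as in the text.

```python
import math
import random


def simulate_track(times, D, x0=(0.0, 0.0), v0=(0.0, 0.0), seed=1):
    """Simulate locations from a random walk on velocity with irregular time steps.

    times : increasing observation times t_i
    D     : diffusion coefficient; the velocity increment over dt has variance 2*D*dt
    """
    rng = random.Random(seed)
    x, v = list(x0), list(v0)
    track = [tuple(x)]
    for i in range(1, len(times)):
        dt = times[i] - times[i - 1]
        sd = math.sqrt(2.0 * D * dt)           # standard deviation of the velocity increment
        for axis in range(2):
            v[axis] += rng.gauss(0.0, sd)      # velocity random walk
            x[axis] += v[axis] * dt            # x_i = x_{i-1} + v_i * dt_i
        track.append(tuple(x))
    return track


# Example: irregularly spaced times (hours) and an arbitrary, hypothetical D.
track = simulate_track([0.0, 0.5, 2.0, 2.25, 6.0], D=0.1)
```

Because each location is the running sum of the velocities, successive displacements in the simulated track are correlated even though the velocity increments are independent.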
We couple this process model to a generally applicable measurement model that describes how the error-prone and possibly irregularly-timed observed locations y_{i} map onto the corresponding true location states x_{i}:
\[ y_{i} \sim \mathrm{N}(x_{i}, \Omega_{i}) \quad (3) \]
where y_{i} is the location observed at time t_{i} corresponding to x_{i}, and Ω_{i} is the measurement error variance-covariance matrix that can be structured to suit different types of location data. Below, we focus on modifications to accommodate different Argos location types, but other location data (e.g., processed light-level geolocations) could also be considered in this framework.
Argos Least-Squares data
Locations measured using CLS’ older Least-Squares (LS) approach [8] are associated with location quality class designations: 3, 2, 1, 0, A, B, and Z. These classes are the only contemporaneous information about location quality and provide only a relative index of measurement uncertainty [1]. We use the class information, along with independent estimates of their associated standard errors from Argos transmitters deployed on seals held captive at a known location [9], to construct the following variance-covariance matrix:
\[ \Omega_{i} = \begin{pmatrix} \tau_{x}^{2}K_{x,i}^{2} & \rho\,\tau_{x}K_{x,i}\,\tau_{y}K_{y,i} \\ \rho\,\tau_{x}K_{x,i}\,\tau_{y}K_{y,i} & \tau_{y}^{2}K_{y,i}^{2} \end{pmatrix} \quad (4) \]
where \(\tau _{x}^{2}\) and \(\tau _{y}^{2}\) are the overall measurement error variances on the two coordinate axes, K_{x,i} and K_{y,i} are error weighting factors that scale the τ’s appropriately for the Argos location quality class associated with the i-th observation, and ρ is the correlation between τ_{x}K_{x,i} and τ_{y}K_{y,i}. The τ’s are estimated during model fitting and the error weighting factors are the standard error ratios between the best quality class, 3, and each other class (2, 1, 0, A, B, Z).
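A minimal sketch of how such a class-dependent covariance matrix could be assembled is given below. The function name is hypothetical, and the weighting factors in `EXAMPLE_K` are made-up placeholders, not the ratios derived from [9]; only the structure (scaled standard deviations and a shared correlation ρ) follows the description above.

```python
def ls_covariance(tau_x, tau_y, rho, k_x, k_y):
    """2 x 2 measurement covariance for one Least-Squares location.

    tau_x, tau_y : overall measurement error standard deviations on each axis
    rho          : correlation between the scaled errors on the two axes
    k_x, k_y     : error weighting factors for this observation's quality class
    """
    sx = tau_x * k_x                   # scaled standard deviation, x axis
    sy = tau_y * k_y                   # scaled standard deviation, y axis
    cov = rho * sx * sy
    return [[sx * sx, cov], [cov, sy * sy]]


# Hypothetical weighting factors per quality class (placeholders only).
EXAMPLE_K = {"3": (1.0, 1.0), "1": (3.0, 2.5), "B": (20.0, 15.0)}

k_x, k_y = EXAMPLE_K["B"]
omega = ls_covariance(tau_x=0.5, tau_y=0.4, rho=0.2, k_x=k_x, k_y=k_y)
```

With this structure only τ_x, τ_y and ρ need to be estimated; the class designation of each observation selects the fixed factors that scale them.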
Argos Kalman filter and Kalman smoother data
Locations measured using CLS’ Kalman filter (KF) or Kalman smoother (KS) algorithms have their estimated uncertainties provided to users as error ellipses [11]. Ellipses are defined by three variables: semi-major axis, semi-minor axis and semi-major axis orientation from north. Building on McClintock et al. [14], the error variance-covariance matrix is:
with the elements being derived from the Argos error ellipse components:
and
where M_{i} is the ellipse semi-major axis length of the i-th observation, m_{i} is the semi-minor axis length and c_{i} is the semi-major axis orientation [11, 14].
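For reference, the ellipse-to-covariance conversion can be sketched in the following form, which assumes the approach of McClintock et al. [14] with the semi-minor axis additionally rescaled by the parameter ψ. The symbols \(\sigma^{2}_{x,i}\), \(\sigma^{2}_{y,i}\) and \(\sigma_{xy,i}\) are names chosen here for illustration, and the expressions should be checked against [14] before use:

\[ \Omega_{i} = \begin{pmatrix} \sigma^{2}_{x,i} & \sigma_{xy,i} \\ \sigma_{xy,i} & \sigma^{2}_{y,i} \end{pmatrix} \]
\[ \sigma^{2}_{x,i} = \left(\frac{M_{i}}{\sqrt{2}}\right)^{2} \sin^{2} c_{i} + \left(\frac{m_{i}\psi}{\sqrt{2}}\right)^{2} \cos^{2} c_{i} \]
\[ \sigma^{2}_{y,i} = \left(\frac{M_{i}}{\sqrt{2}}\right)^{2} \cos^{2} c_{i} + \left(\frac{m_{i}\psi}{\sqrt{2}}\right)^{2} \sin^{2} c_{i} \]
\[ \sigma_{xy,i} = \frac{M_{i}^{2} - (m_{i}\psi)^{2}}{2} \cos c_{i} \sin c_{i} \]

Under these expressions the determinant of Ω_{i} is \(M_{i}^{2}(m_{i}\psi)^{2}/4\), so the matrix remains a valid covariance for any ψ > 0, and the covariance term vanishes when the rescaled ellipse is circular.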
McClintock et al. [14] used a bivariate t-distribution, with variance-covariance defined by the Argos error ellipses, in their measurement model to account for occasional outlier observations (i.e., where error ellipses underestimate the true measurement uncertainty). Here we chose to identify and remove outlier locations using a travel rate filter [21] prior to fitting the state-space model, as per [2, 22]. Additionally, we included the parameter ψ to account for possible consistent underestimation of the Kalman filter- (and smoother-) derived location uncertainty (Fig. 1). ψ rescales all ellipse semi-minor axes m_{i}, where estimated values >1 inflate the uncertainty region around measured locations by lengthening ellipse semi-minor axes.
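The effect of ψ on a single error ellipse can be illustrated with a short sketch. It assumes an ellipse-to-covariance conversion of the kind used by McClintock et al. [14], with orientation measured in degrees from north; the function name and argument names are hypothetical.

```python
import math


def ellipse_to_cov(semi_major, semi_minor, orientation_deg, psi=1.0):
    """Convert an Argos error ellipse to (var_x, var_y, cov_xy).

    Assumed form: the semi-minor axis is multiplied by psi before conversion,
    so psi > 1 inflates uncertainty perpendicular to the semi-major axis.
    """
    theta = math.radians(orientation_deg)        # orientation from north
    a2 = semi_major ** 2 / 2.0                   # (M / sqrt(2))^2
    b2 = (semi_minor * psi) ** 2 / 2.0           # (m * psi / sqrt(2))^2
    s, c = math.sin(theta), math.cos(theta)
    var_x = a2 * s * s + b2 * c * c
    var_y = a2 * c * c + b2 * s * s
    cov_xy = (a2 - b2) * s * c
    return var_x, var_y, cov_xy


# Example: a strongly elongated ellipse (km) with its semi-major axis oriented east-west.
unscaled = ellipse_to_cov(10.0, 0.5, 90.0)
inflated = ellipse_to_cov(10.0, 0.5, 90.0, psi=4.0)
```

In this example the semi-minor axis lies north-south, so increasing ψ raises only the variance on the y axis and leaves the east-west variance unchanged.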
In all cases, we project the y_{i}’s from geographic coordinates (lon, lat) onto a Cartesian plane prior to modelling, using the WGS84 World Mercator projection (EPSG 3395). To facilitate optimization, all planar coordinates and their uncertainty estimates, where available, are converted from m to km.
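As an illustration of this projection step, the sketch below applies the standard ellipsoidal Mercator formulas with WGS84 parameters and returns coordinates in km. It is a simplified stand-in for a projection library; the function name is hypothetical, and the constants are the commonly published WGS84 values.

```python
import math

WGS84_A_KM = 6378.137                      # WGS84 semi-major axis, km
WGS84_E = math.sqrt(0.00669437999014)      # WGS84 first eccentricity


def world_mercator_km(lon_deg, lat_deg):
    """Project geographic coordinates to ellipsoidal Mercator x, y in km."""
    lam = math.radians(lon_deg)
    phi = math.radians(lat_deg)
    es = WGS84_E * math.sin(phi)
    x = WGS84_A_KM * lam
    y = WGS84_A_KM * math.log(
        math.tan(math.pi / 4.0 + phi / 2.0)
        * ((1.0 - es) / (1.0 + es)) ** (WGS84_E / 2.0)
    )
    return x, y


# Example: a location in the Southern Ocean.
x_km, y_km = world_mercator_km(70.0, -49.5)
```

Working in km keeps coordinates and their uncertainties on a similar numerical scale, which is the stated motivation for the unit conversion. Note that Mercator distances are increasingly stretched at high latitudes.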
Estimation
We used the R package TMB (Template Model Builder, [23]) to fit the state-space model, using maximum likelihood to estimate model parameters and the Laplace approximation to rapidly estimate the random effects, i.e., the unobserved location and velocity states, x and v [5, 24]. Using this estimation approach, uncertainty in the x and v estimates is obtained using a generalised delta method (see [23] for details).
The model presented here and associated general data preparation code are available in the foieGras R package [25], available on the CRAN server (https://CRAN.R-project.org/package=foieGras). The latest version can also be downloaded from the lead author’s GitHub site (https://github.com/ianjonsen/foieGras).
Data and preprocessing
We model all three types of Argos satellite location data: LS, KF, and KS. The data comprise four pinniped, one seabird and two sea turtle species (Table 1), with deployment locations ranging between polar, temperate, and tropical marine regions (Additional file 1: Fig. S1). The number of individual data sets by species and data type ranges from 6 to 13, with all having locations measured by GPS and at least one Argos type (Table 1). All data collected after 2008 were reprocessed by CLS to obtain the three Argos data types (4 species; Table 1).
We used an automated prefiltering step to identify outlier observations to be ignored by the state-space model. This prefiltering used the argosfilter R package [21] to identify locations implying travel rates >3 m s^{-1} for all pinnipeds and sea turtles and travel rates >17 m s^{-1} for northern gannets. These speed thresholds represent conservative upper limits of travel for these species and are intended to identify only the extreme outlier observations. This resulted in <30% of Least-Squares, <15% of Kalman filter, and <10% of Kalman smoother data being removed. The proportion of data removed by prefiltering is considerably less than those associated with optimal speed thresholds for other species (e.g., [22]).
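The idea behind a travel-rate prefilter can be sketched as follows. This is a deliberately simplified sequential filter, not the algorithm implemented in the argosfilter package; the function names are hypothetical, distances use a spherical (haversine) approximation, and the first location is assumed to be valid.

```python
import math

EARTH_RADIUS_M = 6371000.0  # mean Earth radius, spherical approximation


def haversine_m(lon1, lat1, lon2, lat2):
    """Great-circle distance in metres between two lon/lat points (degrees)."""
    p1, p2 = math.radians(lat1), math.radians(lat2)
    dphi = p2 - p1
    dlam = math.radians(lon2 - lon1)
    h = math.sin(dphi / 2) ** 2 + math.cos(p1) * math.cos(p2) * math.sin(dlam / 2) ** 2
    return 2.0 * EARTH_RADIUS_M * math.asin(math.sqrt(h))


def flag_speed_outliers(times_s, lons, lats, vmax_ms):
    """Return a keep/drop flag per location.

    A location is dropped when the travel rate from the last retained
    location exceeds vmax_ms (m/s), or when its time does not advance.
    """
    keep = [True] * len(times_s)
    last = 0                                   # index of the last retained location
    for i in range(1, len(times_s)):
        dt = times_s[i] - times_s[last]
        dist = haversine_m(lons[last], lats[last], lons[i], lats[i])
        if dt <= 0 or dist / dt > vmax_ms:
            keep[i] = False
        else:
            last = i
    return keep


# Example: the second fix implies roughly 31 m/s and is flagged at a 3 m/s threshold.
flags = flag_speed_outliers([0, 3600, 7200], [0.0, 1.0, 0.01], [0.0, 0.0, 0.0], vmax_ms=3.0)
```

Flagged locations are ignored by the subsequent model fit rather than corrected, which is why conservative thresholds that catch only extreme outliers are preferable here.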
Empirical validation
We examined the accuracy of model-predicted locations, assuming GPS data represent truth. Although GPS data have higher spatial accuracy and precision, and typically have higher sampling rates than Argos data, they are nonetheless discrete measurements of a continuous-time process. As a consequence, they are also likely to misrepresent animals’ true movement paths but to a far smaller extent (tens of metres; [26]) than Argos data.
For all validations presented, we compared GPS locations to model-fitted locations (hereafter model-estimated locations), which are location states estimated at the times of the Argos-measured locations. By focusing on model-estimated locations and not predicted locations that occur at regular time intervals, we reduce the degree to which model accuracy is confounded with data sampling rates that are known to vary across species and Argos data types (see Discussion).
We compared model-estimated locations from fits to all three Argos data types, where available, with GPS data. In all cases, the times of GPS observations do not match the times of Argos observations or the corresponding model-estimated locations. To account for this mismatch, we initially considered three approaches for comparing between GPS and modelled locations. The first was a linear interpolation of GPS locations to model-estimated location times [27]. The second used the temporally closest GPS observation if any occurred within ±10 min. The third used the model to predict locations at the GPS observation times. In several cases, the third approach was not feasible, as the typically higher frequency of GPS observations resulted either in implausible artefacts in the model fits to the Argos data or in convergence failures of the optimiser used to fit the model. For these reasons, we chose not to consider this approach further.
Fitting the state-space model with a fixed 2-h prediction interval resulted in optimiser convergence for all individual tracks. For each individual track, we summarized the deviations between model-estimated locations and either the linearly interpolated GPS locations or the temporally matched GPS locations by taking the root mean of the squared distances (RMSD in km) between all pairs of locations and comparing distributions of individual RMSD values among species. We report results of comparisons with the linearly interpolated GPS locations here and comparisons with the temporally matched GPS locations in Supplementary Information. We discuss implications of using each of these approaches.
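The interpolation-based comparison can be sketched as below. This is an illustrative reconstruction, not the authors' code: it assumes planar coordinates in km, strictly increasing GPS times, and that model-estimated locations falling outside the GPS time span are skipped; all function names are hypothetical.

```python
import bisect
import math


def interpolate_gps(gps_times, gps_xy, t):
    """Linearly interpolate a GPS position at time t; None outside the GPS time span."""
    if t < gps_times[0] or t > gps_times[-1]:
        return None
    j = bisect.bisect_right(gps_times, t)
    if j == len(gps_times):                    # t equals the last GPS time
        return gps_xy[-1]
    t0, t1 = gps_times[j - 1], gps_times[j]
    w = (t - t0) / (t1 - t0)
    (x0, y0), (x1, y1) = gps_xy[j - 1], gps_xy[j]
    return (x0 + w * (x1 - x0), y0 + w * (y1 - y0))


def rmsd_km(est_times, est_xy, gps_times, gps_xy):
    """Root mean of squared distances between estimated and interpolated GPS locations."""
    sq = []
    for t, (ex, ey) in zip(est_times, est_xy):
        g = interpolate_gps(gps_times, gps_xy, t)
        if g is not None:
            sq.append((ex - g[0]) ** 2 + (ey - g[1]) ** 2)
    if not sq:
        raise ValueError("no model-estimated locations within the GPS time span")
    return math.sqrt(sum(sq) / len(sq))


# Example: one estimate 3 km off the interpolated GPS track.
value = rmsd_km([5.0], [(5.0, 3.0)], [0.0, 10.0], [(0.0, 0.0), (10.0, 0.0)])
```

Interpolating the GPS track to the model-estimated times, rather than the reverse, uses every model-estimated location in the comparison instead of only those with a close temporal match.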
Total sequential processing time for validation using all 129 Argos data sets (Table 1) was 13.43 min, an average of 6.25 s per data set. This included both the prefilter algorithm and state-space model estimation, running on a 2018 MacBook Pro 15" laptop with a 2.9 GHz i9 processor and 32 GB RAM, using R 3.6.2.
Potential underrepresentation of Argos KF/KS location uncertainty
Our default model accounts for a perceived underestimation of the size of CLS’ Kalman filter and Kalman smoother error ellipses (Fig. 1 and Additional file 1: Figs. S2-S5) by including the parameter ψ (Eqns. 6, 8). Although uncertainty is expected to be lower in the general north-south plane due to the polar orbits of the Argos satellites [11], the frequent compression of error ellipses in this plane (semi-minor axis; e.g., Fig. 1b) seems extreme. Values of ψ>1 inflate the semi-minor axis, increasing the uncertainty region around Argos KF/KS observations, and could allow the model to more appropriately smooth the data. It is unclear how much the parameter actually improves the accuracy of estimated tracks versus yielding a less accurate over-smoothing of the data. To assess this, we evaluated the influence of the ψ parameter on the accuracy of model-estimated locations by comparing RMSD values from models with and without the ψ parameter. To simplify the results, we pooled RMSD values across species and assessed the log_{e} difference in RMSD (denoted as logΔ RMSD), which approximates % difference on the linear scale [28].
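The log difference and its interpretation as an approximate proportional change can be illustrated briefly. The direction of the difference (with ψ minus without ψ, so that negative values indicate improvement) is an assumption inferred from how the results are reported; the function name and the RMSD values are hypothetical.

```python
import math


def log_delta_rmsd(rmsd_with_psi, rmsd_without_psi):
    """Natural-log difference in RMSD; negative values mean lower RMSD with psi."""
    return math.log(rmsd_with_psi) - math.log(rmsd_without_psi)


# Example: 4.75 km with psi versus 5.00 km without.
ld = log_delta_rmsd(4.75, 5.00)
exact_pct = 100.0 * (4.75 - 5.00) / 5.00       # exact proportional change, in %
approx_pct = 100.0 * ld                        # log-difference approximation, in %
```

For small differences the two percentages are close, and unlike a ratio the log difference is symmetric: swapping the two RMSD values changes only its sign.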
Argos KS location accuracy
The CLS Kalman smoother locations have greater spatial accuracy and precision than Least-Squares or Kalman filter data [12]. In principle, it should not be possible to improve the accuracy of KS-based locations with subsequent modelling because they are theoretically optimal estimates, using all available data. It does seem reasonable, however, to question whether this is actually the case. We evaluated this by comparing logΔ RMSD derived from GPS and KS locations to those derived from GPS and estimates from the state-space model fit to the KS locations. In both cases, we apply the same prefiltering to identify and remove outlier locations, though these outliers should not be present in KS-based locations.
Results
State-space model fits to the 3 Argos data types
We fit the state-space model to the four species with all three Argos data types (Table 1), and present fits with a 2-h prediction time interval. Model fits to hawksbill turtle and southern elephant seal data show a consistent increase in spatial resolution and decrease in estimation uncertainty of the predicted tracks across the three Argos data types (top to bottom; Fig. 2 a,e,i and b,f,j, respectively). This effect is due to an increase in the number of observations from Least-Squares to Kalman filter data, and to a shrinking of the error ellipses (measurement uncertainty), by nearly half, from Kalman filter to Kalman smoother data (Table 2). Model fits to leopard seal and northern gannet data do not show any clear differences in resolution or estimation uncertainty across the Argos data types (Fig. 2 c,g,k and d,h,l, respectively). This appears to be due to smaller differences in the number of observations for Least-Squares versus Kalman filter data, arising from lower proportions of class A and B locations, relative to hawksbill turtles and southern elephant seals (Table 2). The lower proportions of class A and B locations for leopard seals and northern gannets are likely due to the large amount of time they spend at or above the ocean surface. Additionally, northern gannets had, on average, far larger error ellipses than the other species (Table 2). The uncertainty of their state-space model-predicted locations was consequently larger, regardless of Argos data type (light blue 95% confidence ellipses in Fig. 2 d,h,l).
Validation with GPS data
Median distances between state-space model-estimated and interpolated GPS locations were within 8 km for all species and data types, with most species and data types having 95% of estimated locations within 12 km of GPS locations (Table 3). Northern gannets were an exception, with 95th percentiles extending >40 km for all Argos data types (Table 3). Importantly, the median distances for state-space model-estimated locations, regardless of Argos data type, were all smaller than or comparable to those of prefiltered but unmodelled KS locations (Table 3). Across species, the weighted average (± se) improvement of state-space model-estimated location accuracy relative to unmodelled KS location accuracy was: LS = 0.21 ± 0.60 km; KF = 0.14 ± 0.07 km; KS = 0.34 ± 0.05 km.
Six of the 7 species’ estimated tracks had median RMSD values under 5 km with all values under 10 km, regardless of Argos data type (Fig. 3). Northern gannet tracks had considerably higher and more variable RMSDs (between 13 and 31 km), across all Argos data types (Fig. 3). This is consistent with their considerably larger state-space model-predicted location uncertainty (Fig. 2). Both hawksbill turtle and southern elephant seal tracks had declining RMSD values as Argos data frequency and precision increased (Fig. 3), and this was consistent with the increasing resolution and precision of their state-space model-predicted tracks (Fig. 2). Conversely, leopard seal and northern gannet tracks showed no such pattern, which was consistent with the general lack of increasing resolution of both the observed and predicted tracks (Fig. 2). Results were similar, although with overall lower RMSD values, when comparing state-space model-estimated locations to the temporally closest GPS location within ±10 min (Additional file 1: Fig. S6).
Effect of ψ parameter
Inclusion of the ψ parameter resulted in lower RMSD values, on average, implying that Argos error ellipses under-represent the true location uncertainty in the general north-south direction (Fig. 4). This result was less pronounced with fits to Argos Kalman smoother locations, with 81% of individuals having a logΔ RMSD <0 versus 90% of individuals for Argos Kalman filter locations (KF Δ RMSD: median = −0.57 km, range = −3.78, 0.45; KS Δ RMSD: median = −0.27 km, range = −3.34, 0.85). Of the four species, predicted locations for hawksbill turtle tracks were least likely to benefit from rescaled error ellipses, with most individuals having logΔ RMSD values close to or >0 (Fig. 4). It is unclear whether this is due to: 1) their relatively low absolute RMSD values (Fig. 3); 2) their slightly more circular error ellipses (Table 2), where the ψ rescaling effect would be less pronounced; or, 3) a combination of the two.
Argos KS accuracy
Argos Kalman smoother locations were less accurate by an average of 0.34 km without subsequent state-space model filtering (Table 3; compare KS and pf_KS values), although comparisons of logΔ RMSD were variable both within and among species (Fig. 5). The mean logΔ RMSD across species implied an average 6% increase in accuracy with subsequent state-space model filtering of Argos KS locations. However, results were equivocal for southern elephant seals, and hawksbill turtle tracks were typically more accurate without any subsequent state-space filtering (Fig. 5).
Discussion
We presented a continuous-time model for animal movement, fit in a state-space framework that allows flexible handling of Argos satellite telemetry data. The model was initially intended for automated quality control of large Argos animal tracking data sets, but is broadly applicable for any Argos location data. Using Argos-GPS double-tagged animals, we assessed the accuracy of model-estimated locations, comparing across three types of Argos data where possible. Median accuracy was within 4 km for most species and data types, with state-space model-estimated locations being slightly more accurate (by 0.1 to 0.3 km on average) than the best quality CLS Kalman smoother locations. Median root mean squared deviations were typically at or under 5 km for 6 of the 7 species studied. In most cases, RMSD values were lowest when fitting to Argos Kalman smoother data and highest when fitting to Argos Least-Squares or Kalman filter data, although the within-species differences in RMSD between data types were typically small. Although the model was evaluated over a limited number of individuals and species, it is apparent that the accuracy and spatiotemporal resolution of inferred locations is situational.
Highlighting this situational aspect are the northern gannet results (Table 3; Figs. 2 and 3), which are clearly distinct from the other species. Accuracy of model-estimated locations was approximately 4-5 times worse than for other species, although absolute magnitude is subject to the approach used for matching model-estimated and GPS locations (compare Fig. 3 and Additional file 1: Fig. S5). Unlike other species where median distances between model-estimated and GPS locations either declined consistently or were similar when comparing LS to KF and KF to KS data types, gannets had the lowest median distances for fits to LS data and had far broader distributions of distance across the 3 data types. We suspect this pattern may arise from the considerably faster mean travel rates of northern gannets (12 km h^{-1}, with cruising speeds up to 45 km h^{-1}) compared to the other species (approximately 0.7 to 3 km h^{-1}). Similarly, Lopez et al. [12] reported lower overall coverage probabilities of error ellipses estimated by their Kalman filter and Kalman smoother algorithms for two avian species analyzed in comparison to other platforms (terrestrial and marine mammals, sea turtles, ships and drifters). Combined, this implies that Argos error ellipses may be more strongly underestimated for species/platforms that travel faster and/or at higher altitude.
McClintock et al. [14] used a bivariate t-distribution, parameterised by the Argos error ellipse information, to model location measurement error. Their estimates of the t degrees of freedom parameter implied that the Argos error ellipses do not fully explain location measurement error. To avoid computational challenges associated with t-distribution parameter estimation, we used a two-step approach for dealing with location measurement error in Argos Kalman filter and Kalman smoother data. First, we identified and removed potentially large outliers using a travel-rate filter [21] prior to fitting the state-space model, as per [2, 22]. Although underestimation of location error was acknowledged by Lopez et al. [11, 12] and has been reported by others [14, 29], it is unclear why occasional, apparently hugely underestimated error ellipses are present in the Kalman filter and Kalman smoother data. Second, we accounted for potential Argos error ellipse underestimation by including the ψ parameter to inflate the semi-minor axis. We adopted this approach given the observation that Argos error ellipses often have semi-minor axes vastly smaller than corresponding semi-major axes, resulting in “squashed” error ellipses (Additional file 1: Figures S2-S4). We found that in most cases the ψ parameter contributed to more accurate location estimates, implying that the error ellipses commonly underestimate the true uncertainty in Argos-measured locations. This result is evident but less pronounced when fitting to Kalman smoother versus Kalman filter data. Location estimates were more accurate for at least some individuals of all species; however, hawksbill turtles and northern gannets appeared least likely to benefit from the ψ rescaling effect (see Fig. 4). Both of these species had somewhat more circular error ellipses, in comparison to the leopard and southern elephant seals, and thus any possible contribution of ψ would be reduced. Ultimately, we are unsure why Argos error ellipses appear to be so commonly biased low in the semi-minor axis direction (generally north-south).
Where possible, both Kalman filter and Kalman smoother data types were included in this study. We found, in most cases, that the model-estimated locations were most accurate when using the Kalman smoother data, but on average by less than 200 m compared with fits to Kalman filter data. Although the Kalman smoother data should represent optimal estimates of location because information along the entire movement track is used to update and smooth each location estimate, we show that fitting the state-space model to these estimates can further improve location accuracy in some cases (by an average reduction in error of approximately 6%). The Kalman smoother data are not provided in the default, near real-time service from CLS; rather, they are only available with post-processing by CLS at an additional cost. There are two points to be made about this. First, the smoothing algorithm is a standard approach that can be implemented rapidly, with computing requirements no greater than the Kalman filter. It could be applied in near real-time. Second, a near real-time Kalman smoother would result in the best available location estimates changing as new data became available. This incremental improvement, due to information gain propagating backwards in time, would reduce as locations become less recent. This should be of little consequence to most wildlife users who typically do not use their data in near real-time, and users who do require near real-time data may see greater benefit in more accurate locations even if they are subject to change in retrospect.
Our state-space model produced location estimates with a median accuracy comparable to or greater than CLS’ Kalman smoother locations, regardless of input Argos data type. This implies that users can obtain similar or better accuracy than CLS’ Kalman smoother locations by applying the state-space model to their Least-Squares or Kalman filter data. Therefore the method we describe is a viable alternative to CLS’ fee-based reprocessing service. The Laplace approximation approach employed in Template Model Builder models states (velocity and location) as unknown random effects, providing a most likely estimate of the current state from the posterior of its location given all available data, both forward and backward in time. This is precisely what a Kalman smoother does. That our model can improve on the CLS Kalman smoother’s location estimates may imply that uncertainty is somehow not well-propagated from the raw Doppler shift data available to CLS through to the location estimates available to users. If this is indeed the case, it is unclear why this is so. The issue may be due to necessary trade-offs between accuracy and precision versus providing a near real-time location service for a multitude of moving platforms, of which wildlife are a small component.
Spatiotemporal resolution and spatial accuracy
It is important to note that when comparing GPS locations with those from models fitted to Argos-measured locations, accuracy is interlinked with the temporal resolution (sampling rate) of Argos relative to GPS locations. As GPS resolution is typically greater than that of Argos, comparisons to determine spatial accuracy of estimated locations are confounded by this difference. No model fitted to Argos-measured locations alone can resolve all the nuances of a movement path that are present in higher resolution GPS data. This discrepancy will be reflected in measures of spatial accuracy, unless GPS data are suitably subsampled or interpolated.
We interpolated GPS locations to the times of the Argos-measured locations to which the state-space model was fitted. Our reasoning was that interpolation of the generally higher resolution GPS data should be less corrupted by spatial error than a similar interpolation of the lower resolution and irregularly occurring model-estimated locations. Subsampling GPS locations by matching them with the temporally closest model-estimated location, an approach commonly used elsewhere [12, 30, 31], resulted in lower RMSD, or greater (apparent) accuracy, than comparison with the linearly interpolated GPS locations. These lower RMSD values, however, were based on fewer (n < 10) temporally matched pairs of model-estimated and GPS locations for some species/individuals when using a 20-min window (Additional file 1: Fig S5). Although sample sizes could be increased by choosing a wider time window, the potential for biased comparisons would increase differently across species due to their different spatiotemporal scales of movement.
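The two comparison strategies described above can be sketched as follows. This is a minimal illustration rather than the code used in this study: it assumes locations are already projected to planar coordinates in km, times are in hours, and the function names are hypothetical. Note that `np.interp` holds the end values constant outside the GPS time range, so estimates outside that range should be dropped beforehand.

```python
import numpy as np


def interpolate_track(t_gps, x_gps, y_gps, t_out):
    """Linearly interpolate GPS coordinates (km) to the requested times (h)."""
    return np.interp(t_out, t_gps, x_gps), np.interp(t_out, t_gps, y_gps)


def rmsd(x_est, y_est, x_ref, y_ref):
    """Root mean squared deviation (km) between estimated and reference positions."""
    d2 = (np.asarray(x_est) - x_ref) ** 2 + (np.asarray(y_est) - y_ref) ** 2
    return float(np.sqrt(np.mean(d2)))


def match_nearest(t_gps, t_est, window):
    """Index of the temporally closest GPS fix for each estimate time,
    plus a mask that is True where that fix lies within `window` hours."""
    t_gps, t_est = np.asarray(t_gps), np.asarray(t_est)
    idx = np.abs(t_gps[None, :] - t_est[:, None]).argmin(axis=1)
    keep = np.abs(t_gps[idx] - t_est) <= window
    return idx, keep
```

With interpolation, every model-estimated location contributes to the RMSD. With nearest-time matching, only the pairs where `keep` is True are compared, which is why a narrow window (for example `window=20 / 60`) can leave very few pairs for sparsely sampled individuals.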
Fits to the three Argos location types from the same individuals showed that movement pathways can be predicted with increasing spatial resolution (i.e., greater spatial detail is resolved despite the same 2-h prediction time interval) and precision as the number of Argos-measured locations increased (transition from Least-Squares to Kalman filter data) and as their uncertainty decreased (transition from Kalman filter to Kalman smoother data). One of the main advantages of Argos’ Kalman filter over the older Least-Squares method is a gain in the number of location estimates, mostly by resolving locations from the single transmissions between tag and satellite that Least-Squares cannot [11]. This increase in resolution and precision is case-dependent, however, as species with lower overall proportions of class A and B locations do not gain as many new locations when transitioning from Least-Squares to Kalman filter data. This case-dependency is likely tied to the typical surface time intervals of diving species and, for species spending the majority of their time in air, to the magnitude of their travel rates.
Caveat
Fitting a state-space model to animal location data is not a panacea. Many ecological analyses of animal tracking data consider remotely sensed or other environmental data at spatial resolutions (2-10 km; e.g., [32]) approaching the state-space model accuracy limits found here. This highlights the need for researchers to consider the appropriate resolution of their environmental data given their specific questions and the limitations of their location estimates. While spatiotemporal mismatches between location estimates and environmental data can sometimes be dealt with by specifying coarser or finer prediction time intervals, such an approach has implications both for spatial and temporal autocorrelation affecting inference from subsequent analyses and for uncertainty in the location estimates themselves. Researchers should consider carrying location uncertainty estimates provided by state-space models through to subsequent ecological analyses, for example, by repeatedly sampling from the location uncertainty, conducting the analysis, and pooling results (sensu [33]). This can be done either completely through the whole analysis or partially via subsequent sensitivity analysis. Failure to examine or directly account for potential influences of spatiotemporal autocorrelation and estimation uncertainty of locations in subsequent analyses risks biased inferences.
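The sample, analyse, and pool procedure mentioned above might look like the following minimal sketch. Several assumptions are ours and not taken from this study or from [33]: location errors are drawn as independent Gaussians from per-location standard errors (ignoring serial correlation and any x-y covariance), pooling uses Rubin's rules, and `mean_covariate` is a hypothetical stand-in for a real ecological analysis.

```python
import numpy as np


def pool_rubin(estimates, variances):
    """Pool m imputation-specific estimates and variances with Rubin's rules."""
    estimates = np.asarray(estimates, dtype=float)
    variances = np.asarray(variances, dtype=float)
    m = estimates.size
    within = variances.mean()          # average analysis variance
    between = estimates.var(ddof=1)    # variance due to location uncertainty
    return estimates.mean(), within + (1.0 + 1.0 / m) * between


def impute_and_pool(x, y, se_x, se_y, analysis, m=50, seed=0):
    """Draw m plausible tracks, run `analysis` on each, and pool the results.

    `analysis(x, y)` must return an (estimate, variance) pair.
    """
    rng = np.random.default_rng(seed)
    results = [analysis(rng.normal(x, se_x), rng.normal(y, se_y))
               for _ in range(m)]
    estimates, variances = zip(*results)
    return pool_rubin(estimates, variances)


def mean_covariate(x, y):
    """Hypothetical analysis: mean of a toy environmental field along a track."""
    z = 10.0 + 0.5 * x - 0.2 * y       # stand-in for, e.g., a temperature field
    return z.mean(), z.var(ddof=1) / z.size
```

A call such as `impute_and_pool(x, y, se_x, se_y, mean_covariate)` returns a pooled estimate and a total variance in which the between-imputation term reflects how much location uncertainty alone moves the result. Running it with a smaller or larger `m` is one simple form of the sensitivity analysis noted above.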
Conclusions
The state-space model developed and validated here can be used to obtain quality-controlled animal locations from Argos Least-Squares or Kalman filter data in near real-time, with median accuracy comparable to or marginally better than CLS’ reprocessed Kalman smoother data. Our model also accounts for apparent north-south bias in Kalman filter and Kalman smoother-derived error ellipses.
The model’s near real-time capability provides the best estimates of location, given the available data, that can be continually updated as new data arrive via the Argos system. This rapid, continual quality control of animal tracking data is necessary as near real-time monitoring and forecasting of ocean states increasingly incorporates oceanographic data from animal-borne sensors, and as the need for dynamic ocean management grows in our increasingly exploited and rapidly changing oceans.
Although the model was developed for fast, automated quality control processes, its simplicity and ease of use also make it suitable for manual use by researchers wishing to conduct quality control of historical or otherwise less immediate Argos data.
Abbreviations
CLS: Collecte Localisation Satellites
EPSG: European Petroleum Survey Group
GPS: Global Positioning System
LS: Least-Squares
KF: Kalman filter
KS: Kalman smoother
IMOS ATF: Integrated Marine Observing System Animal Tracking Facility
IOOS ATN: Integrated Ocean Observing System Animal Telemetry Network
RMSD: Root Mean Squared Deviation
WGS84: World Geodetic System 1984
References
1. Jonsen I, Flemming J, Myers R. Robust state–space modeling of animal movement data. Ecology. 2005; 86(11):2874–80.
2. Johnson DS, London JM, Lea M, Durban JW. Continuous-time correlated random walk model for animal telemetry data. Ecology. 2008; 89(5):1208–15.
3. Patterson TA, Thomas L, Wilcox C, Ovaskainen O, Matthiopoulos J. State-space models of individual animal movement. Trends Ecol Evol. 2008; 23:87–94.
4. Albertsen CM, Whoriskey K, Yurkowski D, Nielsen A, Mills Flemming J. Fast fitting of non-Gaussian state-space models to animal movement data via Template Model Builder. Ecology. 2015; 96:2598–604.
5. Auger-Méthé M, Albertsen CM, Jonsen ID, Derocher AE, Lidgard DC, Studholme KR, et al. Spatiotemporal modelling of marine movement data using Template Model Builder (TMB). Mar Ecol Prog Ser. 2017; 565:237–49.
6. McClintock BT, King R, Thomas L, Matthiopoulos J, McConnell BJ, Morales JM. A general discrete–time modeling framework for animal movement using multistate random walks. Ecol Monogr. 2012; 82:335–349.
7. McClintock BT, Johnson DS, Hooten MB, Ver Hoef JM, Morales JM. When to be discrete: the importance of time formulation in understanding animal movement. Mov Ecol. 2014; 2:21.
8. Service Argos. Argos User’s Manual. CLS. 2016. http://www.argos-system.org/manual. Accessed 07 July 2020.
9. Vincent C, McConnell BJ, Fedak MA, Ridoux V. Assessment of ARGOS location accuracy from satellite tags deployed on captive grey seals. Mar Mamm Sci. 2002; 18:301–22.
10. Lowther AD, Lydersen C, Fedak MA, Lovell P, Kovacs KM. The Argos-CLS Kalman filter: error structures and state-space modelling relative to Fastloc GPS data. PLOS One. 2015; 10:4.
11. Lopez R, Malardé J, Royer F, Gaspar P. Improving Argos Doppler location using multiple-model Kalman filtering. IEEE Trans Geosci Remote Sens. 2014; 52:4744–55.
12. Lopez R, Malardé J, Danès P, Gaspar P. Improving Argos Doppler location using multiple-model smoothing. Anim Biotelemetry. 2015; 3:32.
13. Rauch HE, Tung F, Striebel CT. Maximum likelihood estimates of linear dynamic systems. AIAA J. 1965; 3:1445–50.
14. McClintock BT, London JM, Cameron MF, Boveng PL. Modelling animal movement using the Argos satellite telemetry location error ellipse. Methods Ecol Evol. 2015; 6:266–77.
15. Maxwell SM, Hazen EL, Lewison RL, Dunn DC, Bailey H, Bograd SJ, et al. Dynamic ocean management: Defining and conceptualizing real-time management of the ocean. Mar Policy. 2015; 58:42–50.
16. Hazen EL, Scales KL, Maxwell SM, Briscoe DK, Welch H, Bograd SJ, et al. A dynamic ocean management tool to reduce bycatch and support sustainable fisheries. Sci Adv. 2018; 4:eaar3001.
17. Pirotta V, Grech A, Jonsen ID, Laurance WF, Harcourt RG. Consequences of global shipping traffic for marine giants. Front Ecol Environ. 2019; 17:39–47.
18. Dunn DC, Maxwell SM, Boustany AM, Halpin PN. Dynamic ocean management increases the efficiency and efficacy of fisheries management. Proc Natl Acad Sci. 2016; 113:668–73.
19. Treasure AM, Roquet F, Ansorge IJ, Bester MN, Boehme L, Bornemann H, et al. Marine mammals exploring the oceans pole to pole: A review of the MEOP consortium. Oceanography. 2017; 30(2):132–8.
20. Harcourt R, Sequeira AMM, Zhang X, Roquet F, Komatsu K, Heupel M, et al. Animal-Borne Telemetry: An Integral Component of the Ocean Observing Toolkit. Front Mar Sci. 2019; 6:326. https://www.frontiersin.org/article/10.3389/fmars.2019.00326. Accessed 07 July 2020.
21. Freitas C, Lydersen C, Fedak MA, Kovacs KM. A simple new algorithm to filter marine mammal Argos location. Mar Mamm Sci. 2008; 24:315–25.
22. Patterson TA, McConnell BJ, Fedak MA, Bravington MV, Hindell MA. Using GPS data to evaluate the accuracy of state-space methods for correction of Argos satellite telemetry error. Ecology. 2010; 91:273.
23. Kristensen K, Nielsen A, Berg CW, Skaug H, Bell BM. TMB: Automatic Differentiation and Laplace Approximation. J Stat Softw. 2016; 70:1–21.
24. Jonsen I, McMahon CR, Patterson TA, Auger-Méthé M, Harcourt R, Hindell MA, et al. Movement responses to environment: fast inference of variation among southern elephant seals with a mixed effects model. Ecology. 2019; 100:e02566.
25. Jonsen I, Patterson TA. foieGras: Fit continuous-time state-space and latent variable models for filtering Argos satellite (and other) telemetry data and estimating movement behaviour. 2019. R package version 0.4.0. Available from: https://cran.r-project.org/package=foieGras. Accessed 07 July 2020.
26. Bryant E. 2D location accuracy statistics for Fastloc® cores running firmware versions 2.2 & 2.3. Redmond: Wildtrack Telemetry Systems Ltd.; 2007. Technical Report TR01. http://www.wildtracker.com/results_files/Technical%20Report%20TR01.pdf.
27. Silva MA, Jonsen I, Russell DJF, Prieto R, Thompson D, Baumgartner MF. Assessing performance of Bayesian state-space models fit to Argos satellite telemetry locations processed with Kalman filtering. PLOS One. 2014; 9:e92277.
28. Tornqvist L, Vartia P, Vartia YO. How should relative changes be measured? Am Stat. 1985; 39:43–6.
29. Boyd JD, Brightsmith DJ. Error properties of Argos satellite telemetry locations using least squares and Kalman filtering. PLOS One. 2013; 8:e63051.
30. Costa DP, Robinson PW, Arnould JPY, Harrison AL, Simmons SE, Hassrick JL, et al. Accuracy of ARGOS locations of pinnipeds at-sea estimated using Fastloc GPS. PLOS One. 2010; 5:e8677.
31. Hoenner X, Whiting SD, Hindell MA, McMahon CR. Enhancing the use of Argos satellite data for home range and long distance migration studies of marine animals. PLOS One. 2012; 7:e40713.
32. Hindell MA, Reisinger RR, Ropert-Coudert Y, et al. Tracking of marine predators to protect Southern Ocean ecosystems. Nature. 2020; 580:87–92.
33. McClintock BT. Incorporating telemetry error into hidden Markov models of animal movement using multiple imputation. J Agric Biol Environ Stat. 2017; 22:249–69.
Acknowledgments
We thank Michael Weise and Bill Woodward for motivating the validation study, Holly Lourie and Sophie Baudel for assistance with CLS reprocessing and for providing background information, Melinda Holland and Kenady Wilson for facilitating data access, and Jason Hartog and Karen Evans for valuable comments on an early version. CG thanks Baptiste Picard and many others who helped collect data. WJG and SCV thank Greg & Lisa Morgan for field assistance.
Funding
IDJ was supported by Macquarie University’s co-Funded Fellowship Program and by external partners: Office of Naval Research grant N00014-18-1-2405; the Integrated Marine Observing System - Animal Tracking Facility; the Ocean Tracking Network; Taronga Conservation Society; Birds Canada; and Innovasea/Vemco. TAP was supported by CSIRO Oceans & Atmosphere internal research funding scheme. SSK was supported by a National Science Foundation Office of Polar Projects research grant.
California sea lion, Cape fur seal, and leopard seal data collection was funded by the National Oceanographic Partnership Program, the Office of Naval Research, the Gordon and Betty Moore Foundation, the David and Lucile Packard Foundation, the Sloan Foundation, and the California Sea Grant Program.
Hawksbill turtle data collection was funded by the Australian Government under the Caring for Country Initiative, the Anindilyakwa Land Council, the Northern Territory Government, Charles Darwin University, and the ANZ Trustees Foundation – Holsworth Wildlife Research Endowment.
Leatherback turtle data collection was funded by Defra Darwin (17005), the Peninsula Institute for Marine Renewable Energy, and Harvest Natural Resources and Vaalco Energy Inc.
Southern elephant seal data collection received logistical and financial support from the Institut Polaire Français Paul Emile Victor (IPEV programs 109, H. Weimerskirch and 1201, C. Gilbert), and financial support from the CNES-TOSCA program “Eléphants de Mer Océanographe” and SNO-MEMO.
Northern gannet data collection was funded by NERC New Investigators Grant (NE/G001014/1), the Peninsula Research Institute for Marine Renewable Energy and EU INTERREG Project CHARM III.
Author information
Contributions
Conceived and designed the study: IDJ, CRM, TAP. Developed methodology: IDJ, TAP. Performed the analyses: IDJ. Contributed data: DPC, PDD, WJG, BJG, CG, XH, SK, PWR, SCV, SW, MJW, MAH, RGH, CRM. Wrote the paper: IDJ. Edited the paper: All. All authors read and approved the final manuscript.
Ethics declarations
Consent for publication
Not applicable.
Competing interests
The authors declare that they have no competing interests.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Additional file 1
Supplementary figures.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
About this article
Cite this article
Jonsen, I.D., Patterson, T.A., Costa, D.P. et al. A continuous-time state-space model for rapid quality control of Argos locations from animal-borne tags. Mov Ecol 8, 31 (2020). https://doi.org/10.1186/s40462-020-00217-7
DOI: https://doi.org/10.1186/s40462-020-00217-7
Keywords
 Animal-borne sensors
 Biotelemetry
 foieGras R package
 Global Positioning System
 Seabird
 Pinniped
 Sea turtle
 Template Model Builder