The main goal of the current work was to systematically review the scientific knowledge contained in peer-reviewed research articles that proposed threshold(s) used in establishing soccer players’ movement intensity. A total of thirty-nine published papers addressing velocity and/or acceleration/deceleration bands respectively using tracking systems and inertial measurement were considered here. Based on these, our main collated findings were: (1) for either, velocity zones or acceleration demands, the preferred method to define intensity among studies was based on outcomes from 40 m sprint test, which was used in more than one-third of all literature covered in the searches; (2) the most frequent data collection systems employed to obtain external load measures were GPSs adjusted at a sampling frequency of 10 Hz (~72%); these were also often used in creating the thresholds (~41%); (3) nearly half of evidence is derived from youth male samples and during competitive matches; (4) there was a predominant choice toward depicting movements solely in the meter unit (~60%) and it is evident that the specific type of displacement recorded is unspecified in all excepting one work. Finally and of most importance to the current aim (5) it was not possible to identify a standardization in speed categories linked with distinct levels of movement given the wide discrepancies found across literature formulating individualized thresholds.
An important finding of this review study was that 40 m sprint test seemingly the most frequent procedure in establishing individualized speed thresholds in soccer. In fact, recommendations were formulated indicating that a 40 m path may be sufficient for players reaching their peak speed, being faster than in competitive matches and thereby possibly represent an adequate method of depicting players' external load [54]. Nevertheless, in none of the studies considering the 40 m sprint test either when evaluating players velocity [17–27] or accelerations/decelerations [11, 12, 52], there was a mention regarding its measurement properties (e.g. validity and reliability) for the specific population assessed while only three [11, 12, 23] provided references which commented or directly determined a given of these aspects (r = 0.95–0.97; ICC = 0.94–0.99; TEE = 1.67–1.95%) [37, 55]. The transference of a locomotor testing outcome to match-play running performance is also critical when selecting appropriate testing tools. The so-called construct–or ecological–validity of the 40 m sprint test lacks consensus [see for a review: [56]] as reports are confirming its associations with match running performance [57] whilst no meaningful [37] or only position-dependent results were elsewhere observed [58]. One existing potential solution is the adoption of the maximal sprinting speed (MSS) [59] or a cluster technique using players’ velocity samples [60] both obtained in the own matches, as input parameters to obtain thresholds. Yet only a few studies included here considered in-game MSS [47–49], and the clustering method was challenged [61]. Thus, despite gaining popularity to help individualize soccer demands, doubts may persist on the practical value of 40 m on-field sprinting test.
It is important to note that the individualization of thresholds may arguably benefit soccer practitioners. Examples include an a priori more accurate representation of player’s demands experience in practice or match-play when using individualized thresholds. Enhanced ability in the management of individuals' workload will theoretically allow for the design of more effective recovery schedules and periodization training [19, 30, 31, 59]. Also, the use of customized thresholds helps reduce high-speed running variability in either, within and between matches as well as from an individual or position-specific point of view [62]. On the other hand, some studies provided evidence that it can represent no additional value to the understanding of soccer external loads. The clearest example falls in the case of determining dose-response to daily training routines. MSS in 40 m test routine showed impaired correlations with heart rate, ratings of perceived exertion [26] and wellness [27]. Furthermore, MSS is not necessarily higher in 40 m testing as compared to match-play outputs [63]. Most important, training-induced adaptations in running performance encountered during actual match-play are not always matched with those changes verified in 40 m sprint performance [64] whilst the frequency to which fitness components needs re-assessment, aiming at adjust thresholds accounting for those time-related changes, seemingly also unknown [65]. Collectively, such results reinforce the lack of full confidence and consensus in applying a 40 m linear sprint test as a way to obtain “anchors” of speed/acceleration thresholds. Soccer demands generally involve also energetic cost in changing direction, unorthodox displacements and physical impacts [27] which might be difficult to capture in common outcome metrics derived from traditional linear sprint tests.
In an attempt to overcome possible limitations of a single bout maximal linear sprint as mentioned above, also considering the lowest weight it may have to a dataset of soccer external load measures collected in official matches [66] likely given the one-off nature of MSS in soccer [67], some authors employed test protocols more prolonged in nature. These included Yo-Yo Intermittent Recovery Test level 1, Vam-Eval maximal incremental running test, 30:15 Intermittent Fitness test (30–15IFT) and Conconi test performed on a treadmill. Despite having large-to-very large associations with match-play running performance either relating to the total distance covered or high-intensity running [56], graded exhaustive treadmill tests represent serious limitations to most clubs given time requirements, costs and players motivation implying a need to consider other solutions with more prominent practical value such as field-based assessments [68]. Instead, a trend of a recent increase in the use of anaerobic speed reserve (ASR) as a threshold was noted here for approximately one-fourth of all studies included of which most were published over the last three years [20, 27, 35, 36, 38, 39, 42]. The ASR is a compound of two markers, i.e. computed as the difference between player MSS and maximal aerobic speed thus combining in a single index the individual’s fitness characteristics observed on a separate all-out sprint effort and those from vV̇O2max. Such metric seemingly of benefit in creating thresholds since players showing similar vV̇O2max (not uncommon across outfield playing positions; [69]) may not have a matched MSS performance [57,70]. In this conditions, ability to cope with a given load, in particular at high-intensity domain, would depends on the proportion of ASR reached [71] rather than looking solely for a percentage of the former fitness indicator. Again, one of the issues which arguably preclude unrestricted recommendation of ASR to date is none empirical evidence supporting its construct validity (e.g. 30–15IFT performance versus match-play running outputs; see also [72]).
Regardless of whether there is currently an unsolved debate, since studies recommend using [54, 62, 73], maybe [19, 65, 74] or others suggest avoid [26, 56, 59] fitness testing when defining speed thresholds, a lack of standardization was observed here in both, determinations of individualized speed/acceleration categories representing distinct workload demands and the parameters used to extract a given anchor. To be explicit and using the 40 m sprint test as an example, timing gates at the start, 10 and 40 m [22]; 30 and 40 m [21]; 10-m intervals [17–20, 26, 27]; or MSS attained independent of location [23, 24, 52] were among methods used. Furthermore, following these procedures, the levels for “higher” intensity recorded were defined considering 50–60% [52], 80–100% [19, 20], >61% [17, 18, 21], >65% [26], >75% [22], >80% [23, 24, 27] or >90% [52] of MSS. It makes it difficult to directly compare results across literature and provide systematic concluding remarks on the most appropriate one. To assist move beyond on this question, intervention designs assessing the practical effect of individualized thresholds in various aspects (e.g. fitness, injury and match performance) are recommended as opposed to common application/comparison of methods [73].
Finally, particular attention should be also paid to the technology employed in obtaining performance indices often used in originating the movement intensity thresholds. Ten-hertz GPS were identified as the most common devices used in both, determination of velocity/acceleration thresholds during testing routines and collection of task external loads. It is recognized that these generally provide valid measures to assess distance and velocity in linear movements and during simulations of running characteristics pertaining to team sports while no additional benefits of a nearby higher acquisition frequency can exist [6, 75]. However, during acceleration occurrences above 4 m/s² limits, accuracy using 10 Hz GPS is not always ensured [76]. One can argue that there is not a true ‘gold standard’ available in computing external load such as running performance [17, 77, 78] while others recognize high-speed three-dimensional motion capture systems [6]. Context, logistics and the need for a qualified team that has the how-to for data treatment of image sequences are among potential constraints on the use of the latter. Examples include respectively the costs involved, set-up configuration and time-consuming nature which collectively make difficult application of video-based tracking systems in practice. This is also observed in the present analysis owing that only one study using the latter method was found (see Table 2). Regardless of which EPTS or IMU are used, one further point requiring more caution is the fact that measurement error should be ideally evaluated considering the specific location they were collected [see for a review: [59]], and only 2 studies included in our analysis did it such way [47, 48]; most cited data from previous investigations or reported just the horizontal dilution of precision calculated by the proprietary software. In sum, interpreting current evidence on speed but not acceleration thresholds using 10 Hz GPS may be reliable, and quality-control experiments are still needed within original investigations.