Skip to main content

An Open Access Journal

A Data Envelopment Analysis approach for accessibility measures: Simulating operational enhancement scenarios for railway across Europe

Abstract

Introduction

As well known, infrastructure endowment influences competitiveness of a region since the characteristics of a transport system in terms of capacity, connectivity, speeds, etc. determine the advantages/disadvantages of an area compared to other locations. This article attempts to investigate the potential impacts on rail accessibility across Europe when different possible operational enhancement scenarios are simulated.

Methods

The simulations are carried out by means of a combination of the TRANSTOOLS rail network and Traffic Analyst, the post-processing analyses are implemented in Matlab and the results for each zone (at NUTS3 level) are reported both in tabular form and in easy-to-read ArcGIS maps. Several accessibility measures are evaluated including two Data Envelopment Analysis (DEA) approaches aiming to construct a composite index for embracing all the complementary information provided by ‘partial’ accessibility sub-indicators; to better evaluate and understand the results either sensitivity and robustness analyses are performed for both the aggregate indicators.

Results

The outcomes provide insight into where major benefits in terms of accessibility can be expected; in particular the current infrastructure endowment already benefits many regions but improvements in speed could still increase significantly rail accessibility across Europe (mainly outside the core area as in Poland, Bulgaria, Romania, Slovakia, etc.). Furthermore both the proposed global indexes, although associating different ‘endogenous’ weights to the various sub-indicators, appear to be worthy and robust against uniform random noise.

Conclusions

Ultimately the results provide information useful for the prioritization of investment needs; moreover even if the interpretation of the partial accessibility indicators is clear and useful for policy-makers, the evaluation of a composite measure could allow planners not only to compare or fully rank the level of accessibility for different regions but even to control for eventual confusing and/or incomplete results that may appear when using only a partial approach.

1 Introduction

As stated in the White Paper on Transport [1]: “Infrastructure shapes mobility. No major change in transport will be possible without the support of an adequate network and more intelligence in using it. Overall, transport infrastructure investments have a positive impact on economic growth, create wealth and jobs, and enhance trade, geographical accessibility and the mobility of people. It has to be planned in a way that maximizes positive impact on economic growth and minimizes negative impact on the environment”. In this context, ascertained the strong relationship between accessibility and socio-economic regional development, in the last decades the topic of impacts on accessibility of transport policies has been widely treated in the scientific literature [217] and also in several European projects [1821]. Beyond general assessment papers [27], several authors focused their attention either on particular corridors/areas [811] or on a wider scale [1217].

This article simulates various European-wide scenarios assuming different rail operational enhancements (such as improving speeds) and it attempts to assess qualitatively the potential impacts on railway accessibility across Europe.

Accessibility is a complex concept with various facets; inter alia it could be defined as ‘the amount of effort for a person to reach a destination’ or ‘the number of activities which can be reached from a certain location’ [3]. Indicators of accessibility measure the benefits households and firms in an area enjoy from the existence and use of the transport infrastructure relevant for their area [13]. As highlighted by Wegener et al. [13], accessibility could be calculated in function of origin, destination, spatial impedance, type and mode of transport. Our analysis focuses on European regions (origins and destinations) and rail passenger services; the spatial impedance between two regions is assumed equal to the travel time along the minimum path between the zones over the rail network.

The study analyses several accessibility indexes offering complementary information and mainly based on two concepts: travel resistance (cost, time, etc.) and attractiveness of urban agglomerations (depending on variables such as population, employment or gross domestic product). In particular four different indicators (emphasizing different cost or attraction attributes) are explored: index of location, relative efficiency of the network, potential and daily accessibility.

Since the location of each zone could influence the measures of the mentioned sub-indicators (core-periphery patterns), the article also explores two different approaches trying to construct an accessibility composite indicator, thought as a synthetic parameter embracing all the complementary information delivered by the other four ‘partial’ accessibility measures. In particular the research examines:

  • a Data Envelopment Analysis (DEA) approach as already proposed among others by Martin, Gutiérrez, and Roman [2225],

  • a Benefit of Doubts (BOD) approach widely threated in the scientific literature for creating composite indicators (e.g. [2637]), but according to the authors’ knowledge, never used for synthetic accessibility index.

The Data Envelopment Analysis is a non-parametric method for evaluating the relative efficiency of Decision Making Units; in the last years DEA has been extensively applied in several sectors, and also in transportation [3844]. The methodology consists in obtaining from the available data an approximation of the “best-practice” frontier by means of the linear programming; efficiency measures are then investigated according to this surface [4548].

Both the proposed approaches are based on the use of the DEA to create a composite indicator, but the main difference resides on the adopted model: in the first case the sub-indicators are either considered as inputs or outputs according to their characteristics and an output-oriented DEA with variable return of scale (VRS) is adopted to take in account the great heterogeneity among of the various European regions. The BOD approach, instead, as pointed out by Despotis [34], is formally equivalent to the original input oriented and constant-returns-to-scale DEA model presented by Charnes et al. [33], with the sub-indicators representing the different outputs and allocating a single ‘dummy input’ with value unity to each country.

To test the proposed methodologies and also to better understand the results, sensitivity and robustness analyses are performed on both the aggregate indicators; although associating different weights to the various sub-indicators, both the models appear to be robust against uniform random noise. Furthermore the outcomes of the study show how the current European railway endowment already benefits many regions (mainly in Italy, Spain, Germany, Netherlands, UK, Austria, France, Belgium, etc.) but improvements in speed could still increase the accessibility of various areas.

Regarding the structure of this paper, after this introduction and brief review of the technical literature, the next paragraph describes in detail the proposed methodology and the utilized data while paragraph 3 illustrates the results of the analysis and finally paragraph 4 sets out our conclusions.

2 Data & methodology

This article summarizes the results of the model simulations carried out in order to estimate the potential impacts on rail accessibility of infrastructure enhancements across Europe (see also [17]). The model simulations have been performed by a combination of the TRANS-TOOLS rail network and Traffic Analyst: the TRANS-TOOLS (“Tools for transport forecasting and scenario testing”) is a European transport network model that has been developed in collaborative projects funded by the European Commission’s Joint Research Centre and DG TREN (for more information see [49]) and it provides a quite detailed European rail network for 2005; its assignment module (Traffic Analyst by Rapidis, see [50]) allows the model to capture changes in route choice as a result of hypothesized changes in speed and frequency.

The data for origins and destinations at NUTS 3 level (the Nomenclature of Units for Territorial Statistics is a geocode standard developed and regulated by the European Union) and for the baseline (2005) have been assumed according to the ETIS Plus figures (downloadable at [51]): ETIS Plus is a FP7 project on data collection for transport at European level aiming at providing a bridge between official statistics and applications within the transport policy theme; in practice it consists of an European Transport Policy Information System, combining data, analytical modelling with maps (GIS), and a single online interface for accessing the data.

The post-processing analyses of the results have been carried out with utilities developed in Matlab, while the outcomes for each zone have been also reported in easy-to-read ArcGIS maps (Fig. 1).

Fig. 1
figure 1

Flowchart of methodological steps

Beyond the baseline (2005), three different scenarios have been implemented by changing speeds as follows:

  • Scenario 200 km/h: speed increased up to 200 km/h for all links that currently have a speed lower than 200 km/h. For links with current speed higher than 200 km/h (high speed trains), no changes were introduced.

  • Scenario 90 km/h: speed increased up to 90 km/h for all links that currently have a speed lower than 90 km/h. For links with current speed higher than 90 km/h, no changes were introduced.

  • Scenario 45 km/h: speed decreased to 45 km/h for all links that currently have a speed higher than 45 km/h. For links with current speed lower than 45 km/h, no changes were introduced.

In practice the article assumes the best and worst (hypothetic) network settings by simulating respectively the Scenario 200 km/h and Scenario 45 km/h; this last one hypothesizes a degradation of the current network (or better an imaginary configuration previous to the baseline), to evaluate the benefits of the current infrastructure endowment compared to this lower bound. Subsequently the analysis estimates the effects of a more feasible and realistic interventions such as increases of speed of some links (with current speed lower than 90 km/h).

As also proposed in other studies [18] our analysis considers the centroids of NUTS3 regions as origins and destinations. The all-or-nothing assignment module calculates the minimum paths through the networks, i.e. the path with minimum travel times between the centroids of the regions. According to the new travel times between each couple OD, for each scenario various accessibility indexes have been evaluated.

In the last decades, in fact, accessibility has been measured by several types of indicators often based on two different concepts: travel resistance (cost, time, etc.) and attractiveness of urban agglomerations (depending on variables such as population, employment or gross domestic product). In this analysis we have considered four different accessibility measures offering complementary information and highlighting different cost or attraction attributes.

The location index represents the average travel time between each couple OD weighted on the mass, measured in this analysis by population of the destination regions:

$$ Li=\frac{{\displaystyle {\sum}_j{t}_{ij}}\ast {W}_j}{{\displaystyle {\sum}_j{W}_j}} $$
(1)

where:

  • L i represents the location index of origin i;

  • t ij represents the travel time between i and j;

  • W j represents the population of destination j (activities to be reached at j).

Since no distance decay function (and so no discrimination between neighbor or far locations) is considered, the accessibility for each zone depends on the geographical position; remote locations present low accessibility values and even a good transport infrastructure endowment could be not enough to overcome the negative effects of a large geographical distance to the main activity areas [11]. Figure 2 reports the location index for each NUT3 zone in Europe and for each simulated scenario showing clearly the above described core-periphery patterns; as expected the scenarios with maximum speed of 45 km/h or minimum speed of 200 km/h on the whole railway network present respectively the highest and lowest values of the location index.

Fig. 2
figure 2

Location index for each NUTS3 zone and each scenario

The network efficiency indicator “represents the distance between the real accessibility against the best accessibility that can be obtained if the zone is connected with all the other regions by the best possible infrastructure” [22], in our case a network with speed on each link of at least 200 km/h.

It offers a measure in terms of the relative ease of access according to the network efficiency; the relative ease of access is represented by the ratio of the travel time between 2 zones to the ideal travel time between the same zones assuming the best possible infrastructure, i.e. for us scenario 200 km/h):

$$ {N}_i=\frac{{\displaystyle {\sum}_j\frac{t_{ij}}{{\underset{\bar{\mkern6mu}}{t}}_{ij}}*{W}_j}}{{\displaystyle {\sum}_j{W}_j}} $$
(2)

where:

  • N i represents the network efficiency indicator of origin i;

  • t ij represents the travel time between i and j;

  • t ij represents the travel time between i and j assuming the best possible infrastructure (i.e. for us scenario 200 km/h);

  • W j represents the population of destination j.

This indicator provides an idea on how efficient are the connections from a given zone, regardless of its geographic location: it could occur that a region which is peripheral according to the location index is highly accessible in terms of network efficiency [11]. The following figure shows the values of this indicator for each zone and each scenario; of course the scenario with speed of at least 200 km/h on the whole network represents the best possible (ideal) setting (A = 1 for each region) (Fig. 3).

Fig. 3
figure 3

Efficiency Network Index for each NUTS3 zone and for each scenario

Finally regarding the potential and the daily accessibility, it is possible to express them as a construct of two functions: the activities function (representing the activities or opportunities to be reached) and the impedance function (representing the effort, time, distance or cost needed to reach them):

$$ {A_i}_m={\displaystyle {\sum}_j{W}_j}\ast F\left({c}_{ij}\right) $$
(3)

where:

  • A im represents the accessibility of origin i by mode m (i.e. rail in our analysis);

  • W j represents the population of destination j (activities to be reached at j).

  • F(c ij ) represents the impedance function depending on the generalized cost to reach destination j from origin i;

In practice (3) calculates the total of activities reachable in j weighted by the ease of getting from i to j. As described by the impedance function, the interaction between locations declines with the increasing disutility (distance, time, and/or costs) between them. In general, the perception and valuation of the distance between an origin and a destination differ according to transport modes, purpose of trips, characteristics of the household and of the destination [3]; in the present paper we focused on rail mode and on the population of destination.

Several forms of distance decay function have been already used and described in past accessibility studies; this analysis considers two different shapes depending on travel time (Fig. 4):

  • a negative exponential function to represent the potential accessibility:

    $$ F\left({t}_{ij}\right)={e}^{-\beta t}; $$
    (4)

    as proposed also in [18] in our analysis “the parameter β has been set to 0.005. That means that assuming a travel time between two regions of 0 min (which does not occur in reality), the population of the destination region would be included with its full value in the potential accessibility of the origin region, while for a travel time of little more than 2 h the weight is 0.5, and for a travel time of little more than 5 h the weight goes down to 0.2 only” (see Fig. 4).

  • an ‘ad hoc’ impedance function dropping linearly from 1 to zero with travel times between 1 and 4 h (see Fig. 4) to represent the daily accessibility; this indicator calculates the amount of population or economic activities that can be reached from each zone within a limited period of travel time (in our analysis 4 h), so that it is possible to go and return within the same day and carry out some activities at the destination. The proposed decay function assigns weight 1 to destinations with travel time less than or equal to 1 h and linearly decreasing weights to farther locations with travel time within 4 h. Since the calculations of accessibility have been implemented in Matlab with a post-processing application, it has not been difficult to reproduce the proposed shape.

Fig. 4
figure 4

Considered impedance functions

Of course different shapes of the impedance function could represent diverse aspects (and could provide different indications) of accessibility; the described exponential decay function (potential accessibility) allows to consider the population (activities) of all the reachable zones even if with a diverse weight depending on travel time, while the proposed ‘ad hoc’ decay function associates the accessibility measures only to short trips with travel time within 4 h (allowing ‘daily’ commuting) giving a different and more specific indication. The following figures report the potential and daily accessibility values for each NUT3 zone and each considered scenario.

To allow a first analysis of the results, Table 1 reports the percentage of variation (at country level) of all the considered accessibility indicators comparing each alternative scenario and the baseline 2005 (do-nothing configuration); it presents also the average speed on the network (for each country) weighted on the length of the links.

Table 1 Percentage of variation in accessibility (alternative scenarios vs baseline) and average speed on the network

As evident, all the proposed scenarios show a positive impact on the ‘partial’ accessibility measures for each country but the differences among of the indicators vary according to the specific area; also the baseline scenario (2005) presents a positive variation in accessibility compared to the hypothetic degraded configuration (maximum speed of 45 km/h on the whole network).

Analyzing the results of the previous table and the Figs. 2, 3, 5 and 6 it is not difficult to recognize the different approach of each indicator, such as for example the core-periphery and border patterns of the location index and of the potential accessibility.

Fig. 5
figure 5

Potential accessibility for each NUT3 zone and for each scenario

Fig. 6
figure 6

Potential daily accessibility for each NUTS3 zone and for each scenario

As pointed out in [24], in fact, “the interpretation of the partial accessibility indicators is clear and useful for policy-makers, but it does not provide a synthetic and global measure that allows planners to compare or fully rank the level of accessibility for different regions or cities within Europe. Besides this, it is necessary to control for some contradictory results that may confuse planners who only use a partial approach.”

Table 2 reports the Pearson and the Spearman correlation coefficients among of the considered accessibility indexes and for each scenario to assess how well their relationship can be described using respectively a linear or a monotonic function; results seem to justify the assumption that the different ‘partial’ indicators can be considered complementary more than substitutive. As expected the potential and the daily accessibility indexes present the strongest correlation; we have retained both the indicators in our successive analyses since, how already noticed, they allow us to measure accessibility from different perspectives (overall or ‘within-4-h/daily’ activities reachable from each zone).

Table 2 Pearson correlation and Spearman rank correlation (* for values with significance levels below 0.05)

To better analyze the accessibility impacts of the simulated operational enhancements and trying to summarize the complementary information provided by the four described indicators, the authors have attempted to explore the construction of a composite indicator (CI) by means of two different approaches:

  • Data Envelopment Analysis (DEA) [2225];

  • Benefit of Doubts (BOD) [2636].

The first methodology is based mainly on the estimation of a DEA-accessibility index as already proposed by Martin, Gutiérrez, Roman and also Reggiani [2225] while the second method recalls the wide scientific literature for creating composite indicators in analysis at large scale [2636].

A frequent issue concerning the development of a global index is related to the weights to assign to the partial indicators; in this context, the main appeal of DEA-based composite indicators (CIs) is that they “look for endogenous weights, which maximize the overall score for each decision-making unit given a set of other observations” [31].

Indeed the Data Envelopment Analysis is a non-parametric methodology for evaluating the relative efficiency of Decision Making Units; practically the proposed DEA approach suggests solving the following multiple objective problem of accessibility for both the scenarios with and without interventions:

$$ \begin{array}{l}{ \min}_j(LocationAccessibilityIndex)={ \min}_j\left({L}_j\right)\\ {}{ \min}_j(EfficiencyNetworkIndex)={ \min}_j\left({N}_j\right)\\ {}{ \max}_j(PotentialAccessibilityIndex)={ \max}_j\left(P{A}_j\right)\\ {}{ \max}_j(PotentialDailyAccessibilityIndex)={ \max}_j\left(PD{A}_j\right)\end{array} $$
(5)

where j represents the generic European NUT3 zone and L, N, PA and PDA represent respectively the location index, the network efficiency indicator, the potential and the daily accessibility.

In other words this “synthetic DEA-based indicator is calculated as the inverse of the maximum proportional accessibility outputs that can be obtained for the indicated accessibility inputs” [10].

The model determines for each scenario the most efficient zones (from an accessibility perspective) to individuate the frontier of the envelopment surface; the regions not lying on the frontier are inefficient and the measurement of the grade of inefficiency is represented by their distance from this ‘best-practice’ frontier.

When the data cannot be easily interpreted as inputs or outputs, a general rule suggests to consider the variables for which lower levels are better as inputs (in our case Location index and Network efficiency), and to treat as outputs those variables for which higher amounts are better (potential and daily accessibility in our analysis) [10].

We have assumed variable returns to scale (VRS) because of the great heterogeneity among of the various EU regions and an output orientation.

The Benefit of Doubts approach, instead, is rooted in the copious literature concerning CIs. Considering m sub-indexes and n regions the idea is to merge the sub-indicators’ values per region into a single number, e.g. their weighted average. In absence of reliable information about the weights, the proposed approach endogenously determines the weights maximizing the CI value for each region. In practice it comes to solving the following linear programming problem for each zone j:

$$ \begin{array}{l}C{I}_j={\displaystyle {\sum}_i{y}_{ij}}\ast {w}_{ij}\hfill \\ {}{\displaystyle {\sum}_i{y}_{ij}}\ast {w}_{ij}\le 1\left( bounding\kern0.2em constraint\right)\hfill \\ {}{w}_{ij}\ge 0\left(non- negativity\kern0.2em constraint\right)\hfill \end{array} $$
(6)

where:

  • CI j represents the value of the composite indicator for region j;

  • y ij represents the original value of sub-indicator i for region j;

  • w ij represents the weight of sub-indicator i for region j;

The benefit-of-the-doubt interpretation of the methodology resides in the fact that the method chooses those weights maximizing the resulting indicator: the highest relative weights are accorded to those sub-indicators for which the zone j achieves the best performance (in relative terms) when compared to the other regions in the sample.

This model is equivalent to the original input oriented, constant-returns-to-scale DEA model presented by Charnes et al. [33] (for more details see [29]), with the sub-indicators representing the different outputs and a single ‘dummy input’ with value unity to each country.

Although, as pointed out in [30], a BOD-based indicator meets the property of units invariance, which makes the normalization stage redundant, the selected four sub-indicators have not the same direction, meaning that for some of them (location index and network efficiency) higher values represent worse performances. To overcome this issue the “min-max” normalization method has been applied to the results of the partial indexes according to the original direction of the variables:

$$ {I}_{ij}=\frac{y_{ij}-{ \min}_j\left({y}_i\right)}{{ \max}_j\left({y}_{ij}\right)-{ \min}_j\left({y}_i\right)} $$
(7)
$$ {I}_{ij}=\frac{{ \max}_j\left({y}_i\right)-{y}_{ij}}{{ \max}_j\left({y}_{ij}\right)-{ \min}_j\left({y}_i\right)} $$
(8)

where:

  • i and j represent respectively the sub- indicator and the region

  • I ij represents the normalized value of sub-indicator i in region j;

  • y ij represents the original value of sub-indicator i in region j;

To better compare and also to test the two proposed approaches, sensitivity and robustness analyses are performed on both the aggregate indicators for the baseline 2005 (do-nothing scenario). We have carried out the Data Envelopment Analysis several times eliminating the sub-indicators one by one, to evaluate the weight, the importance associated to each dimension in both the procedures and also to estimate the changes in scores.

Tables 3, 4, 5 and 6 report the Spearman correlation coefficients across the sub-indicators (eliminating one of them per time) and across the scenarios respectively applying the DEA or the BOD approach. In particular:

  • Scenario 45 (or 45), Baseline 2005 (or 2005), Scenario 90 (or 90) and Scenario 200 (or 200) indicate the simulated scenarios;

  • -LA indicates the DEA or BOD approach carried out neglecting the location index;

  • -NA indicates the DEA or BOD approach carried out neglecting the network efficiency;

  • -PA indicates the DEA or BOD approach carried out neglecting the potential accessibility;

  • -PDA indicates the DEA or BOD approach carried out neglecting the potential daily accessibility;

  • All indicates the DEA or BOD approach carried out considering all the sub-indicators

Table 3 Spearman rank correlation across indicators applying the DEA approach
Table 4 Spearman rank correlation across scenarios applying the DEA approach
Table 7 Mean and standard deviation of the variation coefficient
Table 5 Spearman rank correlation across indicators applying the BoD approach

The tables point out the different set of “endogenous” weights (and the consequent relevance) associated to the sub-indicators applying the two methods and also how they vary across the scenarios; while for the DEA the most relevant indicator comes to be the potential accessibility, for the BOD procedure the major changes in results are obtained eliminating the location index. This is due to the different approach of the methods; as noticed by Martin and Reggiani [24], the DEA results “can be interpreted as the distance that separates every region from the most accessible location that can be found taking into account all the observations”. In other words this “indicator measures how accessible a region is with respect to all the other regions included in the sample”. The BOD approach, instead, maximizes the CI value (see formula (6)) for each region and so it takes in account also the heterogeneity of the sub-indicators among of the regions: the highest relative weights are accorded to those dimensions for which the region j achieves the best performance (in relative terms) when compared to the other regions in the sample; in other words “each region has its own weights which are optimal and guarantee the best possible position for the associated region among all other regions in the sample. With any other weighting scheme, the relative position of that region would be worse.”[35].

The main disadvantage of this last method is that without additional setting constraints countries performing very well only in one indicator can be considered successful. This is why the scientific literature has focused on different ways to take in account experts’ (subjective) opinions or to set additional relative weight constrictions (e.g. pie share constraints). Anyway we have explored the original BOD approach and so no additional constraints are added except those already described in (6). The described effect is quite evident in Tables 5 and 6 where the Scenario 200 is less relevant than the others since the network efficiency index presents unity value for each region (reference “best-possible” infrastructure).

Finally a robustness analysis has been performed on both the synthetic indicators adding a uniform casual noise to the normalized value of each sub-indicator, according to the formula:

$$ {I}_{ij}^k={I}_{ij}+ Unif\left(-\alpha, \alpha \right)\ast \underset{\bar{\mkern6mu}}{I_i} $$
(9)

where

  • I ij k represents the normalized value of sub-indicator i in region j and for the casual extraction k;

  • I ij represents the normalized value of sub-indicator i in region j;

  • Unif(−α,α) represents a uniform casual distribution with values between −α and α. The value of α has been set to 0.05 (5 %) in our analysis.

  • I i represents the average of the normalized values of sub-indicator i across the regions;

In practice we have performed a Monte Carlo simulation by extracting 100 random values (within the range defined) of noise for each sub-indicator and assessing the related variation in outcomes. To evaluate the dispersion of the results, for each region we have calculated the variation coefficient, defined as the ratio of the standard deviation to the mean:

$$ {C}_{Vj}=\frac{\sigma_j}{\mu_j} $$
(10)

where

  • μ i is the mean of the 100 simulated scores (in terms of ranking) of the composite indicator for the region j;

  • σ j represents the standard deviation of the 100 simulated scores (in terms of ranking) of the composite indicator for the region j;

Table 7 reports the mean and the standard deviation across all regions of the variation coefficient C Vj . Both the approaches seem to be robust against the assumed uniform casual noise: by hypothesizing a value of α equal to 5 % the results show a variation coefficient of less than 2 % in both the cases.

Table 6 Spearman rank correlation across scenarios applying the BoD approach

3 Results

As reported below the outcomes of this study show how the current scenario already benefits many regions but improvements in speed could still increase significantly rail accessibility across Europe. For example the above reported Table 1 indicates that the current infrastructure endowment (baseline 2005 versus scenario 45) benefits in terms of accessibility mainly the countries in the European core (such as Austria, Germany, Belgium, Luxembourg, Netherlands, Italy, United Kingdom) and even the Iberian Peninsula and France.

The same outcomes are also evident in Figs. 7, 8, 9 and 10, reporting the rankings of accessibility (and the variation in it) across Europe and for all the scenarios. At this stage is worthy to notice that, as highlighted in the scientific literature, both the proposed composite indicators do not provide scores, but ranks; in other words, either DEA or BOD provides values but they are directly incomparable, and this explains why the results are reported in relative rankings. In particular Figs. 9 and 10 show the variation in ranking of accessibility for each scenario versus the baseline 2005; a negative value means that a region is better ranked when moving from the baseline towards the assumed scenario.

Fig. 7
figure 7

DEA-based ranking of accessibility across Europe and for each scenario

Fig. 8
figure 8

BOD-based ranking of accessibility across Europe and for each scenario

Fig. 9
figure 9

DEA-based variation in ranking of accessibility for each scenario vs the baseline 2005

Fig. 10
figure 10

BOD-based variation in ranking of accessibility for each scenario vs the baseline 2005

Table 1 and the next figures show also how slight improvements in speed (at least 90 km/h for all the links) could benefit mainly peripheral and border regions (in Bulgaria, Romania, Poland, Greece, Hungary, Slovakia, etc.) and how eastern regions would gain accessibility advantages by ideally improving the speed on the current railway network at least at 200 km/h; obviously, although technically not feasible (or hardly achievable), this hypothesis tries only to represent an extreme ‘optimum’ situation, an upper limit for the rail network (as well as the scenario 45 km/h represents a lower bound) with which comparing more plausible and less ambitious interventions.

Moreover, analyzing the DEA and BOD maps reported below, it seems quite reasonable that they embrace all the different information provided by the ‘partial’ accessibility measures, even if with a different set of endogenous weights, as already noticed in the previous paragraph.

Looking for example at the maps for the baseline (2005) in Figs. 7 and 8, it is not difficult to recognize the outcomes already observed in Tables 3, 4, 5 and 6: while the DEA index is mainly influenced by the potential accessibility, the BOD ranking takes more in account the location index and the network efficiency. This difference affects also Figs. 9 and 10: the BOD-based variation in ranking between the baseline and the degraded scenario 45 for the Iberian Peninsula, for example, is bigger than the corresponding variation applying the DEA approach.

Anyway in the authors’ point of view both the methods are worthy and as already mentioned above they only present a different endogenous weighting scheme: while in the DEA approach the regions with highest values are considered the most efficient ones in terms of accessibility since they present the shortest distances from the “best-practice” frontier, the BOD attempts to be sensible to national policy priorities, meaning that the set of optimal weights for each region guarantees its best position against all other zones in the sample, however the index is calculated so that regions with specializations in a particular component are not penalized for this.

4 Conclusions

As already highlighted above, this article has tried to explore the impacts of improvements of the European railway infrastructures in order to evaluate how these could potentially increase accessibility for passengers across European regions; it summarizes the results of the model simulations carried out with a combination of the TRANSTOOLS rail network and its assignment module (Traffic Analyst).

Beyond the baseline (year 2005), three different scenarios have been tested: two scenarios simulating increases of speed at least up to 90 and 200 km/h on the whole rail network and the last one assuming a decrease down to 45 km/h.

Furthermore the study has considered four accessibility indicators providing different and complementary information: location index, relative efficiency of the network, potential and daily accessibility. Since the results suggest that the location of each zone (core-periphery patterns) could influence the measures of the various accessibility indicators, the article has also evaluated two different approaches (DEA and BOD) to create a composite accessibility index embracing all the complementary information delivered by the other four ‘partial’ accessibility measures. In fact, as already pointed out among others by Martin and Reggiani [24], despite the clear and useful results provided by the sub-indicators, a composite measure could allow policy makers to control for eventual confusing and/or incomplete scenarios which may appear when using only a partial approach. Sensitivity and robustness analyses have been carried out to test the proposed methodologies and to better analyze the results.

The outcomes provide insight into where major benefits in terms of accessibility can be expected; this information, in turn, could also be useful for the prioritization of investment needs. In particular the results of the study show how the current European railway infrastructure already benefits many regions (mainly in Italy, Spain, Germany, Netherlands, UK, Austria, France, Belgium, etc.) but improvements in speed could still increase the accessibility of various areas (mainly outside the core) of Europe (as in Poland, Bulgaria, Romania, Slovakia, etc.).

Moreover the proposed composite indicators appear worthy and robust against casual noise. Their biggest advantage is represented by their different weighting schemes: the weights of the partial indicators do not need to be fixed ‘a priori’ but they are endogenously derived by the methodology.

References

  1. EUROPEAN COMMISSION (2011) WHITE PAPER. Roadmap to a single European transport area –towards a competitive and resource efficient transport system

  2. Schürmann C, Spiekermann K, Wegener M (1997) Accessibility indicators. Berichte aus dem Institut für Raumplanung 39. Dortmund: IRPUD

  3. Geurs KT, Ritsema van Eck JR (2001) Accessibility measures: review and applications. RIVM report 408505 006. National Institute of Public Health and the Environment, Bilthoven

    Google Scholar 

  4. Geurs KTV, Wee B (2004) Accessibility evaluation of land-use and transport strategies: review and research directions. J Transp Geogr 12:127–140

    Article  Google Scholar 

  5. Gutiérrez J, Monzón A, Piñero JM (1998) Accessibility, network efficiency, and transport infrastructure planning. Environ Plan A 30:1337–1350

    Article  Google Scholar 

  6. Morris JM, Dumble PL, Wigan MR (1978) Accessibility indicators for transport planning. Transp Res 13A:91–109

    Google Scholar 

  7. Van Wee B, Geurs K (2011) Discussing equity and social exclusion in accessibility evaluations. Eur J Transp Infrastruct Res 11(4):350–367

    Google Scholar 

  8. Gutiérrez J (2001) Location, economic potential and daily accessibility: an analysis of the accessibility impact of the high-speed line Madrid-Barcelona-French border. J Transp Geogr 9:229–242

    Article  Google Scholar 

  9. Martin JC, Gutiérrez J, Roman C (2006) Evaluating accessibility gains produced by new high-speed train services. In: Jourquin B, Rietveld P, Westin K (ed) Towards better performing transport networks, part III. Routledge Studies in Business Organizations and Networks

  10. Salze P, Banos A, Oppert J-M, Charreire H, Casey R, Simon C, Chaix B, Badariotti D, Weber C (2011) Estimating spatial accessibility to facilities on the regional scale: an extended commuting-based interaction potential model. Int J Health Geogr

  11. López E, Gutiérrez J, Gómez G (2008) Measuring regional cohesion effects of large-scale transport infrastructure investments: an accessibility approach. Eur Plan Stud 16(2):277–301

    Article  Google Scholar 

  12. Spiekermann K, Neubauer J (2002) European accessibility and peripherality: concepts, models and indicators. Nordregio working paper. Nordregio, Stockholm, p 9

    Google Scholar 

  13. Wegener M, Eskelinnen H, Fürst F, Schürmann C, Spiekermann K (2002) Criteria for the Spatial Differentiation of the EU Territory: Geographical Position. Forschungen 102.2, Bonn: Bundesamt für Bauwesen und Raumordnung.

  14. Linneker B (1997) Transport infrastructure and regional economic development in Europe: a review of theoretical and methodological approaches, TRP 133. Department of Town and Regional Planning, Sheffield

    Google Scholar 

  15. Vickerman RW (1995) The regional impacts of Trans-European networks. Ann Reg Sci 29

  16. Martin JC, Gutiérrez J, Roman C (2007) Accessibility impacts of the Trans-European railway network. In: Van Geenhuizen M, Reggiani A, Rietveld P (ed) Policy analysis of transport networks. Part II. Ashgate Publishing Company

  17. Rotoli F, Christidis P, Vannacci L, Lopez-Ruiz HG, Navajas CE, Ibáñez RN (2014) Potential impacts on accessibility and consumer surplus of improvements of the European railway system. Transp Res Procedia 3:319–328

    Article  Google Scholar 

  18. ESPON (2007) Update of selected potential accessibility indicators. Final Report. Spiekermann & Wegener Urban and Regional Research (S&W), RRG Spatial Planning and Geoinformation

  19. EUROPEAN COMMISSION (2013) TIPTAP ESPON project: territorial impact package for transport and agricultural policies. http://www.espon.eu/main/Menu_Projects/Menu_AppliedResearch/tiptap.html

  20. NECTAR-Network on European Communications and Transport Activity Research. http://www.nectar-eu.org/Clusters/Cluster6/Cluster6.htm

  21. EUROPEAN COMMISSION (2010) DG regional policy, fifth report on economic, social and territorial cohesion. http://ec.europa.eu/regional_policy/information/reports/index_en.cfm

  22. Martin JC, Gutiérrez J, Roman C (2003) An accessibility DEA index to measure the impact of new infrastructure investment. The case of the HST corridor Madrid-Barcelona-French Border. ALdE (Asociación Libre de Economia), VI Encuentro de Economia Applicada,, Granada, 6 Junio 2003

  23. Martin JC, Gutiérrez J, Roman C (2004) Data Envelopment Analysis (DEA) index to measure the accessibility impacts of new infrastructure investments: the case of the high- speed train corridor Madrid-Barcelona-French border. Reg Stud 38(6):697–712

    Article  Google Scholar 

  24. Martín JC, Reggiani A (2007) Recent methodological developments to measure spatial interaction: synthetic accessibility indices applied to high‐speed train investments. Transp Rev 27(5):551–571

    Article  Google Scholar 

  25. Hesse C, Evangelinos C, Bohne S (2013) Accessibility measures and flight schedules: an application to the European air transport, European Transport, Issue 55, Paper n° 6

  26. OECD-JRC (2008) Handbook on constructing composite indicators: methodology and user guide. Organisation for Economic Co-operation and Development, Paris

    Google Scholar 

  27. Nardo M, Saisana M, Saltelli A, Tarantola S (2005) Tools for composite indicators building. Joint Research Centre, Ispra, EUR 21682

    MATH  Google Scholar 

  28. Cherchye L, Moesen W, Rogge N, Van Puyenbroeck T (2008) Benefit of the doubt composite indicators. Workpackage 5. Deliverable 5.3. KEI-project supported by European Commission by funding from the Sixth Framework Programme for Research

  29. Cherchye L, Moesen W, Rogge N, Van Puyenbroeck T, Saisana M, Saltelli A, Liska R, Tarantola S (2006) Creating composite indicators with DEA and robustness analysis: the case of the technology achievement index. KEI-project supported by European Commission by funding from the Sixth Framework Programme for Research

  30. Cherchye L, Moesen W, Rogge N, Van Puyenbroeck T, Saisana M, Saltelli A, Liska R, Tarantola S (2008) Creating composite indicators with DEA and robustness analysis: the case of the technology achievement index. J Oper Res Soc 59(2)

  31. Cooper WW, Seiford LM, Zhu J (2011) Handbook on data envelopment analysis (2nd edition). International Series in Operations Research & Management Science 164

  32. Cherchye L, Kuosmanen T (2002) Benchmarking sustainable development: a synthetic meta-index approach. Working paper

  33. Charnes A, Cooper W, Rhodes E (1978) Measuring the efficiency of decision-making units. Eur J Oper Res 2:429–444

    Article  MathSciNet  MATH  Google Scholar 

  34. Despotis DK (2005) A reassessment of the human development index via data envelopment analysis. J Oper Res Soc 56:969–980

    Article  MATH  Google Scholar 

  35. Hudrliková L (2013) Composite indicators as a useful tool for international comparison: the Europe 2020 example. Prague Econ Pap 2013(4):459–473

    Article  Google Scholar 

  36. Vidoli F, Mazziotta C (2013) Robust weighted composite indicator by means of frontier methods with an application to European infrastructure endowment. Stat Appl Ital J Appl Stat 23(2)

  37. Saisana M, Saltelli A, Tarantola S (2005) Uncertainty and sensitivity analysis techniques as tools for quality assessment of composite indicators. J R Stat Soc 168(Part 2):307–323

    Article  MathSciNet  MATH  Google Scholar 

  38. Lozano S, Gutiérrez E, Salmerón JL (2009). Network DEA models in transportation. Application to airports. German Aviation Research Society Seminar on Airport Benchmarking, Berlin, 20–21 November 2009

  39. Schaar D (2010) A method for stakeholder-based comparative benchmarking of airports. Dissertation submitted in partial fulfilment of the requirements for the degree of Doctor of Philosophy at George Mason University

  40. Wang TF, Song DW, Cullinane K (2002) The applicability of data envelopment analysis to efficiency measurement of container ports. IAME Panama 2002 Conference Proceedings

  41. Lin LC, Tseng LA (2005) Application of DEA and SFA on the measurement of operating efficiencies for 27 international container ports. Proceed East Asia Soc Transp Stud 5:592–607

    Google Scholar 

  42. Yu MM, Lin ETJ (2008) Efficiency and effectiveness in railway performance using a multi-activity network DEA model. OMEGA Int J Manag Sci 36:1005–1017

    Article  Google Scholar 

  43. Barnum DT, McNeil S, Hart J (2007) Comparing the efficiency of public transportation subunits using data envelopment analysis. J Public Transp 10(2)

  44. Oum TH, Waters WG, Yu C (1999) A survey of productivity and efficiency measurement in rail transport. J Transp Econ Policy 33(Part 1):9–42

    Google Scholar 

  45. Coelli TJ, Rao DSP, O’Donnell CJ, Battese GE (2005) An introduction to efficiency and productivity analysis, 2nd edn. Springer, New York

    MATH  Google Scholar 

  46. Coelli TJ (1996) A guide to DEAP version 2.1: a data envelopment analysis (computer) program. Centre for Efficiency and Productivity Analysis (CEPA). Working Paper N. 8/96

  47. Erkoce TE (2012) Estimation methodology of economic efficiency: stochastic frontier analysis vs data envelopment analysis. Int J Acad Res Econ Manag Sci 1(1)

  48. Botasso A, Conti M (2011) Quantitative techinques for regulatory benchmarking. Cerre (Centre on regulation in Europe) study

  49. EUROPEAN COMMISSION, JRC, IPTS: Trans-tools model. http://energy.jrc.ec.europa.eu/transtools/

  50. RAPIDIS, Traffic analyst. http://www.rapidis.com/products/traffic-analyst/

  51. EUROPEAN COMMISSION, ETIS plus Project. http://www.etisplus.eu/default.aspx

Download references

Disclaimer

The views expressed are purely those of the authors and may not in any circumstances be regarded as stating an official position of the European Commission.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to F. Rotoli.

Additional information

This article is part of the Topical Collection on Accessibility and Policy Making

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Rotoli, F., Navajas Cawood, E. & Christidis, P. A Data Envelopment Analysis approach for accessibility measures: Simulating operational enhancement scenarios for railway across Europe. Eur. Transp. Res. Rev. 7, 18 (2015). https://doi.org/10.1007/s12544-015-0167-3

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s12544-015-0167-3

Keywords