Special Issue on Transportation Big Data and Its Applications

Xiaolei Ma

Xinqiang Chen

3 and

Zhuang Dai

School of Transportation Science and Engineering, Beihang University, Beijing 100191, China

Key Laboratory of Intelligent Transportation Technology and System, Ministry of Education, Beijing 100191, China

Institute of Institute of Logistics Science and Engineering, Shanghai Maritime University, Shanghai 201306, China

School of Transportation and Logistics, National Engineering Laboratory of Integrated Transportation Big Data Application Technology, Southwest Jiaotong University, Chengdu 611756, China

Author to whom correspondence should be addressed. Appl. Sci. 2024, 14(4), 1517; https://doi.org/10.3390/app14041517 Submission received: 19 October 2023 / Accepted: 4 February 2024 / Published: 13 February 2024 (This article belongs to the Special Issue Transportation Big Data and Its Applications)

1. Introduction

Large-scale traffic sensors are strategically deployed across various infrastructures and modes of transportation (e.g., vehicles, ships, airplanes, bridges, and traffic signals) [1,2,3]. These sensors offer a vast and readily accessible trove of traffic data, pivotal for the advancement of intelligent transportation systems (ITS) [4,5,6,7]. Additionally, diverse data sources from crowdsourcing, social media, and mapping platforms present clear opportunities for both efficient and sophisticated traffic management [8,9,10,11]. To date, various big data relevant architectures and applications (e.g., transfer learning, online learning, and edge computing) have been tailored to utilize multiple traffic source data to enhance and optimize real-time traffic operations and safety [12,13,14,15]. A notable focus has been on discerning spatial–temporal traffic patterns and predicting traffic flows across various temporal and spatial scales, aided by a range of deep learning models [16,17,18,19]. Furthermore, numerous studies aim to achieve seamless cooperation between vehicles, ships, and airplanes in a connected and intelligent traffic environment, leveraging edge computing, 5G, and lightweight machine learning models [20,21,22,23,24]. In essence, data from diverse sensors, including video, radar, and inductive loop detectors, across transportation modes (e.g., vehicles, trains, subways, ships, and airplanes) are harnessed to understand spatial–temporal mobility and commuter behavior [25,26,27,28]. Consequently, there is a pressing demand for more efficient models to discern transportation trends in the smart city era.

This Special Issue is dedicated to the exploration of knowledge and the application of big data in transportation, including big data systems and architectures (e.g., Spark and Hadoop-related traffic systems and geo-and-temporal data visualization systems), big data processing (e.g., machine learning, deep learning, edge computing, cloud computing, parallel computing, and 5G), and big data utilization (e.g., for traffic pattern discovery, collision identification, dynamic route planning, traffic demand prediction, operational efficiency optimization, urban planning, and customer service improvement). Historical data analytics, real-time traffic management, and visual data-supported analytics are all included.

2. An Overview of the Published Articles

A total of sixteen papers (fifteen research papers and one review paper) in various fields of transportation big data analytics including maritime traffic evaluation, urban traffic intersection extraction, bus–metro-transfer and ride-hailing ridership analysis, assessment of bridge passage, traffic state prediction, crowd prediction, risk measurement and forecasting, electric taxi battery swapping strategy, anomaly detection, ship classification, and tourism activity modeling are presented in this Special Issue. Hou et al. (contribution 1) presented a forecasting model based on the trip chain and entropy-maximizing theory to predict the trip distributions of tourists on suburban tourist railways. The model was able to accurately reflect the real trip distribution characteristics of tourists and can be used for the planning and construction of suburban tourist railways. Gnap et al. (contribution 2) reported a global assessment approach for bridge passage in relation to oversized and excessive road transport. They analyzed vehicle/vehicle combination parameters and assessed routes for heavy and oversized transportation in Slovakia. The authors proposed a new procedure for obtaining special permission for road use and introduced the concept of the cumulative axle load for assessing bridge passage. Kim et al. (contribution 3) developed a new method for the density analysis of Automatic Identification System (AIS) data in the coastal waters of Korea. They used spatial–temporal density analysis and standard deviation-based stretch symbolization to calculate and visualize the density distribution of different ship types. The method allowed for the identification of major maritime traffic patterns and will be valuable for confirming maritime traffic patterns and density in the coastal areas of Korea. Wang et al. (contribution 4) developed a ship classification method that integrated multiple base classifiers and effectively integrated the static and dynamic information of ships. The method outperformed individual base classifiers and achieved near real-time online classification. Gao et al. (contribution 5) proposed a method for the automatic extraction of urban road intersections using trajectory line segment intersection points. The method achieved better recognition accuracy compared to the benchmark method and effectively identified intersections in low-traffic or low-sampling areas. They used a maximum reconstruction error method to extract straight-line segments and merged them to enhance the road network structure pattern. However, the method had limitations in terms of identifying different types of intersections and when the distances between adjacent intersections were less than 60 m. Wang et al. (contribution 6) developed a short-term traffic state prediction model based on the data acquisition strategy of a mobile edge computing-assisted vehicle-to-everything (V2X) network. The model combined the advantages of a Graph Convolutional Network (GCN) and a Gated Recurrent Units (GRU) soft-attention mechanism to analyze the spatial–temporal characteristics of traffic data. The proposed model showed improved accuracy in predicting traffic flow compared to other models. Working on a similar issue, Mai et al. (contribution 7) presented a traffic prediction model called Time-Evolving Graph Convolutional Recurrent Network (TEGCRN). The model utilized time-evolving adjacency graphs to capture dynamic internode dependency and achieved superior performance compared to the baseline models, especially in short-term prediction. Zou et al. (contribution 8) proposed a hybrid model for vehicle acceleration prediction using Hidden Markov Models (HMM), Long Short-Term Memory (LSTM), and GRU. They used MHMM to divide the driving behavior semantics, evaluate the similarity, and group drivers accordingly. The results showed that the MHMM-based approach improved the prediction accuracy, and the GRU outperformed the LSTM in predicting vehicle acceleration. Tišljarić et al. (contribution 9) presented a tensor-based approach for road traffic anomaly detection. They used Speed Transition Matrices (STMs) to model the spatial–temporal traffic patterns and applied anomaly detection based on the center of mass computation. The proposed method achieved a precision score of 92.88% in detecting anomalies on urban road networks. Liu et al. (contribution 10) analyzed the impact of urban built-environment variables on bus–metro-transfer ridership using the XGBoost model. They found that the XGBoost model had a better fitting degree compared to traditional linear and nonlinear models. The model identified the relative importance of different variables and revealed the nonlinear and threshold effects of built-environment variables on ridership. The study also explored the moderating impact of station location on the relationship between the built environment and ridership. Liu et al. (contribution 11) proposed a modular battery swapping mode for electric taxis and developed a data-driven approach to configure and operate modular battery swapping stations (BSSs). Their study showed that a BSS with modular battery swapping can save on investment costs and better respond to time-of-use pricing compared to traditional battery swapping modes. Utilizing the ride-hailing order data from Chengdu, Wang et al. (contribution 12) employed the Nugget–Sill Ratio (NSR) method and the Optimal Parameter-Based Geographical Detector (OPGD) model to determine the optimal grid scale. Furthermore, they integrated the “5D” built-environment determinants to construct a model assessing the impact factors influencing ride-hailing during morning and evening peak hours. Hu et al. (contribution 13) reported the development of a dynamic graph convolutional network model (Res-DGCN) for crowd flow prediction in urban areas. The model incorporates a spatial–temporal attention module and a conditional convolution module to capture spatial and temporal dependencies in the crowd flow data. The model was trained using the Huber loss function, which improved its robustness to outliers. The experimental results showed that the Res-DGCN model outperformed the baseline models in terms of the mean absolute error and the root mean square error, demonstrating its effectiveness in crowd flow prediction tasks. Chen et al. (contribution 14) developed a driving behavior risk measurement model that calculates the risk of different driving behaviors based on four indicators: lateral stability, longitudinal stability, car-following risk, and lane-changing risk. The model used weights to combine these indicators and provided a comprehensive risk evaluation of a driver’s behavior. The study found that most drivers’ behavior fell within a normal distribution, with the majority exhibiting a risk measurement between 0.1 and 0.3. Ye et al. (contribution 15) presented a traffic accident risk prediction model based on an LSTM algorithm. The model was able to accurately forecast the accident risk by analyzing the regional risk index and capturing long-term dependencies in the data. Zhou et al. (contribution 16) reviewed the intersection between ITS sensing and edge computing applications. They discussed the recent advances in ITS sensing and identified key challenges in this field. The authors also highlighted the potential benefits of integrating edge computing with ITS sensing and proposed future research directions.

3. Conclusions

In this Special Issue, it is evident that the intersection of knowledge discovery and big data in transportation heralds a transformative era for the sector. The diverse topics explored, from advanced architectures to innovative applications, underscore the vast potential and multifaceted challenges in harnessing data for transportation’s evolution [29]. As we navigate the complexities of urbanization and the demands of modern mobility [30], the insights presented herein serve as both a compass and catalyst. We extend our gratitude to all the contributors for shedding light on these pivotal areas and anticipate that the discussions sparked will drive further research and real-world applications in the ever-evolving landscape of transportation.

Author Contributions

Writing—original draft preparation, X.M.; writing—review and editing, X.C. and Z.D. All authors have read and agreed to the published version of the manuscript.

Funding

This research is sponsored by Beijing Nova Program (No. 20230484432).

Acknowledgments

We extend our sincere appreciation to the authors and peer reviewers for their invaluable contributions to this Special Issue titled ‘Transportation Big Data and Its Applications’. Furthermore, our gratitude extends to the entire staff and all individuals involved in the realization of this Special Issue.