Federated Split Learning with Large Language Models Integration: A Study on Potential Container Source Identification in Sea-Rail Intermodal Transport

Authors

  • Weiguang Ma Department of Information Management, School of Economics and Management, Beijing Jiaotong  University, Beijing, China
  • Lei Huang Department of Information Management, School of Economics and Management, Beijing Jiaotong  University, Beijing, China
  • Qianyao Zhang Department of Information Management, School of Economics and Management, Beijing Jiaotong  University, Beijing, China
  • Ying Wang Department of Information Management, School of Economics and Management, Beijing Jiaotong  University, Beijing, China
  • Xiong Zhang Department of Information Management, School of Economics and Management, Beijing Jiaotong  University, Beijing, China
  • Rongjia Song Experimental Center of Data Science and Intelligent Decision Making, Department of Information Management, School of Management Hangzhou Dianzi University, Hangzhou, China

DOI:

https://doi.org/10.15837/ijccc.2026.3.7263

Keywords:

federated learning, segmentation learning, large language model, sea-rail intermodal transport, privacy protection

Abstract

To address the challenge of potential container source identification in sea-rail intermodal transport scenarios, which arises from data silos and privacy barriers among multiple stakeholders, this paper proposes FSL-Qwen, a Federated Split Learning framework integrated with Large Language Models. Innovative to this framework is the vertical partitioning of the Qwen model at the embedding layer: clients (e.g., ports, railways, customs) deploy only lightweight embedding layers for local feature extraction, while the server retains the Transformer backbone for centralized reasoning. This architecture decouples local computation from inference, theoretically reducing client-side complexity to O(1) and drastically minimizing communication overhead compared to standard Federated Learning. To resolve cross-domain semantic heterogeneity, a ChatML-based semantic alignment mechanism is introduced, enabling collaborative inference without sharing raw records. Privacy analysis demonstrates that the framework achieves inherent structural isolation, converting data reconstruction attacks into blind inverse problems. Experiments on a dataset of 48,800 SRIT container data confirm that FSL-Qwen achieves a predictive accuracy of 94.0% and an F1-score of 94.1%, effectively matching the centralized upper bound while limiting client-side memory usage to merely 0.26 GB. These results validate FSL-Qwen as a robust, efficient, and privacy-preserving paradigm for intelligent logistics decision-making.

References

Luo, Y. (2024). Paradigm shift and theoretical implications for the era of global disorder, Journal of International Business Studies, 55(2), 127-135, 2024. https://doi.org/10.1057/s41267-023-00659-2

McKibbin, W.; Fernando, R. (2023). The global economic impacts of the COVID-19 pandemic, Economic Modelling, 129, 106551, 2023. https://doi.org/10.1016/j.econmod.2023.106551

Liu, W.; Wang, L.; Yan, B.; Zhu, X.; Liu, Z. (2025). Integrated optimization of dynamic deployment and scheduling for rail-mounted gantry cranes in sea-rail intermodal port with on-dock rails, Transportation Research Part E: Logistics and Transportation Review, 202, 104312, 2025. https://doi.org/10.1016/j.tre.2025.104312

Shenoy, D.; Bhat, R., ; Krishna Prakasha, K. (2025). Exploring privacy mechanisms and metrics in federated learning, Artificial Intelligence Review, 58(8), 223, 2025. https://doi.org/10.1007/s10462-025-11170-5

Wang, Y.; Wu, Y.; Hao, C.; Hong, C. (2025). Research on the scheduling in sea-rail intermodal trains based on full-length and full-occupied strategy, Research in Transportation Business & Management, 59, 101301, 2025. https://doi.org/10.1016/j.rtbm.2025.101301

Feng, C.; Lei, Y. (2024). Research on interval prediction method of railway freight based on big data and TCN-BiLSTM-QR, IET Intelligent Transport Systems, 18(12), 2713-2724, 2024. https://doi.org/10.1049/itr2.12531

Montoya, G.; Lozano-Garzón, C.; Paternina-Arboleda, C.; Donoso, Y. (2025). A Mathematical Optimization Approach for Prioritized Services in IoT Networks for Energy-constrained Smart Cities, International Journal of Computers Communications & Control, 20(1). https://doi.org/10.15837/ijccc.2025.1.6912

Zhang, X.; Mavromatis, A.; Vafeas, A.; Nejabati, R.; Simeonidou, D. (2023). Federated feature selection for horizontal federated learning in IoT networks, IEEE Internet of Things Journal, 10(11), 10095-10112, 2023. https://doi.org/10.1109/JIOT.2023.3237032

Gulati, S.; Guleria, K.; Goyal, N.; Alzubi, A. A.; Castilla, A. K. (2024). A privacy-preserving collaborative federated learning framework for detecting retinal diseases, IEEE Access, 12, 170176- 170203, 2024. https://doi.org/10.1109/ACCESS.2024.3493946

Gong, M.; Zhang, Y.; Gao, Y.; Qin, A. K.; Wu, Y.; Wang, S.; Zhang, Y. (2023). A multi-modal vertical federated learning framework based on homomorphic encryption, IEEE Transactions on Information Forensics and Security, 19, 1826-1839, 2023. https://doi.org/10.1109/TIFS.2023.3340994

Vepakomma, P.; Gupta, O.; Swedish, T.; Raskar, R. (2018). Split learning for health: Distributed deep learning without sharing raw patient data, arXiv preprint, arXiv:1812.00564, 2018.

Xue, R.; Xue, K.; Zhu, B.; Luo, X.; Zhang, T.; Sun, Q.; Lu, J. (2023). Differentially private federated learning with an adaptive noise mechanism, IEEE Transactions on Information Forensics and Security, 19, 74-87, 2023. https://doi.org/10.1109/TIFS.2023.3318944

Alqazzaz, A. (2025). Federated Learning with Homomorphic Encryption: A Privacy-Preserving Solution for Smart Cities, International Journal of Computational Intelligence Systems, 18(1), 304, 2025. https://doi.org/10.1007/s44196-025-00829-0

Piccialli, F.; Chiaro, D.; Qi, P.; Bellandi, V.; Damiani, E. (2025). Federated and edge learning for large language models, Information Fusion, 117, 102840, 2025. https://doi.org/10.1016/j.inffus.2024.102840

Zhou, H.; Inkpen, D.; Kantarci, B. (2024). Evaluating and mitigating gender bias in generative large language models, International Journal of Computers Communications & Control, 19(6). https://doi.org/10.15837/ijccc.2024.6.6853

Dantas, P. V.; Cordeiro, L. C.; Junior, W. S. (2025). A review of state-of-the-art techniques for large language model compression, Complex & Intelligent Systems, 11(9), 407, 2025. https://doi.org/10.1007/s40747-025-02019-z

Putrama, I. M.; Martinek, P. (2024). Heterogeneous data integration: Challenges and opportunities, Data in Brief, 56, 110853, 2024. https://doi.org/10.1016/j.dib.2024.110853

Mokhtari, Z.; Amani-Beni, M.; Asgarian, A.; Russo, A.; Qureshi, S.; Karami, A. (2023). Spatial prediction of the urban inter-annual land surface temperature variability: An integrated modeling approach in a rapidly urbanizing semi-arid region, Sustainable Cities and Society, 93, 104523, 2023. https://doi.org/10.1016/j.scs.2023.104523

Peng, T.; Gan, M.; Ou, Q.; Yang, X.; Wei, L.; Rødal Ler, H.; Yu, H. (2024). Railway cold chain freight demand forecasting with graph neural networks: A novel GraphARMA-GRU model, Expert Systems with Applications, 255, 124693, 2024. https://doi.org/10.1016/j.eswa.2024.124693

Liang, X. (2025). Cross-border logistics risk warning system based on federated learning, Scientific reports, 15(1), 39131, 2025. https://doi.org/10.1038/s41598-025-25507-1

Zheng, J.; Chen, Y.; Lai, Q. (2024). PPSFL: Privacy-Preserving Split Federated Learning for heterogeneous data in edge-based Internet of Things, Future Generation Computer Systems, 156, 231-241, 2024. https://doi.org/10.1016/j.future.2024.03.020

Qin, J.; Zhang, X.; Liu, B.; Qian, J. (2023). A split-federated learning and edge-cloud based efficient and privacy-preserving large-scale item recommendation model, Journal of Cloud Computing, 12(1), 57, 2023. https://doi.org/10.1186/s13677-023-00435-5

Li, Y.; Yan, Y.; Tong, Z.; Wang, Y.; Yang, Y.; Bai, M., et al. (2025). Efficient fine-tuning of small-parameter large language models for biomedical bilingual multi-task applications, Applied Soft Computing, 175, 113084, 2025. https://doi.org/10.1016/j.asoc.2025.113084

Huang, L.; Jiang, D.; Zhang, X.; Wang, Y.; Bai, T. (2025). VFed-PU: Identifying Containers with Potential to be Shipped by Rail from Ports with Privacy Protection, Tehnički vjesnik, 32(2), 485-494, 2025. https://doi.org/10.17559/TV-20240601001729

He, Y.; Tan, X.; Ni, J.; Yang, L. T.; Deng, X. (2022). Differentially private set intersection for asymmetrical id alignment, IEEE Transactions on Information Forensics and Security, 17, 3479-3494, 2022. https://doi.org/10.1109/TIFS.2022.3207911

Romanini, D.; Hall, A. J.; Papadopoulos, P.; Titcombe, T.; Ismail, A.; Cebere, T.; Hoeh, M. A. (2021). Pyvertical: A vertical federated learning framework for multi-headed splitnn, arXiv preprint, arXiv:2104.00489, 2021.

Xu, X.; Yang, M.; Yi, W.; Li, Z.; Wang, J.; Hu, H.; Liu, Y. (2024). A stealthy wrongdoer: Featureoriented reconstruction attack against split learning, Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 12130-12139, 2024. https://doi.org/10.1109/CVPR52733.2024.01153

Qiu, Y.; Liu, Y.; Yu, H.; Fang, H.; Chen, B.; Xia, S. T.; Xu, K. (2025). Revisiting the Privacy Risks of Split Inference: A GAN-Based Data Reconstruction Attack via Progressive Feature Optimization, arXiv preprint, arXiv:2508.20613, 2025.

Zhu, L.; Liu, Z.; Han, S. (2019). Deep leakage from gradients, Advances in neural information processing systems, 32, 2019.

Vepakomma, P.; Swedish, T.; Raskar, R.; Gupta, O.; Dubey, A. (2018). No peek: A survey of private distributed deep learning, arXiv preprint, arXiv:1812.03288, 2018.

Yang, W.; Wang, S.; Wu, D.; Cai, T.; Zhu, Y.; Wei, S.; Li, Y. (2025). Deep learning model inversion attacks and defenses: a comprehensive survey, Artificial Intelligence Review, 58(8), 242, 2025. https://doi.org/10.1007/s10462-025-11248-0

McMahan, B.; Moore, E.; Ramage, D.; Hampson, S.; y Arcas, B. A. (2017). Communicationefficient learning of deep networks from decentralized data, Artificial intelligence and statistics, PMLR 1273-1282, 2017.

Krauth, M.; Ribesmeier, M.; Bešinović, N. (2025). Optimising mode choice in a bi-modal freight network considering sustainability and urban logistic stakeholder perspectives, Transportation Research Interdisciplinary Perspectives, 31, 101442, 2025. https://doi.org/10.1016/j.trip.2025.101442

Additional Files

Published

2026-05-26

Most read articles by the same author(s)

Obs.: This plugin requires at least one statistics/report plugin to be enabled. If your statistics plugins provide more than one metric then please also select a main metric on the admin's site settings page and/or on the journal manager's settings pages.