Estimation of CO2 Emissions From Vehicles Using Machine Learning and Multi-Model Investigation

Date

2025

Publisher

Polska Akademia Nauk

Open Access Color

GOLD

Green Open Access

No

Publicly Funded

No
Impulse
Average
Influence
Average
Popularity
Average

Abstract

This study presents a comprehensive analysis of the prediction of carbon dioxide emissions from vehicles using machine learning-based regression models. Linear regression, lasso regression, k-nearest neighbor regression, random forest, and CatBoostRegressor algorithms were systematically evaluated on a dataset of vehicle specifications and emissions data. Hyper-parameter optimization was performed using grid search, and model performance was measured using mean squared error, root mean squared error, mean absolute error, and R-squared metrics. CatBoostRegressor stood out for its high predictive accuracy, and the random forest and k-nearest neighbor models also produced notable results, whereas the linear models failed to capture complex relationships in the data. Correlation analysis showed that engine displacement, number of cylinders, and fuel consumption were strongly correlated with carbon dioxide emissions (correlation coefficients of 0.92–0.99). Comparison with the literature showed that the study is distinguished by its multi-model approach, rigorous data pre-processing, and systematic optimization. However, the geographical limitation of the dataset and the lack of dynamic variables such as driving conditions restrict the generalizability of the results. In the future, explainable artificial intelligence methods and larger datasets may overcome these limitations. By highlighting the applicability of CatBoostRegressor, this study strengthens the contribution of machine learning to environmental sustainability policy and offers a methodological contribution to the literature. © 2025 The Author(s).
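The four evaluation metrics and the correlation coefficient named in the abstract can be illustrated with a minimal sketch in plain Python. This is not the authors' code: the function names `regression_metrics` and `pearson` are hypothetical, and the numbers are made-up illustrative values, not data or results from the study.

```python
import math


def regression_metrics(y_true, y_pred):
    """Return MSE, RMSE, MAE and R-squared for paired observations."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    n = len(y_true)
    residuals = [t - p for t, p in zip(y_true, y_pred)]
    ss_res = sum(r * r for r in residuals)            # residual sum of squares
    mse = ss_res / n
    mae = sum(abs(r) for r in residuals) / n
    mean_true = sum(y_true) / n
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)  # total sum of squares
    # R-squared is undefined when the target has zero variance.
    r2 = 1.0 - ss_res / ss_tot if ss_tot > 0 else float("nan")
    return {"mse": mse, "rmse": math.sqrt(mse), "mae": mae, "r2": r2}


def pearson(x, y):
    """Return the Pearson correlation coefficient between two sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("inputs must have equal length of at least 2")
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    spread_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    spread_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (spread_x * spread_y)


# Made-up values for illustration only (not taken from the study's dataset).
fuel_consumption = [8.0, 10.0, 12.0, 14.0]   # hypothetical L/100 km
co2_true = [200.0, 250.0, 300.0, 350.0]      # hypothetical g/km
co2_pred = [210.0, 240.0, 310.0, 340.0]      # hypothetical model output

metrics = regression_metrics(co2_true, co2_pred)
correlation = pearson(fuel_consumption, co2_true)
print(metrics)
print(correlation)
```

In practice these quantities would typically be computed with an established library rather than by hand; the hand-written version is shown only to make the definitions explicit.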

Keywords

CatBoostRegressor, CO2 Emissions, Environmental Sustainability, Machine Learning, Regression Analysis

WoS Q

Q3

Scopus Q

Q3

Source

Bulletin of the Polish Academy of Sciences: Technical Sciences

Volume

73

Issue

4

Start Page

154287

End Page

154287