Identifying financial statement frauds via machine learning: A comparative analysis based on Chinese listed companies

Yue Chen; Guanming He

Previous Article in event

Current Trends and Challenges in the Selective Adoption of Central Bank Digital Currencies (CBDCs)

Previous Article in session

The Transformative Role of AI in Financial Reporting and auditing: Opportunities and Risks

Next Article in event

Graph- and machine-learning-based framework for short-selling risk assessment

Next Article in session

The Readability Level in Annual Reports of Chinese Listed Companies and the Manipulative Behaviors of Managers for Self-Serving Incentives

Identifying financial statement frauds via machine learning: A comparative analysis based on Chinese listed companies

Yue Chen

Guanming He

¹ Department of Accounting, Durham University, Durham, DH13LB, United Kingdom

Academic Editor: Mahmoud Elmarzouky

Published: 12 June 2025 by MDPI in The 1st International Online Conference on Risk and Financial Management session AI in Financial Reporting and Auditing

Abstract:

Purpose: The objective of this paper is to evaluate the effectiveness of the M-score and F-score in detecting financial statement fraud in the Chinese market and to develop machine learning models tailored for detecting such fraudulent activities.

Design/Methodology/Approach: We utilize the data of fraudulent cases from the CSMAR database for the period 2010-2019 and implement a random sampling by industry to match between fraudulent enterprises and non-fraudulent enterprises. Based on this sample, we first test the effectiveness of M-score and F-score in detecting financial frauds among Chinese listed companies. Next, we construct the machine learning models—Random Forest, Gradient Boosting Decision Tree (GBDT), K-Nearest Neighbor (KNN) and Support Vector Machine (SVM)—using the constituent variables of F-score and M-score, along with an additional loss indicator. The performance of these models in detecting financial frauds is then comparatively assessed.

Findings: The results reveal varying degrees of ineffectiveness of the M-score and F-score in accurately identifying financially fraudulent companies in the Chinese market. In contrast, the machine learning models show satisfactory performance, each exhibiting distinct advantages in reducing false negative and false positive rates.

Practical Implications: This research presents effective machine learning models for detecting and predicting financial statement fraud in the Chinese context, helping investors mitigate risks associated with stock investments in the Chinese stock market.

Keywords: F-score; M-score; machine learning; financial fraud detection

42 Reads
0 Recommendations

Yue Chen

Guanming He