Enhancing Going Concern Prediction Models: Integrating Text Mining with Data Mining Approaches

Document Type : Original Article

Authors

1 Department of Accounting, Bonab Branch, Islamic Azad University, Bonab, Iran

2 Department of Accounting, Bonab Branch, Islamic Azad University, Bonab, Iran.

3 Department of Accounting, Urmia Branch, Islamic Azad University, Urmia, Iran

Abstract

The linguistic features embedded within business unit information play a crucial role in effectively conveying economic realities, a consideration increasingly recognized in accounting and behavioral finance research. This study endeavors to assess the predictive capacity of companies' going concern status by integrating structured and unstructured data, while also evaluating the impact of incorporating unstructured variables into traditional data mining models. Spanning from 2012 to 2021, the study encompasses a sample of 540 company years listed on the Tehran Stock Exchange. Tone analysis of auditor reports was conducted using models by Mayew et al. (2015) and Visvanathan (2021), while MAXQDA 20 text analysis software and the Loughran and McDonald (2015) dictionary facilitated data processing. Data analysis and hypothesis testing were performed using the logit regression model and the Vuong test. The findings support the first hypothesis, indicating that the text-based model yields a higher coefficient of determination compared to the data-based approach. Moreover, the second hypothesis reveals a significant discrepancy in the explanatory power between the data-based and integrated text-based models within companies

Keywords

Main Subjects


©2024 The author(s). This is an open access article distributed under Creative Commons Attribution 4.0 International License (CC BY 4.0)

1. Altman, E.I. (1968). Financial ratios, discriminant analysis, and the prediction of corporate bankruptcy. Journal of Finance, 23(4), pp. 589-609. https://doi.org/10.2307/2978933
2. Altman, E.I., Sabato, G., and Wilson, N. (2010). The value of nonfinancial information in small and medium-sized enterprise risk management. Journal of Credit Risk, 2(1), pp. 95-127.https://api.semanticscholar.org/CorpusID:168632861
3. Banimahd, B., Arabi, M. and Hasanpor, Sh. (2016). Experimental research and methodology in accounting, Tehran, Termeh Publications. https://www.termehbook.com/product/9789649785547/
4. Beaver, W. H., McNichols, M. F., and Rhie, J. W. (2005). Have financial statements become less informative? Evidence from the ability of financial ratios to predict bankruptcy. Review of Accounting Studies, 10(1), pp. 93-122. https://doi.org/10.2139/ssrn.634921
5. Bozorg Asl, M., marfo, M., and mahannejad, M. (2021). The Effect of Financial Reporting Tone on Audit Fees of Listed Companies in Tehran Stock Exchange. Empirical Studies in Financial Accounting, 18(72), pp. 79-107. https://doi.org/10.22054/qjma.2021.57363.2211
6. Brown, S. V. and J. W. Tucker. (2011). Large-Sample Evidence on Firms’ Year-over-Year MD&A Modifications. Journal of Accounting Research. 49(2). pp. 309-346. https://doi.org/10.1111/j.1475-679X.2010.00396.x
7. Campbell, J. Y., Hilscher, J., and Szilagyi, J. (2008). In search of distress risk. The Journal of Finance, 63 (6), pp. 2899-2939. https://doi.org/10.1111/j.1540-6261.2008.01416.x
8. Carson, E., Fargher, N.L., Geiger, M.A., Lennox, C., Raghunandan, K., and Willekens, M. (2013). Audit Reporting for Going-Concern Uncertainty: A Research Synthesis. Auditing-a Journal of Practice & Theory, 32(1), pp. 353-384. https://doi.org/10.2308/AJPT-50324
9. Cole, C. J., and Jones, C. L. (2005). Management discussion and analysis: A review and implications for future research. Journal of Accounting Literature, 24(1), pp. 135-174.
https://www.proquest.com/openview/ce689fdf754d0f04d7b8e555e6e5cbe0/1?pq-origsite=gscholar&cbl=31366
10. Davis, A. K., Ge, W., Matsumoto, D., and Zhang, J. L. (2015). The effect of manager-specific optimism on the tone of earnings conference calls. Review of Accounting Studies, 20(2), pp. 639–673. https://doi.org/10.1007/s11142-014-9309-4
11. Dey, L., Meisheri, H., and Verma, I. (2017). Predictive Analytics with Structured and Unstructured data - a Deep Learning based Approach. IEEE. Informatics Bull., 18(1), pp. 27-34. https://api.semanticscholar.org/CorpusID:52012652
12. Elsayed, M., and Elshandidy, T (2020). Do narrative-related disclosures predict corporate failure? Evidence from UK nonfinancial publicly quoted firms. International Review of Financial Analysis, 71(1), A. 101555. https://doi.org/10.1016/j.irfa.2020.101555
13. Feldman, R., Govindaraj, S., Livnat, J., and Segal, B. (2010). Management's tone change, post earnings announcement drift and accruals. Review of Accounting Studies, 15(4), pp. 915–953. https://doi.org/10.1007/s11142-009-9111-x
14. Garcia, D. (2013). Sentiment during recessions. The Journal of Finance, 68(3), pp. 1267–1300. https://doi.org/10.1111/jofi.12027
15. Grunert, J.P., Norden, L. and Weber, M. (2005). The role of nonfinancial factors in internal credit ratings. Journal of Banking and Finance, 29(2), pp. 509-531. https://doi.org/10.1016/j.jbankfin.2004.05.017
16. Gutierrez, E., Krupa, J., Minutti-Meza, M., and Vulcheva, M. (2020). Do going concern opinions provide incremental information to predict corporate defaults?. Review of Accounting Studies, 25(4), pp. 1344-1381. https://doi.org/10.1007/s11142-020-09544-x
17. Hamer, M.M. (1983). Failure prediction: sensitivity of classification accuracy to alternative statistical methods and variable sets. Journal of Accounting and Public Policy, 2(4), pp. 289-307. https://doi.org/10.1016/0278-4254(83)90032-7
18. Hearst, M. (2003). What is text mining. SIMS, UC Berkeley, 5.https://www.jaist.ac.jp/~bao/MOT-Ishikawa/FurtherReadingNo1.pdf
19. Henry, E. (2008). Are investors influenced by how earnings press releases are written?. The Journal of Business Communication, 45(4), pp. 363-407. https://doi.org/10.1177/0021943608319388
20. Huang, X., Teoh, S. H., and Zhang, Y. (2014). Tone management. The Accounting Review,  89(3), pp. 1083–1113. https://doi.org/10.2308/accr-50684
21. Jo, N., and Shin, K. (2016). Bankruptcy prediction modeling using qualitative information based on big data analytics. Journal of Intelligence and Information Systems, 22(2), pp. 33-56.https://doi.org/10.13088/jiis.2016.22.2.033
22. Kashanipoor, M., Aghaee, M.A., and Mohseni Namaghi, D. (2020). Information Disclosure Tone and Future Performance. Accounting and Auditing Review, 26(4), pp. 570-594. (In Persian). https://doi.org/10.22059/acctgrev.2020.278084.1008146
23. Kou, J. (2022). Analysing Housing Price in Australia with Data Science Methods (Doctoral dissertation, Victoria University). https://vuir.vu.edu.au/id/eprint/43940
24. Kumar, L., and Bhatia, P. (2013). Text mining: concepts, process, and applications. Journal of Global Research in Computer Sciences, 4(3), pp. 36-39. https://api.semanticscholar.org/CorpusID:58813172
25. Law, K. K., and Mills, L. F. (2015). Taxes and financial constraints: Evidence from linguistic cues. Journal of Accounting Research, 53(4), pp. 777–819. https://doi.org/10.1111/1475-679X.12081
26. Li, F. (2010). The information content of forward‐looking statements in corporate filings—A naïve Bayesian machine learning approach. Journal of accounting research, 48(5), pp. 1049-1102. https://doi.org/10.1111/j.1475-679X.2010.00382.x
27. Li, F. (2011). Textual Analysis of Corporate Disclosures: A Survey of the Literature. Journal of Accounting Literature, 29(1), pp. 143-165. http://www.cuhk.edu.hk/acy2/workshop/20110215FengLI/Paper1.pdf
28. Li, Y., and Wang, Y. (2017). Machine learning methods of bankruptcy prediction using accounting ratios. Open Journal of Business and Management, 6(1), pp. 1-20. https://doi.org/10.4236/ojbm.2018.61001
29. Leshno, M., and Spector, Y. (1996). Neural network prediction analysis: the bankruptcy case. Neurocomputing, 10(2), pp. 125-147. https://doi.org/10.1016/0925-2312(94)00060-3
30. Levin, A., Lin, C.F., and Chu, C-S.J. (2002). Unit root tests in panel data. Journal of Econometrics, 108(1), pp. 1-24. https://doi.org/10.1016/S0304-4076(01)00098-7
31. Lohmann, C., and Ohliger, T. (2020). Bankruptcy prediction and the discriminatory power of annual reports: empirical evidence from financially distressed German companies. Journal of Business Economics, 90(1), pp. 137-172. https://doi.org/10.1007/s11573-019-00938-1
32. Lopatta, K., Gloger, M. A., and Jaeschke, R. (2017). Can language predict bankruptcy? The explanatory power of tone in 10‐K filings. Accounting Perspectives, 16(4), pp. 315-343. https://doi.org/10.1111/1911-3838.12150
33. Loughran, T., and McDonald, B. (2011). When is a liability not a liability? Textual analysis, dictionaries, and 10-Ks. The Journal of Finance, 66(1), pp. 35–65. https://doi.org/10.1111/j.1540-6261.2010.01625.x
34. Loughran, T., and McDonald, B. (2015). The Use of Word Lists in Textual Analysis. Journal of Behavioral Finance, 16(1), pp. 1–11. https://doi.org/10.1080/15427560.2015.1000335
35. Mayew, W.J., Sethuraman, M., and Venkatachalam, M. (2015). MD&A Disclosure and the Firm's Ability to Continue as a Going Concern. The Accounting Review, 90(4), pp. 1621-1651. https://doi.org/10.2308/accr-50983
36. Mirali, M., Gholami Moghaddam, F., and Hesarzadeh, R. (2018). Investigation of the Relationship between Financial Reporting Tone with Future Corporate Performance and Market Return. Financial Accounting Knowledge, 5(3), pp. 81-98. (In Persian). https://doi.org/10.30479/jfak.2018.1513
37. Mohammad, S. M., and Turney, P. D. (2013). Crowdsourcing a word–emotion association 41 RESEARCH ARTICLE
Hamid Abbaskhani et al. IJAAF; Vol. 8 No. 3 Summer 2024, pp: 27-42 lexicon. Computational intelligence, 29(3), pp. 436-465. https://doi.org/10.1111/j.1467-8640.2012.00460.x
38. Namazi, M., Hajiha, Z., & Chenaribokat, H. (2018). Modeling and Identifying Hierarchy of the Effective Measures of the Earning Management on the Prediction of the Bankruptcy. Financial Management Strategy, 6(4), pp. 1-27. https://doi.org/10.22051/jfm.2018.13604.1257
39. Najafzadeh, A., Farzinvash, A. A., Yousefi sheikh robaat, M., and Elahi, N. (2024). The Mechanism of Fiscal Policy Transfer in the Economy: Evidence of the Asymmetric Behavior of the Fiscal Expenditure Multiplier During Business Cycles. The Journal of Economic Studies and Policies, 10(2), pp.157-187. https://doi.org/10.22096/esp.2024.535691.1555
40. Nießner, T., Gross, D. H., and Schumann, M. (2022). Evidential Strategies in Financial Statement Analysis: A Corpus Linguistic Text Mining Approach to Bankruptcy Prediction. Journal of Risk and Financial Management, 15(10), 459.
https://doi.org/10.3390/jrfm15100459
41. Odom, M.D., and Sharda, R. (1990). A neural network model for bankruptcy prediction. Joint international conference on neural networks, 2(1), pp. 163-168. https://doi.org/10.1109/IJCNN.1990.137710
42. Ohlson, J.A. (1980). Financial ratios and the probabilistic prediction of bankruptcy. Journal of accounting research, 18(1), pp. 109-131. https://doi.org/10.2307/2490395
43. Pervan, I., and Kuvek, T. (2013). The relative importance of financial ratios and nonfinancial variables in predicting insolvency. Croatian Operational Research Review, 4(1), pp. 187-197. https://hrcak.srce.hr/file/143355
44. Rowland, Z., Kasych, A., and Suler, P. (2021). Prediction of financial distress: case of mining enterprises in Czech Republic. Ekonomicko-manazerske spektrum, 15(1), pp. 1-14. https://doi.org/10.26552/ems.2021.1.1-14
45. Shaw, M.J., and Gentry, J.A. (1990). Inductive learning for risk classification. IEEE Expert, 5(1), pp. 47-53. https://doi.ieeecomputersociety.org/10.1109/64.50856
46. Shin, K., Lee, T.S., and Kim, H. (2005). An application of support vector machines in the bankruptcy prediction model. Expert syst. Appl., 28(1), pp. 127-135. https://doi.org/10.1016/j.eswa.2004.08.009
47. Shumway, T. (2001). Forecasting bankruptcy more accurately: A simple hazard model. The Journal of Business, 74(1), pp.101-124. https://doi.org/10.2139/ssrn.171436
48. Siano, F., and Wysocki, P. (2021). Transfer learning and textual analysis of accounting disclosures: Applying big data methods to small (er) datasets. Accounting Horizons, 35(3), pp. 217-244. https://doi.org/10.2308/HORIZONS-19-161
49. Tetlock, P. C. (2007). Giving content to investor sentiment: The role of media in the stock market. The Journal of Finance, 62(3), pp. 1139-1168. https://doi.org/10.1111/j.1540-6261.2007.01232.x
50. Tetlock, P. C., Saar‐Tsechansky, M., and Macskassy, S. (2008). More than words: Quantifying language to measure firms' fundamentals. The Journal of Finance, 63(3), pp.1437-1467. https://doi.org/10.1111/j.1540-6261.2008.01362.x
51. Visvanathan, G. (2021). Is information in deferred tax valuation allowance useful in predicting the firm’s ability to continue as a going concern incremental to MD&A disclosures and auditor’s going concern opinions?. International Journal of Disclosure and Governance, 18(3), pp. 223-239. https://doi.org/10.1057/s41310-021-00107-3
52. Wang, C. J., Tsai, M. F., Liu, T., and Chang, C. T. (2013, October). Financial sentiment analysis for risk prediction. In Proceedings of the Sixth International Joint Conference on Natural Language Processing. pp. 802-808. http://aclweb.org/anthology/I/I13/I13-1097.pdf
53. Yekini, L. S., Wisniewski, T. P., and Millo, Y. (2016). Market reaction to the positiveness of annual report narratives. The British Accounting Review, 48(4), pp. 415–430. https://doi.org/10.1016/j.bar.2015.12.001
54. Zhang, Z., Luo, M., Hu, Z., and Niu, H. (2022). Textual Emotional Tone and Financial Crisis Identification in Chinese Companies: A Multi-Source Data Analysis Based on Machine Learning. Applied Sciences, 12(13), 6662. https://doi.org/10.3390/app12136662
55. Zhao, Y., Wei, S., Guo, Y., Yang, Q., and Kou, G. (2022). FisrEbp: Enterprise Bankruptcy Prediction via Fusing its Intra-risk and Spillover-Risk. arXiv preprint arXiv, 2202. https://api.semanticscholar.org/CorpusID:246652179
56. Zmijewski, M. E. (1984). Methodological issues related to the estimation of financial distress prediction models. Journal of Accounting Research, 22(1), pp. 59-82. https://doi.org/10.2307/2490859
CAPTCHA Image