Please login first
Developing a Multifaceted Central Bank Communication Dataset for Natural Language Processing-Driven Economic Analysis
* 1 , 2 , 3 , 4
1  Department of Development Economics, Universitas Terbuka, 15437, Indonesia
2  Department of English Language and Literature, Universitas Terbuka, 15437, Indonesia
3  Department of Business Administration, Universitas Terbuka, 15437, Indonesia
4  Department of Sharia Economics, Universitas Terbuka, 15437, Indonesia
Academic Editor: Thanasis Stengos

Abstract:

Central bank communication is a pivotal component in supporting economic and monetary policy in many countries. The efficacy of central bank communication affects market perception and the credibility of monetary policy, thus necessitating analytical tools to assess it. This study seeks to develop a dataset called CentralBankCorpus, the first multi-faceted dataset in Indonesia designed to comprehensively analyze monetary policy and central bank communication. This study employed a document analysis method with a labeling technique. It began by collecting official Bank Indonesia communication documents by means of transcription and scrapping. The collected data were further pre-processed and labeled with six linguistic tags. The dataset yields the CentralBankCorpus, comprising nearly half a million linguistically tagged tokens, spanning economic agent, topic, sentiment, transparency, key terms, and economic impact. This dataset will profoundly influence multiple facets. Academically, it will serve as the primary reference for NLP-focused research in economics, public policy, and organizational communication. Practically, it can assist Bank Indonesia in comprehending and addressing public perceptions of their policies, hence enhancing institutional accountability. This research ultimately endorses Bank Indonesia’s digital transformation through innovative application of NLP technology. Furthermore, it addresses a gap in the literature and contributes significantly to Indonesia’s economic development, while enhancing the nation’s role in the use of modern technology for policy communication at a broader level.

Keywords: Central Bank; corpus; dataset; economy; natural language processing
Comments on this paper
Currently there are no comments available.



 
 
Top