Text Classification
Transformers
Safetensors
English
roberta
File size: 3,941 Bytes
8ba54ce
5bfdd35
8ba54ce
5bfdd35
 
8ba54ce
5bfdd35
 
8ba54ce
5bfdd35
 
 
 
 
8ba54ce
5bfdd35
 
8ba54ce
5bfdd35
8ba54ce
5bfdd35
 
8ba54ce
5bfdd35
8ba54ce
5bfdd35
8ba54ce
5bfdd35
8ba54ce
5bfdd35
8ba54ce
5bfdd35
8ba54ce
5bfdd35
8ba54ce
5bfdd35
8ba54ce
5bfdd35
8ba54ce
5bfdd35
8ba54ce
5bfdd35
8ba54ce
5bfdd35
8ba54ce
5bfdd35
8ba54ce
5bfdd35
8ba54ce
5bfdd35
 
8ba54ce
5bfdd35
 
 
 
8ba54ce
5bfdd35
 
8ba54ce
5bfdd35
 
 
 
 
 
8ba54ce
5bfdd35
 
8ba54ce
5bfdd35
8ba54ce
5bfdd35
 
8ba54ce
5bfdd35
 
8ba54ce
5bfdd35
 
8ba54ce
5bfdd35
 
8ba54ce
5bfdd35
8ba54ce
5bfdd35
8ba54ce
5bfdd35
 
8ba54ce
5bfdd35
8ba54ce
5bfdd35
8ba54ce
5bfdd35
8ba54ce
5bfdd35
8ba54ce
5bfdd35
 
 
 
 
 
 
8ba54ce
5bfdd35
8ba54ce
5bfdd35
8ba54ce
5bfdd35
8ba54ce
5bfdd35
8ba54ce
5bfdd35
8ba54ce
5bfdd35
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
---
license: cc-by-nc-sa-4.0

datasets:
- gtfintechlab/central_bank_of_chile

language:
- en

metrics:
- accuracy
- f1
- precision
- recall

base_model:
- roberta-base

pipeline_tag: text-classification

library_name: transformers
---

# World of Central Banks Model

**Model Name:** Central Bank of Chile Uncertainty Estimation Model

**Model Type:** Text Classification

**Language:** English

**License:** [CC-BY-NC-SA 4.0](https://creativecommons.org/licenses/by-nc-sa/4.0/deed.en)

**Base Model:** [roberta-base](https://huggingface.co/FacebookAI/roberta-base)

**Dataset Used for Training:** [gtfintechlab/central_bank_of_chile](https://huggingface.co/datasets/gtfintechlab/central_bank_of_chile)

## Model Overview

Central Bank of Chile Uncertainty Estimation Model is a fine-tuned roberta-base model designed to classify text data on **Uncertain Estimation**. This label is annotated in the central_bank_of_chile dataset, which focuses on meeting minutes for the Central Bank of Chile.

## Intended Use

This model is intended for researchers and practitioners working on subjective text classification for the Central Bank of Chile, particularly within financial and economic contexts. It is specifically designed to assess the **Uncertain Estimation** label, aiding in the analysis of subjective content in financial and economic communications.

## How to Use

To utilize this model, load it using the Hugging Face `transformers` library:

```python
from transformers import pipeline, AutoTokenizer, AutoModelForSequenceClassification, AutoConfig

# Load tokenizer, model, and configuration
tokenizer = AutoTokenizer.from_pretrained("gtfintechlab/central_bank_of_chile", do_lower_case=True, do_basic_tokenize=True)
model = AutoModelForSequenceClassification.from_pretrained("gtfintechlab/central_bank_of_chile", num_labels=2)
config = AutoConfig.from_pretrained("gtfintechlab/central_bank_of_chile")

# Initialize text classification pipeline
classifier = pipeline('text-classification', model=model, tokenizer=tokenizer, config=config, framework="pt")

# Classify Uncertain Estimation
sentences = [
    "[Sentence 1]",
    "[Sentence 2]"
]
results = classifier(sentences, batch_size=128, truncation="only_first")

print(results)
```

In this script:

- **Tokenizer and Model Loading:**  
  Loads the pre-trained tokenizer and model from `gtfintechlab/central_bank_of_chile`.

- **Configuration:**  
  Loads model configuration parameters, including the number of labels.

- **Pipeline Initialization:**  
  Initializes a text classification pipeline with the model, tokenizer, and configuration.

- **Classification:**  
  Labels sentences based on **Uncertain Estimation**.

Ensure your environment has the necessary dependencies installed.

## Label Interpretation

- **LABEL_0:** Certain; indicates that the sentence presents information definitively.
- **LABEL_1:** Uncertain; indicates that the sentence presents information with speculation, possibility, or doubt.

## Training Data

The model was trained on the central_bank_of_chile dataset, comprising annotated sentences from the Central Bank of Chile meeting minutes, labeled by **Uncertain Estimation**. The dataset includes training, validation, and test splits.

## Citation

If you use this model in your research, please cite the central_bank_of_chile:

```bibtex
@article{WCBShahSukhaniPardawala,
  title={Words That Unite The World: A Unified Framework for Deciphering Global Central Bank Communications},
  author={Agam Shah, Siddhant Sukhani, Huzaifa Pardawala et al.},
  year={2025}
}
```

For more details, refer to the [central_bank_of_chile dataset documentation](https://huggingface.co/gtfintechlab/central_bank_of_chile).

## Contact

For any central_bank_of_chile related issues and questions, please contact:

- Huzaifa Pardawala: huzaifahp7[at]gatech[dot]edu

- Siddhant Sukhani: ssukhani3[at]gatech[dot]edu

- Agam Shah: ashah482[at]gatech[dot]edu