AngelPanizo's picture
Add BERTopic model
31c3f2f verified
---
tags:
- bertopic
library_name: bertopic
pipeline_tag: text-classification
---
# MARTINI_enrich_BERTopic_thepeopleslawyerchannel
This is a [BERTopic](https://github.com/MaartenGr/BERTopic) model.
BERTopic is a flexible and modular topic modeling framework that allows for the generation of easily interpretable topics from large datasets.
## Usage
To use this model, please install BERTopic:
```
pip install -U bertopic
```
You can use the model as follows:
```python
from bertopic import BERTopic
topic_model = BERTopic.load("AIDA-UPM/MARTINI_enrich_BERTopic_thepeopleslawyerchannel")
topic_model.get_topic_info()
```
## Topic overview
* Number of topics: 26
* Number of training documents: 2705
<details>
<summary>Click here for an overview of all topics.</summary>
| Topic ID | Topic Keywords | Topic Frequency | Label |
|----------|----------------|-----------------|-------|
| -1 | unvaccinated - mandates - pfizer - 2021 - freedom | 20 | -1_unvaccinated_mandates_pfizer_2021 |
| 0 | vaers - myocarditis - 2022 - died - footballer | 1183 | 0_vaers_myocarditis_2022_died |
| 1 | donetsk - russians - zelensky - ukrainian - sanctions | 206 | 1_donetsk_russians_zelensky_ukrainian |
| 2 | awakening - indoctrination - reality - freedom - god | 190 | 2_awakening_indoctrination_reality_freedom |
| 3 | pandemics - amendments - sovereignty - parliament - geneva | 139 | 3_pandemics_amendments_sovereignty_parliament |
| 4 | cashless - banknotes - debit - wallet - rbnz | 100 | 4_cashless_banknotes_debit_wallet |
| 5 | alarmists - climatologist - thunberg - greenland - co2 | 94 | 5_alarmists_climatologist_thunberg_greenland |
| 6 | conspirach - msnbc - titanic - instagrammable - download | 92 | 6_conspirach_msnbc_titanic_instagrammable |
| 7 | trump - democrats - gabbard - warmongers - overthrow | 77 | 7_trump_democrats_gabbard_warmongers |
| 8 | millionlightpaperandleaflet - berkshirefreedom - canterbury - boards - yellow | 55 | 8_millionlightpaperandleaflet_berkshirefreedom_canterbury_boards |
| 9 | treasury - trillions - bankruptcy - blackrock - lockdown | 54 | 9_treasury_trillions_bankruptcy_blackrock |
| 10 | schooling - indoctrination - pedophiles - stopworldcontrol - prisons | 50 | 10_schooling_indoctrination_pedophiles_stopworldcontrol |
| 11 | vaxxed - thimerosal - adjuvants - injected - grafene | 49 | 11_vaxxed_thimerosal_adjuvants_injected |
| 12 | thepeopleslawyeruk - tpl - cornwall - plumstead - event | 44 | 12_thepeopleslawyeruk_tpl_cornwall_plumstead |
| 13 | vaccine - mhra - broadyorkshirelaw - constables - victims | 42 | 13_vaccine_mhra_broadyorkshirelaw_constables |
| 14 | trudeau - convoys - dictatorship - protesters - romania | 39 | 14_trudeau_convoys_dictatorship_protesters |
| 15 | hamas - zionists - matisyahu - israelis - haifa | 35 | 15_hamas_zionists_matisyahu_israelis |
| 16 | bolsonaro - brasilia - moraes - protesters - inviolable | 34 | 16_bolsonaro_brasilia_moraes_protesters |
| 17 | radiation - fluorescent - antenna - 5g - milliwatts | 33 | 17_radiation_fluorescent_antenna_5g |
| 18 | parliament - petition - deaths - saveoursovereignty - wales | 31 | 18_parliament_petition_deaths_saveoursovereignty |
| 19 | gmos - additives - edible - worms - nuts | 29 | 19_gmos_additives_edible_worms |
| 20 | immigrants - texas - billion - invading - border | 27 | 20_immigrants_texas_billion_invading |
| 21 | sarscov - peramivir - parainfluenza - adenovirus - swab | 21 | 21_sarscov_peramivir_parainfluenza_adenovirus |
| 22 | chemtrails - contrails - spraying - fog - pilot | 21 | 22_chemtrails_contrails_spraying_fog |
| 23 | britishlionsforfreedom - worldwidedemonstration - cardiff - 2023 - july | 20 | 23_britishlionsforfreedom_worldwidedemonstration_cardiff_2023 |
| 24 | electricity - billing - petrol - meters - fraudsters | 20 | 24_electricity_billing_petrol_meters |
</details>
## Training hyperparameters
* calculate_probabilities: True
* language: None
* low_memory: False
* min_topic_size: 10
* n_gram_range: (1, 1)
* nr_topics: None
* seed_topic_list: None
* top_n_words: 10
* verbose: False
* zeroshot_min_similarity: 0.7
* zeroshot_topic_list: None
## Framework versions
* Numpy: 1.26.4
* HDBSCAN: 0.8.40
* UMAP: 0.5.7
* Pandas: 2.2.3
* Scikit-Learn: 1.5.2
* Sentence-transformers: 3.3.1
* Transformers: 4.46.3
* Numba: 0.60.0
* Plotly: 5.24.1
* Python: 3.10.12