Text Generation
Adapters
Polish
English
Not-For-All-Audiences
hary0101 commited on
Commit
de9836b
·
verified ·
1 Parent(s): 8742b29

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +261 -0
README.md ADDED
@@ -0,0 +1,261 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: cc-by-4.0
3
+ datasets:
4
+ - Heralax/philosophy-instruct
5
+ - burgerbee/philosophy_textbook
6
+ language:
7
+ - pl
8
+ - en
9
+ metrics:
10
+ - rouge
11
+ - bleu
12
+ - perplexity
13
+ base_model:
14
+ - lodrick-the-lafted/Synthetic-Minstrel-14B
15
+ - Heralax/philosophy-mistral
16
+ - rvchi-schwenn/beit-base-patch16-224-pt22k-ft22k-finetuned-conspiracy_imagery_2
17
+ - huggingtweets/conspiracyb0t-occultb0t
18
+ pipeline_tag: text-generation
19
+ library_name: adapter-transformers
20
+ tags:
21
+ - not-for-all-audiences
22
+ ---
23
+
24
+ # Model Card for Model Misza
25
+ AI Companion Misza
26
+ ## Model Details
27
+ dataset is a curated collection of philosophical discussions, conspiracy theories, alternative history narratives, and metaphysical explorations. Designed to serve as a foundation for AI models that analyze unconventional perspectives, this dataset blends deep analytical thinking with speculative reasoning. It supports text generation, text classification, and multi-language text-based interactions in English and Polish.
28
+
29
+ ### Model Description
30
+
31
+ This dataset is designed for applications in philosophy, conspiracy theories, and alternative viewpoints. It includes structured dialogues, Q&A formats, long-form narratives, and analytical breakdowns of controversial or unconventional ideas.
32
+
33
+ Topics Include:
34
+
35
+ Philosophy: Existentialism, metaphysics, epistemology, ethics.
36
+
37
+ Conspiracy Theories: Secret societies, hidden histories, government cover-ups, Antarctica/Ice Wall, UFOs, deep-state agendas.
38
+
39
+ Alternative History: Reinterpretations of historical events, suppressed discoveries, lost civilizations.
40
+
41
+ Metaphysics and Esoteric Knowledge: Law of attraction, vibrational energy, water memory, sacred geometry.
42
+
43
+ Electromagnetic Consciousness: Theories on thought frequencies, external amplification of emotions, and mind influence.
44
+
45
+
46
+
47
+ - **Developed by:** hary0101
48
+ - **Funded by [optional]:** [More Information Needed]
49
+ - **Shared by [optional]:** [More Information Needed]
50
+ - **Model type:**
51
+ - **Language(s) (NLP):** [More Information Needed]
52
+ - **License:** cc-by-4.0
53
+ - **Finetuned from model [optional]:** [More Information Needed]
54
+
55
+ ### Model Sources [optional]
56
+
57
+ <!-- Provide the basic links for the model. -->
58
+
59
+ - **Repository:** https://huggingface.co/datasets/conspiracy
60
+ - **Paper [optional]:** https://archive.org/stream/DinahSheltonEncyclopediaOfGenocideAndCrimesAgainstHumanityVolumeONE/Dinah_Shelton_Encyclopedia_of_Genocide_and_Crimes_against_Humanity_Volume_ONE_djvu.txt,
61
+ -
62
+ - **Demo [optional]:** [More Information Needed]
63
+
64
+ ### Use
65
+ Training AI assistants with philosophical and alternative viewpoints.
66
+
67
+ Enhancing LLM-based analysis of non-mainstream narratives.
68
+
69
+ Assisting research into esoteric and suppressed knowledge.
70
+
71
+ Creating synthetic dialogues about complex or hidden topics.ended to be used, including the foreseeable users of the model and those affected by the model. -->
72
+
73
+ ### Direct Use
74
+
75
+ <!-- This section is for the model use without fine-tuning or plugging into a larger ecosystem/app. -->
76
+
77
+ [More Information Needed]
78
+
79
+ ### Downstream Use [optional]
80
+
81
+ <!-- This section is for the model use when fine-tuned for a task, or when plugged into a larger ecosystem/app -->
82
+
83
+ [More Information Needed]
84
+
85
+ ### Out-of-Scope
86
+ Scientific applications requiring strictly empirical verification.
87
+
88
+ Generating misleading or harmful misinformation.
89
+
90
+ Promoting extremism or baseless fearmongering.
91
+
92
+ [More Information Needed]
93
+
94
+ ## Bias, Risks, and Limitations
95
+
96
+ Selection Bias
97
+ The dataset is curated with a focus on alternative viewpoints, conspiracy theories, and esoteric knowledge, which may inherently introduce a selection bias. It prioritizes unconventional perspectives over mainstream academic or scientific consensus, leading to an emphasis on speculative and philosophical interpretations rather than empirical verification.
98
+
99
+ Confirmation Bias
100
+ Since the dataset contains discussions from sources that often challenge official narratives, it may reinforce specific worldviews rather than presenting balanced counterarguments. While efforts have been made to include multiple perspectives, certain topics may lean towards interpretations that validate pre-existing beliefs in conspiracy theories or alternative history.
101
+
102
+ Cultural and Linguistic Bias
103
+ The dataset primarily features English and Polish content, which may reflect Western and Slavic perspectives more prominently than those from other cultures.
104
+ Alternative theories often emerge from specific cultural, historical, or geopolitical contexts, which can influence how events and ideas are framed.
105
+ Epistemic Bias
106
+ Many of the ideas in the dataset rely on subjective interpretation, intuition, and anecdotal evidence rather than formal empirical studies.
107
+ The nature of speculative knowledge means that logical rigor and evidentiary standards can vary across different entries.
108
+ Mitigation Strategies
109
+ Users should be encouraged to cross-reference the dataset’s claims with mainstream sources and critical analyses.
110
+ AI models trained on this dataset should be fine-tuned with diverse datasets to prevent overfitting to speculative narratives.
111
+ Implementing bias-detection mechanisms can help identify when a response leans too heavily into unverified or one-sided perspectives.
112
+
113
+ Biases
114
+ The dataset includes a mix of philosophical, speculative, and conspiratorial content. Some topics may reflect subjective viewpoints rather than objective truths.
115
+ Selection bias may exist due to the dataset’s focus on alternative perspectives rather than mainstream scientific consensus.
116
+ The dataset may favor perspectives that resonate with metaphysical or alternative history communities.
117
+ Risks
118
+ Users should be aware that certain conspiracy theories can be linked to misinformation or pseudoscience. This dataset is meant for analytical exploration rather than validation of these theories.
119
+ Misinterpretation of speculative content as factual information could contribute to the spread of misleading narratives.
120
+ Some discussions may include controversial topics that require careful handling to avoid reinforcing harmful beliefs.
121
+ Limitations
122
+ The dataset does not claim to provide verifiable historical facts but rather presents alternative interpretations.
123
+ It is not suitable for scientific research that demands strict empirical validation.
124
+ Some areas of discussion may lack mainstream academic sources, relying instead on community discussions, esoteric texts, or theoretical arguments.
125
+
126
+
127
+ [More Information Needed]
128
+
129
+ ### Recommendations
130
+
131
+ Users should critically evaluate responses generated from this dataset and cross-check with verified sources when needed.
132
+ The dataset is best used for AI research, philosophical debate, and creative writing rather than as a sole source of factual information.
133
+ Implementing disclaimers in AI applications using this dataset is advised to clarify its speculative nature
134
+ Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations.
135
+
136
+ ## How to Get Started with the Model
137
+
138
+ Use the code below to get started with the model.
139
+
140
+ [More Information Needed]
141
+
142
+ ## Training Details
143
+
144
+ ### Training Data
145
+
146
+ <!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
147
+
148
+ [More Information Needed]
149
+
150
+ ### Training Procedure
151
+
152
+ <!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
153
+
154
+ #### Preprocessing [optional]
155
+
156
+ [More Information Needed]
157
+
158
+
159
+ #### Training Hyperparameters
160
+
161
+ - **Training regime:** [More Information Needed] <!--fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision -->
162
+
163
+ #### Speeds, Sizes, Times [optional]
164
+
165
+ <!-- This section provides information about throughput, start/end time, checkpoint size if relevant, etc. -->
166
+
167
+ [More Information Needed]
168
+
169
+ ## Evaluation
170
+
171
+ <!-- This section describes the evaluation protocols and provides the results. -->
172
+
173
+ ### Testing Data, Factors & Metrics
174
+
175
+ #### Testing Data
176
+
177
+ <!-- This should link to a Dataset Card if possible. -->
178
+
179
+ [More Information Needed]
180
+
181
+ #### Factors
182
+
183
+ <!-- These are the things the evaluation is disaggregating by, e.g., subpopulations or domains. -->
184
+
185
+ [More Information Needed]
186
+
187
+ #### Metrics
188
+
189
+ <!-- These are the evaluation metrics being used, ideally with a description of why. -->
190
+
191
+ [More Information Needed]
192
+
193
+ ### Results
194
+
195
+ [More Information Needed]
196
+
197
+ #### Summary
198
+
199
+
200
+
201
+ ## Model Examination [optional]
202
+
203
+ <!-- Relevant interpretability work for the model goes here -->
204
+
205
+ [More Information Needed]
206
+
207
+
208
+
209
+ - **Hardware Type:** [More Information Needed]
210
+ - **Hours used:** [More Information Needed]
211
+ - **Cloud Provider:** [More Information Needed]
212
+ - **Compute Region:** [More Information Needed]
213
+ - **Carbon Emitted:** [More Information Needed]
214
+
215
+ ## Technical Specifications [optional]
216
+
217
+ ### Model Architecture and Objective
218
+
219
+ [More Information Needed]
220
+
221
+ ### Compute Infrastructure
222
+
223
+ [More Information Needed]
224
+
225
+ #### Hardware
226
+
227
+ [More Information Needed]
228
+
229
+ #### Software
230
+
231
+ [More Information Needed]
232
+
233
+ ## Citation [optional]
234
+
235
+ <!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
236
+
237
+ **BibTeX:**
238
+
239
+ [More Information Needed]
240
+
241
+ **APA:**
242
+
243
+ [More Information Needed]
244
+
245
+ ## Glossary [optional]
246
+
247
+ <!-- If relevant, include terms and calculations in this section that can help readers understand the model or model card. -->
248
+
249
+ [More Information Needed]
250
+
251
+ ## More Information [optional]
252
+
253
+ [More Information Needed]
254
+
255
+ ## Model Card Authors [optional]
256
+
257
+ [More Information Needed]
258
+
259
+ ## Model Card Contact
260
+
261
+ [More Information Needed]