mohit19906 commited on
Commit
72bc614
1 Parent(s): 1f520bc

mohit19906/falcon-7b-instruct-SBCQNAUserAssist

Browse files
README.md CHANGED
@@ -15,6 +15,8 @@ should probably proofread and complete it, then remove this comment. -->
15
  # working
16
 
17
  This model is a fine-tuned version of [tiiuae/falcon-7b-instruct](https://huggingface.co/tiiuae/falcon-7b-instruct) on an unknown dataset.
 
 
18
 
19
  ## Model description
20
 
@@ -45,6 +47,60 @@ The following hyperparameters were used during training:
45
  - num_epochs: 50
46
  - mixed_precision_training: Native AMP
47
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
48
  ### Framework versions
49
 
50
  - PEFT 0.10.0
 
15
  # working
16
 
17
  This model is a fine-tuned version of [tiiuae/falcon-7b-instruct](https://huggingface.co/tiiuae/falcon-7b-instruct) on an unknown dataset.
18
+ It achieves the following results on the evaluation set:
19
+ - Loss: 0.3190
20
 
21
  ## Model description
22
 
 
47
  - num_epochs: 50
48
  - mixed_precision_training: Native AMP
49
 
50
+ ### Training results
51
+
52
+ | Training Loss | Epoch | Step | Validation Loss |
53
+ |:-------------:|:-----:|:----:|:---------------:|
54
+ | 2.7546 | 0.95 | 5 | 2.4304 |
55
+ | 2.4793 | 1.9 | 10 | 2.1344 |
56
+ | 2.0646 | 2.86 | 15 | 1.7881 |
57
+ | 1.4266 | 4.0 | 21 | 1.4999 |
58
+ | 1.4904 | 4.95 | 26 | 1.3272 |
59
+ | 1.2694 | 5.9 | 31 | 1.1806 |
60
+ | 1.0775 | 6.86 | 36 | 1.0679 |
61
+ | 0.7663 | 8.0 | 42 | 0.9399 |
62
+ | 0.7642 | 8.95 | 47 | 0.8395 |
63
+ | 0.6273 | 9.9 | 52 | 0.7471 |
64
+ | 0.5254 | 10.86 | 57 | 0.6759 |
65
+ | 0.3501 | 12.0 | 63 | 0.5922 |
66
+ | 0.3274 | 12.95 | 68 | 0.5323 |
67
+ | 0.2703 | 13.9 | 73 | 0.4832 |
68
+ | 0.2233 | 14.86 | 78 | 0.4473 |
69
+ | 0.1652 | 16.0 | 84 | 0.4036 |
70
+ | 0.1667 | 16.95 | 89 | 0.3839 |
71
+ | 0.1492 | 17.9 | 94 | 0.3695 |
72
+ | 0.15 | 18.86 | 99 | 0.3556 |
73
+ | 0.108 | 20.0 | 105 | 0.3486 |
74
+ | 0.1237 | 20.95 | 110 | 0.3377 |
75
+ | 0.1217 | 21.9 | 115 | 0.3277 |
76
+ | 0.11 | 22.86 | 120 | 0.3206 |
77
+ | 0.09 | 24.0 | 126 | 0.3118 |
78
+ | 0.1051 | 24.95 | 131 | 0.3165 |
79
+ | 0.098 | 25.9 | 136 | 0.3173 |
80
+ | 0.0992 | 26.86 | 141 | 0.3151 |
81
+ | 0.0804 | 28.0 | 147 | 0.3185 |
82
+ | 0.0991 | 28.95 | 152 | 0.3164 |
83
+ | 0.093 | 29.9 | 157 | 0.3119 |
84
+ | 0.0943 | 30.86 | 162 | 0.3173 |
85
+ | 0.0771 | 32.0 | 168 | 0.3150 |
86
+ | 0.0887 | 32.95 | 173 | 0.3162 |
87
+ | 0.0967 | 33.9 | 178 | 0.3193 |
88
+ | 0.089 | 34.86 | 183 | 0.3131 |
89
+ | 0.0793 | 36.0 | 189 | 0.3171 |
90
+ | 0.0882 | 36.95 | 194 | 0.3203 |
91
+ | 0.0893 | 37.9 | 199 | 0.3186 |
92
+ | 0.0879 | 38.86 | 204 | 0.3165 |
93
+ | 0.073 | 40.0 | 210 | 0.3211 |
94
+ | 0.0877 | 40.95 | 215 | 0.3202 |
95
+ | 0.0893 | 41.9 | 220 | 0.3202 |
96
+ | 0.086 | 42.86 | 225 | 0.3178 |
97
+ | 0.0735 | 44.0 | 231 | 0.3175 |
98
+ | 0.0868 | 44.95 | 236 | 0.3183 |
99
+ | 0.0855 | 45.9 | 241 | 0.3187 |
100
+ | 0.0834 | 46.86 | 246 | 0.3190 |
101
+ | 0.066 | 47.62 | 250 | 0.3190 |
102
+
103
+
104
  ### Framework versions
105
 
106
  - PEFT 0.10.0
adapter_config.json CHANGED
@@ -16,7 +16,7 @@
16
  "megatron_core": "megatron.core",
17
  "modules_to_save": null,
18
  "peft_type": "LORA",
19
- "r": 8,
20
  "rank_pattern": {},
21
  "revision": null,
22
  "target_modules": [
 
16
  "megatron_core": "megatron.core",
17
  "modules_to_save": null,
18
  "peft_type": "LORA",
19
+ "r": 16,
20
  "rank_pattern": {},
21
  "revision": null,
22
  "target_modules": [
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:0ebc220dc768691467a003cff7c489dc4e8b73ed31554a6658f1a5cf113a736d
3
- size 9446600
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8d5ef5ca4a2131e813a9eca20e83ec0180716fa78070a899adf2cb779146ccd0
3
+ size 18883912
runs/Apr05_14-32-29_351216fd69aa/events.out.tfevents.1712327550.351216fd69aa.34.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4e402915b204efe42da96737fed9689ee8a3536c1086f00abafdf65be4d52bb3
3
+ size 29019
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:b553d72328623f3ad527750dd3433a9398a1599ccc7fd8dd9d77b592836ea891
3
  size 4920
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:368f90e937953ca79ceaff9a2677f728e65e9fbf5ef9d35445cf45bb2fae91b4
3
  size 4920
wandb/debug-internal.log CHANGED
The diff for this file is too large to render. See raw diff
 
wandb/debug.log CHANGED
@@ -1,37 +1,37 @@
1
- 2024-04-05 13:20:53,205 INFO MainThread:34 [wandb_setup.py:_flush():76] Current SDK version is 0.16.4
2
- 2024-04-05 13:20:53,205 INFO MainThread:34 [wandb_setup.py:_flush():76] Configure stats pid to 34
3
- 2024-04-05 13:20:53,205 INFO MainThread:34 [wandb_setup.py:_flush():76] Loading settings from /root/.config/wandb/settings
4
- 2024-04-05 13:20:53,205 INFO MainThread:34 [wandb_setup.py:_flush():76] Loading settings from /kaggle/working/wandb/settings
5
- 2024-04-05 13:20:53,205 INFO MainThread:34 [wandb_setup.py:_flush():76] Loading settings from environment variables: {}
6
- 2024-04-05 13:20:53,205 INFO MainThread:34 [wandb_setup.py:_flush():76] Applying setup settings: {'_disable_service': False}
7
- 2024-04-05 13:20:53,205 INFO MainThread:34 [wandb_setup.py:_flush():76] Inferring run settings from compute environment: {'program': '<python with no main file>'}
8
- 2024-04-05 13:20:53,205 INFO MainThread:34 [wandb_setup.py:_flush():76] Applying login settings: {'api_key': '***REDACTED***'}
9
- 2024-04-05 13:20:53,205 INFO MainThread:34 [wandb_init.py:_log_setup():526] Logging user logs to /kaggle/working/wandb/run-20240405_132053-f56jlksk/logs/debug.log
10
- 2024-04-05 13:20:53,205 INFO MainThread:34 [wandb_init.py:_log_setup():527] Logging internal logs to /kaggle/working/wandb/run-20240405_132053-f56jlksk/logs/debug-internal.log
11
- 2024-04-05 13:20:53,205 INFO MainThread:34 [wandb_init.py:_jupyter_setup():472] configuring jupyter hooks <wandb.sdk.wandb_init._WandbInit object at 0x79e74ca05d80>
12
- 2024-04-05 13:20:53,206 INFO MainThread:34 [wandb_init.py:init():566] calling init triggers
13
- 2024-04-05 13:20:53,206 INFO MainThread:34 [wandb_init.py:init():573] wandb.init called with sweep_config: {}
14
  config: {}
15
- 2024-04-05 13:20:53,206 INFO MainThread:34 [wandb_init.py:init():616] starting backend
16
- 2024-04-05 13:20:53,206 INFO MainThread:34 [wandb_init.py:init():620] setting up manager
17
- 2024-04-05 13:20:53,208 INFO MainThread:34 [backend.py:_multiprocessing_setup():105] multiprocessing start_methods=fork,spawn,forkserver, using: spawn
18
- 2024-04-05 13:20:53,210 INFO MainThread:34 [wandb_init.py:init():628] backend started and connected
19
- 2024-04-05 13:20:53,222 INFO MainThread:34 [wandb_run.py:_label_probe_notebook():1295] probe notebook
20
- 2024-04-05 13:20:53,780 INFO MainThread:34 [wandb_init.py:init():720] updated telemetry
21
- 2024-04-05 13:20:53,784 INFO MainThread:34 [wandb_init.py:init():753] communicating run to backend with 90.0 second timeout
22
- 2024-04-05 13:20:54,060 INFO MainThread:34 [wandb_run.py:_on_init():2262] communicating current version
23
- 2024-04-05 13:20:54,123 INFO MainThread:34 [wandb_run.py:_on_init():2271] got version response upgrade_message: "wandb version 0.16.6 is available! To upgrade, please run:\n $ pip install wandb --upgrade"
24
 
25
- 2024-04-05 13:20:54,124 INFO MainThread:34 [wandb_init.py:init():804] starting run threads in backend
26
- 2024-04-05 13:21:25,152 INFO MainThread:34 [wandb_run.py:_console_start():2241] atexit reg
27
- 2024-04-05 13:21:25,152 INFO MainThread:34 [wandb_run.py:_redirect():2096] redirect: wrap_raw
28
- 2024-04-05 13:21:25,153 INFO MainThread:34 [wandb_run.py:_redirect():2161] Wrapping output streams.
29
- 2024-04-05 13:21:25,153 INFO MainThread:34 [wandb_run.py:_redirect():2186] Redirects installed.
30
- 2024-04-05 13:21:25,154 INFO MainThread:34 [wandb_init.py:init():847] run started, returning control to user process
31
- 2024-04-05 13:21:25,160 INFO MainThread:34 [wandb_run.py:_config_callback():1343] config_cb None None {'vocab_size': 65024, 'hidden_size': 4544, 'num_hidden_layers': 32, 'num_attention_heads': 71, 'layer_norm_epsilon': 1e-05, 'initializer_range': 0.02, 'use_cache': False, 'hidden_dropout': 0.0, 'attention_dropout': 0.0, 'bos_token_id': 11, 'eos_token_id': 11, 'num_kv_heads': 71, 'alibi': False, 'new_decoder_architecture': False, 'multi_query': True, 'parallel_attn': True, 'bias': False, 'max_position_embeddings': 2048, 'rope_theta': 10000.0, 'rope_scaling': None, 'return_dict': True, 'output_hidden_states': False, 'output_attentions': False, 'torchscript': False, 'torch_dtype': 'bfloat16', 'use_bfloat16': False, 'tf_legacy_loss': False, 'pruned_heads': {}, 'tie_word_embeddings': True, 'chunk_size_feed_forward': 0, 'is_encoder_decoder': False, 'is_decoder': False, 'cross_attention_hidden_size': None, 'add_cross_attention': False, 'tie_encoder_decoder': False, 'max_length': 20, 'min_length': 0, 'do_sample': False, 'early_stopping': False, 'num_beams': 1, 'num_beam_groups': 1, 'diversity_penalty': 0.0, 'temperature': 1.0, 'top_k': 50, 'top_p': 1.0, 'typical_p': 1.0, 'repetition_penalty': 1.0, 'length_penalty': 1.0, 'no_repeat_ngram_size': 0, 'encoder_no_repeat_ngram_size': 0, 'bad_words_ids': None, 'num_return_sequences': 1, 'output_scores': False, 'return_dict_in_generate': False, 'forced_bos_token_id': None, 'forced_eos_token_id': None, 'remove_invalid_values': False, 'exponential_decay_length_penalty': None, 'suppress_tokens': None, 'begin_suppress_tokens': None, 'architectures': ['FalconForCausalLM'], 'finetuning_task': None, 'id2label': {0: 'LABEL_0', 1: 'LABEL_1'}, 'label2id': {'LABEL_0': 0, 'LABEL_1': 1}, 'tokenizer_class': None, 'prefix': None, 'pad_token_id': None, 'sep_token_id': None, 'decoder_start_token_id': None, 'task_specific_params': None, 'problem_type': None, '_name_or_path': 'tiiuae/falcon-7b-instruct', 'transformers_version': '4.38.2', 'apply_residual_connection_post_layernorm': False, 'auto_map': {'AutoConfig': 'tiiuae/falcon-7b-instruct--configuration_falcon.FalconConfig', 'AutoModel': 'tiiuae/falcon-7b-instruct--modeling_falcon.FalconModel', 'AutoModelForSequenceClassification': 'tiiuae/falcon-7b-instruct--modeling_falcon.FalconForSequenceClassification', 'AutoModelForTokenClassification': 'tiiuae/falcon-7b-instruct--modeling_falcon.FalconForTokenClassification', 'AutoModelForQuestionAnswering': 'tiiuae/falcon-7b-instruct--modeling_falcon.FalconForQuestionAnswering', 'AutoModelForCausalLM': 'tiiuae/falcon-7b-instruct--modeling_falcon.FalconForCausalLM'}, 'model_type': 'falcon', 'output_dir': '/kaggle/working/', 'overwrite_output_dir': False, 'do_train': False, 'do_eval': True, 'do_predict': False, 'evaluation_strategy': 'epoch', 'prediction_loss_only': False, 'per_device_train_batch_size': 6, 'per_device_eval_batch_size': 6, 'per_gpu_train_batch_size': None, 'per_gpu_eval_batch_size': None, 'gradient_accumulation_steps': 4, 'eval_accumulation_steps': None, 'eval_delay': 0, 'learning_rate': 0.0002, 'weight_decay': 0.01, 'adam_beta1': 0.9, 'adam_beta2': 0.999, 'adam_epsilon': 1e-08, 'max_grad_norm': 1.0, 'num_train_epochs': 50, 'max_steps': -1, 'lr_scheduler_type': 'linear', 'lr_scheduler_kwargs': {}, 'warmup_ratio': 0.0, 'warmup_steps': 2, 'log_level': 'passive', 'log_level_replica': 'warning', 'log_on_each_node': True, 'logging_dir': '/kaggle/working/runs/Apr05_13-20-32_77aca515e0d8', 'logging_strategy': 'epoch', 'logging_first_step': False, 'logging_steps': 500, 'logging_nan_inf_filter': True, 'save_strategy': 'epoch', 'save_steps': 500, 'save_total_limit': None, 'save_safetensors': True, 'save_on_each_node': False, 'save_only_model': False, 'no_cuda': False, 'use_cpu': False, 'use_mps_device': False, 'seed': 42, 'data_seed': None, 'jit_mode_eval': False, 'use_ipex': False, 'bf16': False, 'fp16': True, 'fp16_opt_level': 'O1', 'half_precision_backend': 'auto', 'bf16_full_eval': False, 'fp16_full_eval': False, 'tf32': None, 'local_rank': 0, 'ddp_backend': None, 'tpu_num_cores': None, 'tpu_metrics_debug': False, 'debug': [], 'dataloader_drop_last': False, 'eval_steps': None, 'dataloader_num_workers': 0, 'dataloader_prefetch_factor': None, 'past_index': -1, 'run_name': '/kaggle/working/', 'disable_tqdm': False, 'remove_unused_columns': True, 'label_names': None, 'load_best_model_at_end': True, 'metric_for_best_model': 'loss', 'greater_is_better': False, 'ignore_data_skip': False, 'fsdp': [], 'fsdp_min_num_params': 0, 'fsdp_config': {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}, 'fsdp_transformer_layer_cls_to_wrap': None, 'accelerator_config': {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True}, 'deepspeed': None, 'label_smoothing_factor': 0.0, 'optim': 'paged_adamw_8bit', 'optim_args': None, 'adafactor': False, 'group_by_length': False, 'length_column_name': 'length', 'report_to': ['tensorboard', 'wandb'], 'ddp_find_unused_parameters': None, 'ddp_bucket_cap_mb': None, 'ddp_broadcast_buffers': None, 'dataloader_pin_memory': True, 'dataloader_persistent_workers': False, 'skip_memory_metrics': True, 'use_legacy_prediction_loop': False, 'push_to_hub': False, 'resume_from_checkpoint': None, 'hub_model_id': None, 'hub_strategy': 'every_save', 'hub_token': '<HUB_TOKEN>', 'hub_private_repo': False, 'hub_always_push': False, 'gradient_checkpointing': False, 'gradient_checkpointing_kwargs': None, 'include_inputs_for_metrics': False, 'fp16_backend': 'auto', 'push_to_hub_model_id': None, 'push_to_hub_organization': None, 'push_to_hub_token': '<PUSH_TO_HUB_TOKEN>', 'mp_parameters': '', 'auto_find_batch_size': False, 'full_determinism': False, 'torchdynamo': None, 'ray_scope': 'last', 'ddp_timeout': 1800, 'torch_compile': False, 'torch_compile_backend': None, 'torch_compile_mode': None, 'dispatch_batches': None, 'split_batches': None, 'include_tokens_per_second': False, 'include_num_input_tokens_seen': False, 'neftune_noise_alpha': None}
32
- 2024-04-05 13:21:29,087 INFO MainThread:34 [jupyter.py:save_ipynb():373] not saving jupyter notebook
33
- 2024-04-05 13:21:29,087 INFO MainThread:34 [wandb_init.py:_pause_backend():437] pausing backend
34
- 2024-04-05 13:22:28,071 INFO MainThread:34 [wandb_init.py:_resume_backend():442] resuming backend
35
- 2024-04-05 13:22:28,072 INFO MainThread:34 [jupyter.py:save_ipynb():373] not saving jupyter notebook
36
- 2024-04-05 13:22:28,072 INFO MainThread:34 [wandb_init.py:_pause_backend():437] pausing backend
37
- 2024-04-05 13:22:31,094 INFO MainThread:34 [wandb_init.py:_resume_backend():442] resuming backend
 
1
+ 2024-04-05 14:32:45,563 INFO MainThread:34 [wandb_setup.py:_flush():76] Current SDK version is 0.16.4
2
+ 2024-04-05 14:32:45,563 INFO MainThread:34 [wandb_setup.py:_flush():76] Configure stats pid to 34
3
+ 2024-04-05 14:32:45,563 INFO MainThread:34 [wandb_setup.py:_flush():76] Loading settings from /root/.config/wandb/settings
4
+ 2024-04-05 14:32:45,563 INFO MainThread:34 [wandb_setup.py:_flush():76] Loading settings from /kaggle/working/wandb/settings
5
+ 2024-04-05 14:32:45,563 INFO MainThread:34 [wandb_setup.py:_flush():76] Loading settings from environment variables: {}
6
+ 2024-04-05 14:32:45,563 INFO MainThread:34 [wandb_setup.py:_flush():76] Applying setup settings: {'_disable_service': False}
7
+ 2024-04-05 14:32:45,563 INFO MainThread:34 [wandb_setup.py:_flush():76] Inferring run settings from compute environment: {'program': '<python with no main file>'}
8
+ 2024-04-05 14:32:45,563 INFO MainThread:34 [wandb_setup.py:_flush():76] Applying login settings: {'api_key': '***REDACTED***'}
9
+ 2024-04-05 14:32:45,563 INFO MainThread:34 [wandb_init.py:_log_setup():526] Logging user logs to /kaggle/working/wandb/run-20240405_143245-z6ibr5j0/logs/debug.log
10
+ 2024-04-05 14:32:45,563 INFO MainThread:34 [wandb_init.py:_log_setup():527] Logging internal logs to /kaggle/working/wandb/run-20240405_143245-z6ibr5j0/logs/debug-internal.log
11
+ 2024-04-05 14:32:45,564 INFO MainThread:34 [wandb_init.py:_jupyter_setup():472] configuring jupyter hooks <wandb.sdk.wandb_init._WandbInit object at 0x7ac638f76e30>
12
+ 2024-04-05 14:32:45,564 INFO MainThread:34 [wandb_init.py:init():566] calling init triggers
13
+ 2024-04-05 14:32:45,564 INFO MainThread:34 [wandb_init.py:init():573] wandb.init called with sweep_config: {}
14
  config: {}
15
+ 2024-04-05 14:32:45,564 INFO MainThread:34 [wandb_init.py:init():616] starting backend
16
+ 2024-04-05 14:32:45,564 INFO MainThread:34 [wandb_init.py:init():620] setting up manager
17
+ 2024-04-05 14:32:45,566 INFO MainThread:34 [backend.py:_multiprocessing_setup():105] multiprocessing start_methods=fork,spawn,forkserver, using: spawn
18
+ 2024-04-05 14:32:45,568 INFO MainThread:34 [wandb_init.py:init():628] backend started and connected
19
+ 2024-04-05 14:32:45,580 INFO MainThread:34 [wandb_run.py:_label_probe_notebook():1295] probe notebook
20
+ 2024-04-05 14:32:46,063 INFO MainThread:34 [wandb_init.py:init():720] updated telemetry
21
+ 2024-04-05 14:32:46,067 INFO MainThread:34 [wandb_init.py:init():753] communicating run to backend with 90.0 second timeout
22
+ 2024-04-05 14:32:46,313 INFO MainThread:34 [wandb_run.py:_on_init():2262] communicating current version
23
+ 2024-04-05 14:32:46,379 INFO MainThread:34 [wandb_run.py:_on_init():2271] got version response upgrade_message: "wandb version 0.16.6 is available! To upgrade, please run:\n $ pip install wandb --upgrade"
24
 
25
+ 2024-04-05 14:32:46,379 INFO MainThread:34 [wandb_init.py:init():804] starting run threads in backend
26
+ 2024-04-05 14:33:17,409 INFO MainThread:34 [wandb_run.py:_console_start():2241] atexit reg
27
+ 2024-04-05 14:33:17,409 INFO MainThread:34 [wandb_run.py:_redirect():2096] redirect: wrap_raw
28
+ 2024-04-05 14:33:17,410 INFO MainThread:34 [wandb_run.py:_redirect():2161] Wrapping output streams.
29
+ 2024-04-05 14:33:17,410 INFO MainThread:34 [wandb_run.py:_redirect():2186] Redirects installed.
30
+ 2024-04-05 14:33:17,411 INFO MainThread:34 [wandb_init.py:init():847] run started, returning control to user process
31
+ 2024-04-05 14:33:17,416 INFO MainThread:34 [wandb_run.py:_config_callback():1343] config_cb None None {'vocab_size': 65024, 'hidden_size': 4544, 'num_hidden_layers': 32, 'num_attention_heads': 71, 'layer_norm_epsilon': 1e-05, 'initializer_range': 0.02, 'use_cache': False, 'hidden_dropout': 0.0, 'attention_dropout': 0.0, 'bos_token_id': 11, 'eos_token_id': 11, 'num_kv_heads': 71, 'alibi': False, 'new_decoder_architecture': False, 'multi_query': True, 'parallel_attn': True, 'bias': False, 'max_position_embeddings': 2048, 'rope_theta': 10000.0, 'rope_scaling': None, 'return_dict': True, 'output_hidden_states': False, 'output_attentions': False, 'torchscript': False, 'torch_dtype': 'bfloat16', 'use_bfloat16': False, 'tf_legacy_loss': False, 'pruned_heads': {}, 'tie_word_embeddings': True, 'chunk_size_feed_forward': 0, 'is_encoder_decoder': False, 'is_decoder': False, 'cross_attention_hidden_size': None, 'add_cross_attention': False, 'tie_encoder_decoder': False, 'max_length': 20, 'min_length': 0, 'do_sample': False, 'early_stopping': False, 'num_beams': 1, 'num_beam_groups': 1, 'diversity_penalty': 0.0, 'temperature': 1.0, 'top_k': 50, 'top_p': 1.0, 'typical_p': 1.0, 'repetition_penalty': 1.0, 'length_penalty': 1.0, 'no_repeat_ngram_size': 0, 'encoder_no_repeat_ngram_size': 0, 'bad_words_ids': None, 'num_return_sequences': 1, 'output_scores': False, 'return_dict_in_generate': False, 'forced_bos_token_id': None, 'forced_eos_token_id': None, 'remove_invalid_values': False, 'exponential_decay_length_penalty': None, 'suppress_tokens': None, 'begin_suppress_tokens': None, 'architectures': ['FalconForCausalLM'], 'finetuning_task': None, 'id2label': {0: 'LABEL_0', 1: 'LABEL_1'}, 'label2id': {'LABEL_0': 0, 'LABEL_1': 1}, 'tokenizer_class': None, 'prefix': None, 'pad_token_id': None, 'sep_token_id': None, 'decoder_start_token_id': None, 'task_specific_params': None, 'problem_type': None, '_name_or_path': 'tiiuae/falcon-7b-instruct', 'transformers_version': '4.38.2', 'apply_residual_connection_post_layernorm': False, 'auto_map': {'AutoConfig': 'tiiuae/falcon-7b-instruct--configuration_falcon.FalconConfig', 'AutoModel': 'tiiuae/falcon-7b-instruct--modeling_falcon.FalconModel', 'AutoModelForSequenceClassification': 'tiiuae/falcon-7b-instruct--modeling_falcon.FalconForSequenceClassification', 'AutoModelForTokenClassification': 'tiiuae/falcon-7b-instruct--modeling_falcon.FalconForTokenClassification', 'AutoModelForQuestionAnswering': 'tiiuae/falcon-7b-instruct--modeling_falcon.FalconForQuestionAnswering', 'AutoModelForCausalLM': 'tiiuae/falcon-7b-instruct--modeling_falcon.FalconForCausalLM'}, 'model_type': 'falcon', 'quantization_config': {'quant_method': 'QuantizationMethod.BITS_AND_BYTES', '_load_in_8bit': False, '_load_in_4bit': True, 'llm_int8_threshold': 6.0, 'llm_int8_skip_modules': None, 'llm_int8_enable_fp32_cpu_offload': False, 'llm_int8_has_fp16_weight': False, 'bnb_4bit_quant_type': 'nf4', 'bnb_4bit_use_double_quant': True, 'bnb_4bit_compute_dtype': 'bfloat16', 'load_in_4bit': True, 'load_in_8bit': False}, 'output_dir': '/kaggle/working/', 'overwrite_output_dir': False, 'do_train': False, 'do_eval': True, 'do_predict': False, 'evaluation_strategy': 'epoch', 'prediction_loss_only': False, 'per_device_train_batch_size': 6, 'per_device_eval_batch_size': 6, 'per_gpu_train_batch_size': None, 'per_gpu_eval_batch_size': None, 'gradient_accumulation_steps': 4, 'eval_accumulation_steps': None, 'eval_delay': 0, 'learning_rate': 0.0002, 'weight_decay': 0.01, 'adam_beta1': 0.9, 'adam_beta2': 0.999, 'adam_epsilon': 1e-08, 'max_grad_norm': 1.0, 'num_train_epochs': 50, 'max_steps': -1, 'lr_scheduler_type': 'linear', 'lr_scheduler_kwargs': {}, 'warmup_ratio': 0.0, 'warmup_steps': 2, 'log_level': 'passive', 'log_level_replica': 'warning', 'log_on_each_node': True, 'logging_dir': '/kaggle/working/runs/Apr05_14-32-29_351216fd69aa', 'logging_strategy': 'epoch', 'logging_first_step': False, 'logging_steps': 500, 'logging_nan_inf_filter': True, 'save_strategy': 'epoch', 'save_steps': 500, 'save_total_limit': None, 'save_safetensors': True, 'save_on_each_node': False, 'save_only_model': False, 'no_cuda': False, 'use_cpu': False, 'use_mps_device': False, 'seed': 42, 'data_seed': None, 'jit_mode_eval': False, 'use_ipex': False, 'bf16': False, 'fp16': True, 'fp16_opt_level': 'O1', 'half_precision_backend': 'auto', 'bf16_full_eval': False, 'fp16_full_eval': False, 'tf32': None, 'local_rank': 0, 'ddp_backend': None, 'tpu_num_cores': None, 'tpu_metrics_debug': False, 'debug': [], 'dataloader_drop_last': False, 'eval_steps': None, 'dataloader_num_workers': 0, 'dataloader_prefetch_factor': None, 'past_index': -1, 'run_name': '/kaggle/working/', 'disable_tqdm': False, 'remove_unused_columns': True, 'label_names': None, 'load_best_model_at_end': True, 'metric_for_best_model': 'loss', 'greater_is_better': False, 'ignore_data_skip': False, 'fsdp': [], 'fsdp_min_num_params': 0, 'fsdp_config': {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}, 'fsdp_transformer_layer_cls_to_wrap': None, 'accelerator_config': {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True}, 'deepspeed': None, 'label_smoothing_factor': 0.0, 'optim': 'paged_adamw_8bit', 'optim_args': None, 'adafactor': False, 'group_by_length': False, 'length_column_name': 'length', 'report_to': ['tensorboard', 'wandb'], 'ddp_find_unused_parameters': None, 'ddp_bucket_cap_mb': None, 'ddp_broadcast_buffers': None, 'dataloader_pin_memory': True, 'dataloader_persistent_workers': False, 'skip_memory_metrics': True, 'use_legacy_prediction_loop': False, 'push_to_hub': False, 'resume_from_checkpoint': None, 'hub_model_id': None, 'hub_strategy': 'every_save', 'hub_token': '<HUB_TOKEN>', 'hub_private_repo': False, 'hub_always_push': False, 'gradient_checkpointing': False, 'gradient_checkpointing_kwargs': None, 'include_inputs_for_metrics': False, 'fp16_backend': 'auto', 'push_to_hub_model_id': None, 'push_to_hub_organization': None, 'push_to_hub_token': '<PUSH_TO_HUB_TOKEN>', 'mp_parameters': '', 'auto_find_batch_size': False, 'full_determinism': False, 'torchdynamo': None, 'ray_scope': 'last', 'ddp_timeout': 1800, 'torch_compile': False, 'torch_compile_backend': None, 'torch_compile_mode': None, 'dispatch_batches': None, 'split_batches': None, 'include_tokens_per_second': False, 'include_num_input_tokens_seen': False, 'neftune_noise_alpha': None}
32
+ 2024-04-05 15:23:50,947 INFO MainThread:34 [jupyter.py:save_ipynb():373] not saving jupyter notebook
33
+ 2024-04-05 15:23:50,947 INFO MainThread:34 [wandb_init.py:_pause_backend():437] pausing backend
34
+ 2024-04-05 15:23:50,955 INFO MainThread:34 [wandb_init.py:_resume_backend():442] resuming backend
35
+ 2024-04-05 15:23:52,718 INFO MainThread:34 [jupyter.py:save_ipynb():373] not saving jupyter notebook
36
+ 2024-04-05 15:23:52,718 INFO MainThread:34 [wandb_init.py:_pause_backend():437] pausing backend
37
+ 2024-04-05 15:35:51,534 INFO MainThread:34 [wandb_init.py:_resume_backend():442] resuming backend
wandb/run-20240405_143245-z6ibr5j0/files/conda-environment.yaml ADDED
@@ -0,0 +1,1071 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ name: base
2
+ channels:
3
+ - pytorch
4
+ - file:///tmp/conda
5
+ - rapidsai
6
+ - nvidia
7
+ - conda-forge
8
+ - defaults
9
+ dependencies:
10
+ - _libgcc_mutex=0.1=conda_forge
11
+ - _openmp_mutex=4.5=2_gnu
12
+ - aiohttp=3.9.1=py310h2372a71_0
13
+ - aiosignal=1.3.1=pyhd8ed1ab_0
14
+ - annotated-types=0.6.0=pyhd8ed1ab_0
15
+ - anyio=4.2.0=pyhd8ed1ab_0
16
+ - archspec=0.2.2=pyhd8ed1ab_0
17
+ - argon2-cffi=23.1.0=pyhd8ed1ab_0
18
+ - argon2-cffi-bindings=21.2.0=py310h2372a71_4
19
+ - arrow=1.3.0=pyhd8ed1ab_0
20
+ - arrow-cpp=11.0.0=ha770c72_9_cpu
21
+ - asttokens=2.4.1=pyhd8ed1ab_0
22
+ - async-timeout=4.0.3=pyhd8ed1ab_0
23
+ - atk-1.0=2.38.0=hd4edc92_1
24
+ - attrs=23.2.0=pyh71513ae_0
25
+ - aws-c-auth=0.6.26=h987a71b_2
26
+ - aws-c-cal=0.5.21=h48707d8_2
27
+ - aws-c-common=0.8.14=h0b41bf4_0
28
+ - aws-c-compression=0.2.16=h03acc5a_5
29
+ - aws-c-event-stream=0.2.20=h00877a2_4
30
+ - aws-c-http=0.7.6=hf342b9f_0
31
+ - aws-c-io=0.13.19=h5b20300_3
32
+ - aws-c-mqtt=0.8.6=hc4349f7_12
33
+ - aws-c-s3=0.2.7=h909e904_1
34
+ - aws-c-sdkutils=0.1.9=h03acc5a_0
35
+ - aws-checksums=0.1.14=h03acc5a_5
36
+ - aws-crt-cpp=0.19.8=hf7fbfca_12
37
+ - aws-sdk-cpp=1.10.57=h17c43bd_8
38
+ - beautifulsoup4=4.12.2=pyha770c72_0
39
+ - bleach=6.1.0=pyhd8ed1ab_0
40
+ - boltons=23.1.1=pyhd8ed1ab_0
41
+ - brotli=1.0.9=h166bdaf_9
42
+ - brotli-bin=1.0.9=h166bdaf_9
43
+ - brotli-python=1.0.9=py310hd8f1fbe_9
44
+ - brotlipy=0.7.0=py310h7f8727e_1002
45
+ - bzip2=1.0.8=h7b6447c_0
46
+ - c-ares=1.25.0=hd590300_0
47
+ - ca-certificates=2024.2.2=hbcca054_0
48
+ - cached-property=1.5.2=hd8ed1ab_1
49
+ - cached_property=1.5.2=pyha770c72_1
50
+ - cairo=1.18.0=h3faef2a_0
51
+ - cartopy=0.22.0=py310hcc13569_1
52
+ - catalogue=2.0.10=py310hff52083_0
53
+ - certifi=2024.2.2=pyhd8ed1ab_0
54
+ - cffi=1.16.0=py310h2fee648_0
55
+ - charset-normalizer=3.3.2=pyhd8ed1ab_0
56
+ - click=8.1.7=unix_pyh707e725_0
57
+ - cloudpathlib=0.16.0=pyhd8ed1ab_0
58
+ - colorama=0.4.6=pyhd8ed1ab_0
59
+ - comm=0.2.1=pyhd8ed1ab_0
60
+ - conda=23.7.4=py310hff52083_0
61
+ - conda-libmamba-solver=23.7.0=pyhd8ed1ab_0
62
+ - conda-package-handling=2.2.0=pyh38be061_0
63
+ - conda-package-streaming=0.9.0=pyhd8ed1ab_0
64
+ - confection=0.1.4=py310h17c5347_0
65
+ - contourpy=1.2.0=py310hd41b1e2_0
66
+ - cryptography=41.0.7=py310hb8475ec_1
67
+ - cuda-cccl=12.4.99=0
68
+ - cuda-cudart=12.4.99=0
69
+ - cuda-cudart-dev=12.4.99=0
70
+ - cuda-nvcc-dev_linux-64=12.1.105=ha770c72_0
71
+ - cuda-nvcc-impl=12.1.105=hd3aeb46_0
72
+ - cuda-nvcc-tools=12.1.105=hd3aeb46_0
73
+ - cuda-nvrtc=12.1.105=hd3aeb46_0
74
+ - cuda-profiler-api=12.4.99=0
75
+ - cuda-python=12.4.0=py310h52dc4f0_0
76
+ - cuda-version=12.1=h1d6eff3_3
77
+ - cudf=23.08.00=cuda12_py310_230809_g8150d38e08_0
78
+ - cuml=23.08.00=cuda12_py310_230809_gd7162cdea_0
79
+ - cupy=13.0.0=py310h7aad9d2_3
80
+ - cupy-core=13.0.0=py310had4011e_3
81
+ - curl=8.6.0=hca28451_0
82
+ - cycler=0.12.1=pyhd8ed1ab_0
83
+ - cymem=2.0.8=py310hc6cd4ac_1
84
+ - cython-blis=0.7.10=py310h1f7b6fc_2
85
+ - cytoolz=0.12.3=py310h2372a71_0
86
+ - dask-cuda=23.08.00=py310_230809_gefbd6ca_0
87
+ - dask-cudf=23.08.00=cuda12_py310_230809_g8150d38e08_0
88
+ - debugpy=1.8.0=py310hc6cd4ac_1
89
+ - decorator=5.1.1=pyhd8ed1ab_0
90
+ - defusedxml=0.7.1=pyhd8ed1ab_0
91
+ - distributed=2023.7.1=pyhd8ed1ab_0
92
+ - distro=1.9.0=pyhd8ed1ab_0
93
+ - dlenv-tf-2-15-gpu=1.0.20240111=py310ha20f8e0_0
94
+ - dlpack=0.5=h9c3ff4c_0
95
+ - entrypoints=0.4=pyhd8ed1ab_0
96
+ - exceptiongroup=1.2.0=pyhd8ed1ab_2
97
+ - executing=2.0.1=pyhd8ed1ab_0
98
+ - expat=2.6.2=h59595ed_0
99
+ - fastrlock=0.8.2=py310hc6cd4ac_2
100
+ - fftw=3.3.10=nompi_hc118613_108
101
+ - fmt=9.1.0=h924138e_0
102
+ - font-ttf-dejavu-sans-mono=2.37=hab24e00_0
103
+ - font-ttf-inconsolata=3.000=h77eed37_0
104
+ - font-ttf-source-code-pro=2.038=h77eed37_0
105
+ - font-ttf-ubuntu=0.83=h77eed37_1
106
+ - fontconfig=2.14.2=h14ed4e7_0
107
+ - fonts-conda-ecosystem=1=0
108
+ - fonts-conda-forge=1=0
109
+ - fqdn=1.5.1=pyhd8ed1ab_0
110
+ - freetype=2.12.1=h267a509_2
111
+ - fribidi=1.0.10=h36c2ea0_0
112
+ - frozenlist=1.4.1=py310h2372a71_0
113
+ - fsspec=2024.3.0=pyhca7485f_0
114
+ - gdk-pixbuf=2.42.10=h829c605_5
115
+ - geos=3.11.1=h27087fc_0
116
+ - gettext=0.21.1=h27087fc_0
117
+ - gflags=2.2.2=he1b5a44_1004
118
+ - ghostscript=10.03.0=h59595ed_0
119
+ - giflib=5.2.1=h0b41bf4_3
120
+ - glog=0.6.0=h6f12383_0
121
+ - gmock=1.14.0=ha770c72_1
122
+ - gmp=6.3.0=h59595ed_0
123
+ - google-api-core-grpc=2.11.1=hd8ed1ab_0
124
+ - google-auth=2.26.1=pyhca7485f_0
125
+ - google-cloud-core=2.4.1=pyhd8ed1ab_0
126
+ - google-cloud-datastore=2.19.0=pyhd8ed1ab_0
127
+ - googleapis-common-protos=1.62.0=pyhd8ed1ab_0
128
+ - graphite2=1.3.13=h58526e2_1001
129
+ - graphviz=9.0.0=h78e8752_1
130
+ - grpc-cpp=1.51.1=h27aab58_3
131
+ - gtest=1.14.0=h00ab1b0_1
132
+ - gtk2=2.24.33=h280cfa0_4
133
+ - gts=0.7.6=h977cf35_4
134
+ - harfbuzz=8.3.0=h3d44ed6_0
135
+ - icu=73.2=h59595ed_0
136
+ - idna=3.6=pyhd8ed1ab_0
137
+ - imagemagick=7.1.1_29=pl5321hb90aeea_0
138
+ - importlib_metadata=7.0.1=hd8ed1ab_0
139
+ - importlib_resources=6.1.1=pyhd8ed1ab_0
140
+ - intel-openmp=2023.1.0=hdb19cb5_46306
141
+ - ipykernel=6.28.0=pyhd33586a_0
142
+ - ipython=8.20.0=pyh707e725_0
143
+ - ipython_genutils=0.2.0=py_1
144
+ - isoduration=20.11.0=pyhd8ed1ab_0
145
+ - jbig=2.1=h7f98852_2003
146
+ - jedi=0.19.1=pyhd8ed1ab_0
147
+ - jinja2=3.1.2=pyhd8ed1ab_1
148
+ - joblib=1.3.2=pyhd8ed1ab_0
149
+ - jsonpatch=1.33=pyhd8ed1ab_0
150
+ - jsonpointer=2.4=py310hff52083_3
151
+ - jsonschema=4.20.0=pyhd8ed1ab_0
152
+ - jsonschema-specifications=2023.12.1=pyhd8ed1ab_0
153
+ - jsonschema-with-format-nongpl=4.20.0=pyhd8ed1ab_0
154
+ - jupyter_client=8.6.0=pyhd8ed1ab_0
155
+ - jupyter_core=5.7.1=py310hff52083_0
156
+ - jupyter_events=0.9.0=pyhd8ed1ab_0
157
+ - jupyter_server_terminals=0.5.1=pyhd8ed1ab_0
158
+ - jupyterlab_pygments=0.3.0=pyhd8ed1ab_0
159
+ - keyutils=1.6.1=h166bdaf_0
160
+ - kiwisolver=1.4.5=py310hd41b1e2_1
161
+ - krb5=1.21.2=h659d440_0
162
+ - langcodes=3.3.0=pyhd8ed1ab_0
163
+ - lcms2=2.16=hb7c19ff_0
164
+ - ld_impl_linux-64=2.40=h41732ed_0
165
+ - lerc=4.0.0=h27087fc_0
166
+ - libabseil=20230125.0=cxx17_hcb278e6_1
167
+ - libarchive=3.6.2=h039dbb9_1
168
+ - libarrow=11.0.0=h33598ff_9_cpu
169
+ - libblas=3.9.0=21_linux64_openblas
170
+ - libbrotlicommon=1.0.9=h166bdaf_9
171
+ - libbrotlidec=1.0.9=h166bdaf_9
172
+ - libbrotlienc=1.0.9=h166bdaf_9
173
+ - libcblas=3.9.0=21_linux64_openblas
174
+ - libcrc32c=1.1.2=h9c3ff4c_0
175
+ - libcublas=12.1.3.1=hd3aeb46_0
176
+ - libcublas-dev=12.1.3.1=0
177
+ - libcudf=23.08.00=cuda12_230809_g8150d38e08_0
178
+ - libcufft=11.0.2.54=hd3aeb46_0
179
+ - libcufile=1.9.0.20=0
180
+ - libcufile-dev=1.9.0.20=0
181
+ - libcuml=23.08.00=cuda12_230809_gd7162cdea_0
182
+ - libcumlprims=23.08.00=cuda12_230809_g71c0a86_0
183
+ - libcurand=10.3.2.106=hd3aeb46_0
184
+ - libcurand-dev=10.3.2.106=0
185
+ - libcurl=8.6.0=hca28451_0
186
+ - libcusolver=11.4.5.107=hd3aeb46_0
187
+ - libcusolver-dev=11.4.5.107=0
188
+ - libcusparse=12.1.0.106=hd3aeb46_0
189
+ - libcusparse-dev=12.1.0.106=0
190
+ - libdeflate=1.19=hd590300_0
191
+ - libedit=3.1.20191231=he28a2e2_2
192
+ - libev=4.33=hd590300_2
193
+ - libevent=2.1.10=h28343ad_4
194
+ - libexpat=2.6.2=h59595ed_0
195
+ - libffi=3.4.2=h7f98852_5
196
+ - libgcc-ng=13.2.0=h807b86a_3
197
+ - libgd=2.3.3=h119a65a_9
198
+ - libgfortran-ng=13.2.0=h69a702a_5
199
+ - libgfortran5=13.2.0=ha4646dd_5
200
+ - libglib=2.80.0=hf2295e7_0
201
+ - libgomp=13.2.0=h807b86a_3
202
+ - libgoogle-cloud=2.8.0=h3c06191_0
203
+ - libgrpc=1.51.1=hcf146ea_3
204
+ - libhwloc=2.9.3=default_h554bfaf_1009
205
+ - libiconv=1.17=hd590300_2
206
+ - libjpeg-turbo=3.0.0=hd590300_1
207
+ - libkvikio=23.08.00=cuda12_230809_g51a9036_0
208
+ - liblapack=3.9.0=21_linux64_openblas
209
+ - libllvm14=14.0.6=hcd5def8_4
210
+ - libmamba=1.5.0=h658169a_0
211
+ - libmambapy=1.5.0=py310h8aae740_0
212
+ - libnghttp2=1.58.0=h47da74e_1
213
+ - libnsl=2.0.1=hd590300_0
214
+ - libnuma=2.0.18=hd590300_0
215
+ - libnvjitlink=12.1.105=hd3aeb46_0
216
+ - libopenblas=0.3.26=pthreads_h413a1c8_0
217
+ - libpng=1.6.43=h2797004_0
218
+ - libprotobuf=3.21.12=hfc55251_2
219
+ - libraft=23.08.00=cuda12_230809_ge588d7b5_0
220
+ - libraft-headers=23.08.00=cuda12_230809_ge588d7b5_0
221
+ - libraft-headers-only=23.08.00=cuda12_230809_ge588d7b5_0
222
+ - librmm=23.08.00=cuda12_230809_gf3af0e8d_0
223
+ - librsvg=2.56.3=he3f83f7_1
224
+ - libsodium=1.0.18=h36c2ea0_1
225
+ - libsolv=0.7.27=hfc55251_0
226
+ - libsqlite=3.44.2=h2797004_0
227
+ - libssh2=1.11.0=h0841786_0
228
+ - libstdcxx-ng=13.2.0=h7e041cc_3
229
+ - libthrift=0.18.0=h5e4af38_0
230
+ - libtiff=4.6.0=ha9c0a0a_2
231
+ - libutf8proc=2.8.0=h166bdaf_0
232
+ - libuuid=2.38.1=h0b41bf4_0
233
+ - libuv=1.46.0=hd590300_0
234
+ - libwebp=1.3.2=h658648e_1
235
+ - libwebp-base=1.3.2=hd590300_0
236
+ - libxcb=1.15=h0b41bf4_0
237
+ - libxcrypt=4.4.36=hd590300_1
238
+ - libxml2=2.12.6=h232c23b_0
239
+ - libzlib=1.2.13=hd590300_5
240
+ - llvm-openmp=8.0.1=hc9558a2_0
241
+ - locket=1.0.0=pyhd8ed1ab_0
242
+ - lz4=4.3.3=py310h350c4a5_0
243
+ - lz4-c=1.9.4=hcb278e6_0
244
+ - lzo=2.10=h516909a_1000
245
+ - magma-cuda121=2.6.1=1
246
+ - mamba=1.5.0=py310h51d5547_0
247
+ - markdown-it-py=3.0.0=pyhd8ed1ab_0
248
+ - matplotlib-base=3.8.3=py310h62c0568_0
249
+ - matplotlib-inline=0.1.6=pyhd8ed1ab_0
250
+ - mdurl=0.1.2=pyhd8ed1ab_0
251
+ - menuinst=2.0.1=py310hff52083_0
252
+ - mkl=2023.1.0=h213fc3f_46344
253
+ - msgpack-python=1.0.7=py310hd41b1e2_0
254
+ - multidict=6.0.4=py310h2372a71_1
255
+ - munkres=1.1.4=pyh9f0ad1d_0
256
+ - murmurhash=1.0.10=py310hc6cd4ac_1
257
+ - nb_conda=2.2.1=unix_7
258
+ - nb_conda_kernels=2.3.1=pyhd8ed1ab_3
259
+ - nbclassic=1.0.0=pyhb4ecaf3_1
260
+ - nbconvert-pandoc=7.14.0=pyhd8ed1ab_0
261
+ - nbformat=5.9.2=pyhd8ed1ab_0
262
+ - nccl=2.20.5.1=h3a97aeb_0
263
+ - ncurses=6.4=h59595ed_2
264
+ - nest-asyncio=1.5.8=pyhd8ed1ab_0
265
+ - nodejs=20.9.0=hb753e55_0
266
+ - notebook-shim=0.2.3=pyhd8ed1ab_0
267
+ - numpy=1.26.4=py310hb13e2d6_0
268
+ - nvcomp=2.6.1=h10b603f_3
269
+ - nvtx=0.2.10=py310h2372a71_0
270
+ - openjpeg=2.5.2=h488ebb8_0
271
+ - openmp=8.0.1=0
272
+ - openssl=3.2.1=hd590300_0
273
+ - orc=1.8.2=hfdbbad2_2
274
+ - overrides=7.4.0=pyhd8ed1ab_0
275
+ - pandoc=3.1.3=h32600fe_0
276
+ - pandocfilters=1.5.0=pyhd8ed1ab_0
277
+ - pango=1.52.1=ha41ecd1_0
278
+ - parquet-cpp=1.5.1=2
279
+ - parso=0.8.3=pyhd8ed1ab_0
280
+ - partd=1.4.1=pyhd8ed1ab_0
281
+ - pathy=0.10.3=py310h06a4308_0
282
+ - pcre2=10.43=hcad00b1_0
283
+ - perl=5.32.1=7_hd590300_perl5
284
+ - pickleshare=0.7.5=py_1003
285
+ - pip=23.3.2=pyhd8ed1ab_0
286
+ - pixman=0.43.2=h59595ed_0
287
+ - pkg-config=0.29.2=h36c2ea0_1008
288
+ - pkgutil-resolve-name=1.3.10=pyhd8ed1ab_1
289
+ - preshed=3.0.9=py310hc6cd4ac_1
290
+ - proj=9.3.1=h1d62c97_0
291
+ - prometheus_client=0.19.0=pyhd8ed1ab_0
292
+ - proto-plus=1.23.0=pyhd8ed1ab_0
293
+ - pthread-stubs=0.4=h36c2ea0_1001
294
+ - ptyprocess=0.7.0=pyhd3deb0d_0
295
+ - pure_eval=0.2.2=pyhd8ed1ab_0
296
+ - pyarrow=11.0.0=py310h633f555_9_cpu
297
+ - pyasn1=0.5.1=pyhd8ed1ab_0
298
+ - pyasn1-modules=0.3.0=pyhd8ed1ab_0
299
+ - pybind11-abi=4=hd8ed1ab_3
300
+ - pycosat=0.6.6=py310h2372a71_0
301
+ - pygments=2.17.2=pyhd8ed1ab_0
302
+ - pylibraft=23.08.00=cuda12_py310_230809_ge588d7b5_0
303
+ - pynvml=11.4.1=pyhd8ed1ab_0
304
+ - pyopenssl=23.3.0=pyhd8ed1ab_0
305
+ - pyproj=3.6.1=py310hd5c30f3_5
306
+ - pyshp=2.3.1=pyhd8ed1ab_0
307
+ - pysocks=1.7.1=py310h06a4308_0
308
+ - python=3.10.13=hd12c33a_1_cpython
309
+ - python-fastjsonschema=2.19.1=pyhd8ed1ab_0
310
+ - python-json-logger=2.0.7=pyhd8ed1ab_0
311
+ - python_abi=3.10=4_cp310
312
+ - pyu2f=0.1.5=pyhd8ed1ab_0
313
+ - pyyaml=6.0.1=py310h2372a71_1
314
+ - raft-dask=23.08.00=cuda12_py310_230809_ge588d7b5_0
315
+ - rdma-core=28.9=h59595ed_1
316
+ - re2=2023.02.02=hcb278e6_0
317
+ - readline=8.2=h8228510_1
318
+ - referencing=0.32.1=pyhd8ed1ab_0
319
+ - reproc=14.2.4.post0=hd590300_1
320
+ - reproc-cpp=14.2.4.post0=h59595ed_1
321
+ - requests=2.31.0=pyhd8ed1ab_0
322
+ - rfc3339-validator=0.1.4=pyhd8ed1ab_0
323
+ - rfc3986-validator=0.1.1=pyh9f0ad1d_0
324
+ - rmm=23.08.00=cuda12_py310_230809_gf3af0e8d_0
325
+ - rpds-py=0.16.2=py310hcb5633a_0
326
+ - rsa=4.9=pyhd8ed1ab_0
327
+ - ruamel.yaml=0.17.40=py310h2372a71_0
328
+ - ruamel.yaml.clib=0.2.7=py310h2372a71_2
329
+ - ruamel_yaml=0.15.100=py310h7f8727e_0
330
+ - s2n=1.3.41=h3358134_0
331
+ - send2trash=1.8.2=pyh41d4057_0
332
+ - setuptools=69.0.3=pyhd8ed1ab_0
333
+ - shellingham=1.5.4=pyhd8ed1ab_0
334
+ - smart_open=6.4.0=pyhd8ed1ab_0
335
+ - snappy=1.1.10=h9fff704_0
336
+ - sniffio=1.3.0=pyhd8ed1ab_0
337
+ - sortedcontainers=2.4.0=pyhd8ed1ab_0
338
+ - soupsieve=2.5=pyhd8ed1ab_1
339
+ - spacy=3.7.2=py310hcb52e73_0
340
+ - spacy-legacy=3.0.12=pyhd8ed1ab_0
341
+ - spacy-loggers=1.0.5=pyhd8ed1ab_0
342
+ - spdlog=1.11.0=h9b3ece8_1
343
+ - sqlite=3.38.2=hc218d9a_0
344
+ - srsly=2.4.8=py310hc6cd4ac_1
345
+ - stack_data=0.6.2=pyhd8ed1ab_0
346
+ - tblib=3.0.0=pyhd8ed1ab_0
347
+ - terminado=0.18.0=pyh0d859eb_0
348
+ - thinc=8.2.2=py310hcb52e73_0
349
+ - tinycss2=1.2.1=pyhd8ed1ab_0
350
+ - tk=8.6.13=noxft_h4845f30_101
351
+ - toolz=0.12.1=pyhd8ed1ab_0
352
+ - tornado=6.3.3=py310h2372a71_1
353
+ - tqdm=4.66.1=pyhd8ed1ab_0
354
+ - traitlets=5.9.0=pyhd8ed1ab_0
355
+ - treelite=3.2.0=py310h1be96d9_0
356
+ - truststore=0.8.0=pyhd8ed1ab_0
357
+ - typer=0.9.0=pyhd8ed1ab_0
358
+ - types-python-dateutil=2.8.19.20240106=pyhd8ed1ab_0
359
+ - typing-extensions=4.9.0=hd8ed1ab_0
360
+ - typing_extensions=4.9.0=pyha770c72_0
361
+ - typing_utils=0.1.0=pyhd8ed1ab_0
362
+ - ucx=1.14.1=h195a15c_5
363
+ - ucx-proc=1.0.0=gpu
364
+ - ucx-py=0.33.00=py310_230809_gea1eb8f_0
365
+ - unicodedata2=15.1.0=py310h2372a71_0
366
+ - uri-template=1.3.0=pyhd8ed1ab_0
367
+ - wasabi=1.1.2=py310hff52083_0
368
+ - wcwidth=0.2.13=pyhd8ed1ab_0
369
+ - weasel=0.3.4=pyhd8ed1ab_0
370
+ - webcolors=1.13=pyhd8ed1ab_0
371
+ - webencodings=0.5.1=pyhd8ed1ab_2
372
+ - websocket-client=1.7.0=pyhd8ed1ab_0
373
+ - wheel=0.42.0=pyhd8ed1ab_0
374
+ - xorg-kbproto=1.0.7=h7f98852_1002
375
+ - xorg-libice=1.1.1=hd590300_0
376
+ - xorg-libsm=1.2.4=h7391055_0
377
+ - xorg-libx11=1.8.7=h8ee46fc_0
378
+ - xorg-libxau=1.0.11=hd590300_0
379
+ - xorg-libxdmcp=1.1.3=h7f98852_0
380
+ - xorg-libxext=1.3.4=h0b41bf4_2
381
+ - xorg-libxrender=0.9.11=hd590300_0
382
+ - xorg-libxt=1.3.0=hd590300_1
383
+ - xorg-renderproto=0.11.1=h7f98852_1002
384
+ - xorg-xextproto=7.3.0=h0b41bf4_1003
385
+ - xorg-xproto=7.0.31=h7f98852_1007
386
+ - xyzservices=2023.10.1=pyhd8ed1ab_0
387
+ - xz=5.2.6=h166bdaf_0
388
+ - yaml=0.2.5=h7b6447c_0
389
+ - yaml-cpp=0.7.0=h59595ed_3
390
+ - zeromq=4.3.5=h59595ed_0
391
+ - zict=3.0.0=pyhd8ed1ab_0
392
+ - zipp=3.17.0=pyhd8ed1ab_0
393
+ - zlib=1.2.13=hd590300_5
394
+ - zstandard=0.22.0=py310h1275a96_0
395
+ - zstd=1.5.5=hfc55251_0
396
+ - pip:
397
+ - absl-py==1.4.0
398
+ - accelerate==0.28.0
399
+ - access==1.1.9
400
+ - affine==2.4.0
401
+ - aiobotocore==2.12.1
402
+ - aiofiles==22.1.0
403
+ - aiohttp-cors==0.7.0
404
+ - aioitertools==0.11.0
405
+ - aiorwlock==1.3.0
406
+ - aiosqlite==0.19.0
407
+ - albumentations==1.4.0
408
+ - alembic==1.13.1
409
+ - altair==5.2.0
410
+ - annoy==1.17.3
411
+ - apache-beam==2.46.0
412
+ - aplus==0.11.0
413
+ - appdirs==1.4.4
414
+ - array-record==0.5.0
415
+ - arviz==0.17.1
416
+ - astroid==3.0.3
417
+ - astropy==6.0.0
418
+ - astropy-iers-data==0.2024.3.18.0.29.47
419
+ - astunparse==1.6.3
420
+ - async-lru==2.0.4
421
+ - audioread==3.0.1
422
+ - auto-gptq==0.7.1
423
+ - autopep8==2.0.4
424
+ - babel==2.14.0
425
+ - backoff==2.2.1
426
+ - bayesian-optimization==1.4.3
427
+ - beatrix-jupyterlab==2023.128.151533
428
+ - bidict==0.23.1
429
+ - bitsandbytes==0.43.0
430
+ - blake3==0.2.1
431
+ - blessed==1.20.0
432
+ - blinker==1.7.0
433
+ - blosc2==2.5.1
434
+ - bokeh==3.3.4
435
+ - boruta==0.3
436
+ - boto3==1.26.100
437
+ - botocore==1.34.51
438
+ - bqplot==0.12.43
439
+ - branca==0.7.1
440
+ - brewer2mpl==1.4.1
441
+ - cachetools==4.2.4
442
+ - catalyst==22.4
443
+ - catboost==1.2.3
444
+ - category-encoders==2.6.3
445
+ - cesium==0.12.1
446
+ - chex==0.1.85
447
+ - cleverhans==4.0.0
448
+ - click-plugins==1.1.1
449
+ - cligj==0.7.2
450
+ - cloud-tpu-client==0.10
451
+ - cloud-tpu-profiler==2.4.0
452
+ - cloudpickle==2.2.1
453
+ - cmdstanpy==1.2.1
454
+ - cmudict==1.0.21
455
+ - colorcet==3.1.0
456
+ - coloredlogs==15.0.1
457
+ - colorful==0.5.6
458
+ - colorlog==6.8.2
459
+ - colorlover==0.3.0
460
+ - contextily==1.5.2
461
+ - convertdate==2.4.0
462
+ - crcmod==1.7
463
+ - cufflinks==0.17.3
464
+ - cvxcanon==0.1.2
465
+ - cython==3.0.8
466
+ - daal==2024.1.0
467
+ - daal4py==2024.1.0
468
+ - dacite==1.8.1
469
+ - dask==2024.3.1
470
+ - dask-expr==1.0.4
471
+ - dataclasses-json==0.6.4
472
+ - dataproc-jupyter-plugin==0.1.66
473
+ - datasets==2.1.0
474
+ - datashader==0.16.0
475
+ - datatile==1.0.3
476
+ - db-dtypes==1.2.0
477
+ - deap==1.4.1
478
+ - deepdiff==6.7.1
479
+ - deprecated==1.2.14
480
+ - deprecation==2.1.0
481
+ - descartes==1.1.0
482
+ - dill==0.3.8
483
+ - dipy==1.9.0
484
+ - distlib==0.3.8
485
+ - dm-tree==0.1.8
486
+ - docker==7.0.0
487
+ - docker-pycreds==0.4.0
488
+ - docopt==0.6.2
489
+ - docstring-parser==0.15
490
+ - docstring-to-markdown==0.15
491
+ - docutils==0.20.1
492
+ - earthengine-api==0.1.394
493
+ - easydict==1.13
494
+ - easyocr==1.7.1
495
+ - ecos==2.0.13
496
+ - einops==0.7.0
497
+ - eli5==0.13.0
498
+ - emoji==2.10.1
499
+ - en-core-web-lg==3.7.1
500
+ - en-core-web-sm==3.7.1
501
+ - ephem==4.1.5
502
+ - esda==2.5.1
503
+ - essentia==2.1b6.dev1110
504
+ - et-xmlfile==1.1.0
505
+ - etils==1.6.0
506
+ - explainable-ai-sdk==1.3.3
507
+ - farama-notifications==0.0.4
508
+ - fastai==2.7.14
509
+ - fastapi==0.108.0
510
+ - fastavro==1.9.3
511
+ - fastcore==1.5.29
512
+ - fastdownload==0.0.7
513
+ - fasteners==0.19
514
+ - fastprogress==1.0.3
515
+ - fasttext==0.9.2
516
+ - feather-format==0.4.1
517
+ - featuretools==1.30.0
518
+ - filelock==3.13.1
519
+ - fiona==1.9.6
520
+ - fitter==1.7.0
521
+ - flake8==7.0.0
522
+ - flashtext==2.7
523
+ - flask==3.0.2
524
+ - flatbuffers==23.5.26
525
+ - flax==0.8.2
526
+ - folium==0.16.0
527
+ - fonttools==4.47.0
528
+ - frozendict==2.4.0
529
+ - funcy==2.0
530
+ - fury==0.10.0
531
+ - future==1.0.0
532
+ - fuzzywuzzy==0.18.0
533
+ - gast==0.5.4
534
+ - gatspy==0.3
535
+ - gcsfs==2023.12.2.post1
536
+ - gekko==1.1.0
537
+ - gensim==4.3.2
538
+ - geographiclib==2.0
539
+ - geohash==1.0
540
+ - geojson==3.1.0
541
+ - geopandas==0.14.3
542
+ - geoplot==0.5.1
543
+ - geopy==2.4.1
544
+ - geoviews==1.11.1
545
+ - ggplot==0.11.5
546
+ - giddy==2.3.5
547
+ - gitdb==4.0.11
548
+ - gitpython==3.1.41
549
+ - google-ai-generativelanguage==0.4.0
550
+ - google-api-core==2.17.1
551
+ - google-api-python-client==2.122.0
552
+ - google-apitools==0.5.31
553
+ - google-auth-httplib2==0.1.1
554
+ - google-auth-oauthlib==1.2.0
555
+ - google-cloud-aiplatform==0.6.0a1
556
+ - google-cloud-artifact-registry==1.10.0
557
+ - google-cloud-automl==1.0.1
558
+ - google-cloud-bigquery==2.34.4
559
+ - google-cloud-bigtable==1.7.3
560
+ - google-cloud-dlp==3.14.0
561
+ - google-cloud-jupyter-config==0.0.5
562
+ - google-cloud-language==2.13.3
563
+ - google-cloud-monitoring==2.18.0
564
+ - google-cloud-pubsub==2.19.0
565
+ - google-cloud-pubsublite==1.9.0
566
+ - google-cloud-recommendations-ai==0.7.1
567
+ - google-cloud-resource-manager==1.11.0
568
+ - google-cloud-spanner==3.40.1
569
+ - google-cloud-storage==1.44.0
570
+ - google-cloud-translate==3.12.1
571
+ - google-cloud-videointelligence==2.13.3
572
+ - google-cloud-vision==2.8.0
573
+ - google-crc32c==1.5.0
574
+ - google-generativeai==0.4.1
575
+ - google-pasta==0.2.0
576
+ - google-resumable-media==2.7.0
577
+ - gplearn==0.4.2
578
+ - gpustat==1.0.0
579
+ - gpxpy==1.6.2
580
+ - greenlet==3.0.3
581
+ - grpc-google-iam-v1==0.12.7
582
+ - grpcio==1.60.0
583
+ - grpcio-status==1.48.2
584
+ - gviz-api==1.10.0
585
+ - gym==0.26.2
586
+ - gym-notices==0.0.8
587
+ - gymnasium==0.29.0
588
+ - h11==0.14.0
589
+ - h2o==3.46.0.1
590
+ - h5netcdf==1.3.0
591
+ - h5py==3.10.0
592
+ - haversine==2.8.1
593
+ - hdfs==2.7.3
594
+ - hep-ml==0.7.2
595
+ - hijri-converter==2.3.1
596
+ - hmmlearn==0.3.2
597
+ - holidays==0.24
598
+ - holoviews==1.18.3
599
+ - hpsklearn==0.1.0
600
+ - html5lib==1.1
601
+ - htmlmin==0.1.12
602
+ - httpcore==1.0.4
603
+ - httplib2==0.21.0
604
+ - httptools==0.6.1
605
+ - httpx==0.27.0
606
+ - huggingface-hub==0.21.4
607
+ - humanfriendly==10.0
608
+ - hunspell==0.5.5
609
+ - husl==4.0.3
610
+ - hydra-slayer==0.5.0
611
+ - hyperopt==0.2.7
612
+ - hypertools==0.8.0
613
+ - igraph==0.11.4
614
+ - imagecodecs==2024.1.1
615
+ - imagehash==4.3.1
616
+ - imageio==2.33.1
617
+ - imbalanced-learn==0.12.0
618
+ - imgaug==0.4.0
619
+ - importlib-metadata==6.11.0
620
+ - inequality==1.0.1
621
+ - iniconfig==2.0.0
622
+ - ipydatawidgets==4.3.5
623
+ - ipyleaflet==0.18.2
624
+ - ipympl==0.7.0
625
+ - ipython-genutils==0.2.0
626
+ - ipython-sql==0.5.0
627
+ - ipyvolume==0.6.3
628
+ - ipyvue==1.10.2
629
+ - ipyvuetify==1.9.2
630
+ - ipywebrtc==0.6.0
631
+ - ipywidgets==7.7.1
632
+ - isort==5.13.2
633
+ - isoweek==1.3.3
634
+ - itsdangerous==2.1.2
635
+ - janome==0.5.0
636
+ - jaraco-classes==3.3.0
637
+ - jax==0.4.23
638
+ - jax-jumpy==1.0.0
639
+ - jaxlib==0.4.23.dev20240116
640
+ - jeepney==0.8.0
641
+ - jieba==0.42.1
642
+ - jmespath==1.0.1
643
+ - json5==0.9.14
644
+ - jupyter-client==7.4.9
645
+ - jupyter-console==6.6.3
646
+ - jupyter-http-over-ws==0.0.8
647
+ - jupyter-lsp==1.5.1
648
+ - jupyter-server==2.13.0
649
+ - jupyter-server-fileid==0.9.1
650
+ - jupyter-server-mathjax==0.2.6
651
+ - jupyter-server-proxy==4.1.0
652
+ - jupyter-server-ydoc==0.8.0
653
+ - jupyter-ydoc==0.2.5
654
+ - jupyterlab==4.1.5
655
+ - jupyterlab-git==0.44.0
656
+ - jupyterlab-lsp==5.1.0
657
+ - jupyterlab-server==2.25.2
658
+ - jupyterlab-widgets==3.0.9
659
+ - jupytext==1.16.0
660
+ - kaggle==1.6.6
661
+ - kaggle-environments==1.14.3
662
+ - kagglehub==0.2.0
663
+ - keras==3.0.5
664
+ - keras-cv==0.8.2
665
+ - keras-nlp==0.8.2
666
+ - keras-tuner==1.4.6
667
+ - kernels-mixer==0.0.7
668
+ - keyring==24.3.0
669
+ - keyrings-google-artifactregistry-auth==1.1.2
670
+ - kfp==2.5.0
671
+ - kfp-pipeline-spec==0.2.2
672
+ - kfp-server-api==2.0.5
673
+ - kmapper==2.0.1
674
+ - kmodes==0.12.2
675
+ - korean-lunar-calendar==0.3.1
676
+ - kornia==0.7.2
677
+ - kornia-rs==0.1.2
678
+ - kt-legacy==1.0.5
679
+ - kubernetes==26.1.0
680
+ - langid==1.1.6
681
+ - lazy-loader==0.3
682
+ - learntools==0.3.4
683
+ - leven==1.0.4
684
+ - levenshtein==0.25.0
685
+ - libclang==16.0.6
686
+ - libpysal==4.9.2
687
+ - librosa==0.10.1
688
+ - lightgbm==4.2.0
689
+ - lightning-utilities==0.10.1
690
+ - lime==0.2.0.1
691
+ - line-profiler==4.1.2
692
+ - linkify-it-py==2.0.3
693
+ - llvmlite==0.41.1
694
+ - lml==0.1.0
695
+ - loguru==0.7.2
696
+ - lunarcalendar==0.0.9
697
+ - lxml==5.1.0
698
+ - mako==1.3.2
699
+ - mapclassify==2.6.1
700
+ - markdown==3.5.2
701
+ - markovify==0.9.4
702
+ - markupsafe==2.1.5
703
+ - marshmallow==3.21.1
704
+ - matplotlib==3.7.5
705
+ - matplotlib-venn==0.11.10
706
+ - mccabe==0.7.0
707
+ - mdit-py-plugins==0.4.0
708
+ - memory-profiler==0.61.0
709
+ - mercantile==1.2.1
710
+ - mgwr==2.2.1
711
+ - missingno==0.5.2
712
+ - mistune==0.8.4
713
+ - mizani==0.11.0
714
+ - ml-dtypes==0.2.0
715
+ - mlcrate==0.2.0
716
+ - mlens==0.2.3
717
+ - mlxtend==0.23.1
718
+ - mmh3==4.1.0
719
+ - mne==1.6.1
720
+ - mnist==0.2.2
721
+ - mock==5.1.0
722
+ - momepy==0.7.0
723
+ - more-itertools==10.2.0
724
+ - mpld3==0.5.10
725
+ - mpmath==1.3.0
726
+ - msgpack-numpy==0.4.8
727
+ - multimethod==1.10
728
+ - multipledispatch==1.0.0
729
+ - multiprocess==0.70.16
730
+ - mypy-extensions==1.0.0
731
+ - namex==0.0.7
732
+ - nbclient==0.5.13
733
+ - nbconvert==6.4.5
734
+ - nbdime==3.2.0
735
+ - ndindex==1.8
736
+ - networkx==3.2.1
737
+ - nibabel==5.2.1
738
+ - nilearn==0.10.3
739
+ - ninja==1.11.1.1
740
+ - nltk==3.2.4
741
+ - nose==1.3.7
742
+ - notebook==6.5.6
743
+ - notebook-executor==0.2
744
+ - numba==0.58.1
745
+ - numexpr==2.9.0
746
+ - nvidia-ml-py==11.495.46
747
+ - oauth2client==4.1.3
748
+ - oauthlib==3.2.2
749
+ - objsize==0.6.1
750
+ - odfpy==1.4.1
751
+ - olefile==0.47
752
+ - onnx==1.15.0
753
+ - opencensus==0.11.4
754
+ - opencensus-context==0.1.3
755
+ - opencv-contrib-python==4.9.0.80
756
+ - opencv-python==4.9.0.80
757
+ - opencv-python-headless==4.9.0.80
758
+ - openpyxl==3.1.2
759
+ - openslide-python==1.3.1
760
+ - opentelemetry-api==1.22.0
761
+ - opentelemetry-exporter-otlp==1.22.0
762
+ - opentelemetry-exporter-otlp-proto-common==1.22.0
763
+ - opentelemetry-exporter-otlp-proto-grpc==1.22.0
764
+ - opentelemetry-exporter-otlp-proto-http==1.22.0
765
+ - opentelemetry-proto==1.22.0
766
+ - opentelemetry-sdk==1.22.0
767
+ - opentelemetry-semantic-conventions==0.43b0
768
+ - opt-einsum==3.3.0
769
+ - optax==0.2.1
770
+ - optimum==1.18.0
771
+ - optuna==3.6.0
772
+ - orbax-checkpoint==0.5.6
773
+ - ordered-set==4.1.0
774
+ - orjson==3.9.10
775
+ - ortools==9.4.1874
776
+ - osmnx==1.9.1
777
+ - packaging==21.3
778
+ - pandas==2.2.1
779
+ - pandas-datareader==0.10.0
780
+ - pandas-profiling==3.6.6
781
+ - pandas-summary==0.2.0
782
+ - pandasql==0.7.3
783
+ - panel==1.3.8
784
+ - papermill==2.5.0
785
+ - param==2.0.2
786
+ - path==16.10.0
787
+ - path-py==12.5.0
788
+ - pathos==0.3.2
789
+ - patsy==0.5.6
790
+ - pdf2image==1.17.0
791
+ - peft==0.10.0
792
+ - pettingzoo==1.24.0
793
+ - pexpect==4.9.0
794
+ - phik==0.12.4
795
+ - pillow==9.5.0
796
+ - platformdirs==4.2.0
797
+ - plotly==5.18.0
798
+ - plotly-express==0.4.1
799
+ - plotnine==0.13.2
800
+ - pluggy==1.4.0
801
+ - pointpats==2.4.0
802
+ - polars==0.20.15
803
+ - polyglot==16.7.4
804
+ - pooch==1.8.1
805
+ - pox==0.3.4
806
+ - ppca==0.0.4
807
+ - ppft==1.7.6.8
808
+ - preprocessing==0.1.13
809
+ - prettytable==3.9.0
810
+ - progressbar2==4.4.2
811
+ - promise==2.3
812
+ - prompt-toolkit==3.0.43
813
+ - pronouncing==0.2.0
814
+ - prophet==1.1.1
815
+ - protobuf==3.20.3
816
+ - psutil==5.9.3
817
+ - pudb==2024.1
818
+ - pulp==2.8.0
819
+ - py-cpuinfo==9.0.0
820
+ - py-spy==0.3.14
821
+ - py4j==0.10.9.7
822
+ - pyaml==23.12.0
823
+ - pyarabic==0.6.15
824
+ - pyastronomy==0.21.0
825
+ - pybind11==2.11.1
826
+ - pyclipper==1.3.0.post5
827
+ - pycodestyle==2.11.1
828
+ - pycparser==2.21
829
+ - pycryptodome==3.20.0
830
+ - pyct==0.5.0
831
+ - pycuda==2024.1
832
+ - pydantic==2.5.3
833
+ - pydantic-core==2.14.6
834
+ - pydegensac==0.1.2
835
+ - pydicom==2.4.4
836
+ - pydocstyle==6.3.0
837
+ - pydot==1.4.2
838
+ - pydub==0.25.1
839
+ - pyemd==1.0.0
840
+ - pyerfa==2.0.1.1
841
+ - pyexcel-io==0.6.6
842
+ - pyexcel-ods==0.6.0
843
+ - pyflakes==3.2.0
844
+ - pygltflib==1.16.2
845
+ - pyjwt==2.8.0
846
+ - pykalman==0.9.5
847
+ - pyldavis==3.4.1
848
+ - pylint==3.0.4
849
+ - pymc3==3.11.4
850
+ - pymeeus==0.5.12
851
+ - pymongo==3.13.0
852
+ - pympler==1.0.1
853
+ - pynndescent==0.5.11
854
+ - pynvrtc==9.2
855
+ - pyocr==0.8.5
856
+ - pyparsing==3.1.1
857
+ - pypdf==4.1.0
858
+ - pysal==24.1
859
+ - pytesseract==0.3.10
860
+ - pytest==8.1.1
861
+ - python-bidi==0.4.2
862
+ - python-dateutil==2.9.0.post0
863
+ - python-dotenv==1.0.0
864
+ - python-graphviz==0.20.2
865
+ - python-levenshtein==0.25.0
866
+ - python-louvain==0.16
867
+ - python-lsp-jsonrpc==1.1.2
868
+ - python-lsp-server==1.10.1
869
+ - python-slugify==8.0.4
870
+ - python-utils==3.8.2
871
+ - pythreejs==2.4.2
872
+ - pytoolconfig==1.3.1
873
+ - pytools==2023.1.1
874
+ - pytorch-ignite==0.4.13
875
+ - pytorch-lightning==2.2.1
876
+ - pytz==2023.3.post1
877
+ - pyupset==0.1.1.post7
878
+ - pyviz-comms==3.0.1
879
+ - pywavelets==1.5.0
880
+ - pyzmq==24.0.1
881
+ - qgrid==1.3.1
882
+ - qtconsole==5.5.1
883
+ - qtpy==2.4.1
884
+ - quantecon==0.7.2
885
+ - quantities==0.15.0
886
+ - qudida==0.0.4
887
+ - rapidfuzz==3.6.2
888
+ - rasterio==1.3.9
889
+ - rasterstats==0.19.0
890
+ - ray==2.9.0
891
+ - ray-cpp==2.9.0
892
+ - regex==2023.12.25
893
+ - requests-oauthlib==1.3.1
894
+ - requests-toolbelt==0.10.1
895
+ - responses==0.18.0
896
+ - retrying==1.3.4
897
+ - rgf-python==3.12.0
898
+ - rich==13.7.0
899
+ - rich-click==1.7.4
900
+ - rope==1.12.0
901
+ - rouge==1.0.1
902
+ - rtree==1.2.0
903
+ - s2sphere==0.2.5
904
+ - s3fs==2024.3.0
905
+ - s3transfer==0.6.2
906
+ - safetensors==0.4.2
907
+ - scattertext==0.1.19
908
+ - scikit-image==0.22.0
909
+ - scikit-learn==1.2.2
910
+ - scikit-learn-intelex==2024.1.0
911
+ - scikit-multilearn==0.2.0
912
+ - scikit-optimize==0.10.1
913
+ - scikit-plot==0.3.7
914
+ - scikit-surprise==1.1.3
915
+ - scipy==1.11.4
916
+ - seaborn==0.12.2
917
+ - secretstorage==3.3.3
918
+ - segment-anything==1.0
919
+ - segregation==2.5
920
+ - semver==3.0.2
921
+ - sentencepiece==0.2.0
922
+ - sentry-sdk==1.42.0
923
+ - setproctitle==1.3.3
924
+ - setuptools-git==1.2
925
+ - setuptools-scm==8.0.4
926
+ - shap==0.44.1
927
+ - shapely==2.0.3
928
+ - shimmy==1.3.0
929
+ - simpervisor==1.0.0
930
+ - simpleitk==2.3.1
931
+ - simplejson==3.19.2
932
+ - six==1.16.0
933
+ - sklearn-pandas==2.2.0
934
+ - slicer==0.0.7
935
+ - smmap==5.0.1
936
+ - snowballstemmer==2.2.0
937
+ - snuggs==1.4.7
938
+ - soundfile==0.12.1
939
+ - soxr==0.3.7
940
+ - spaghetti==1.7.5.post1
941
+ - spectral==0.23.1
942
+ - spglm==1.1.0
943
+ - sphinx-rtd-theme==0.2.4
944
+ - spint==1.0.7
945
+ - splot==1.1.5.post1
946
+ - spopt==0.6.0
947
+ - spreg==1.4.2
948
+ - spvcm==0.3.0
949
+ - sqlalchemy==2.0.25
950
+ - sqlparse==0.4.4
951
+ - squarify==0.4.3
952
+ - stable-baselines3==2.1.0
953
+ - stack-data==0.6.3
954
+ - stanio==0.3.0
955
+ - starlette==0.32.0.post1
956
+ - statsmodels==0.14.1
957
+ - stemming==1.0.1
958
+ - stop-words==2018.7.23
959
+ - stopit==1.1.2
960
+ - stumpy==1.12.0
961
+ - sympy==1.12
962
+ - tables==3.9.2
963
+ - tabulate==0.9.0
964
+ - tangled-up-in-unicode==0.2.0
965
+ - tbb==2021.11.0
966
+ - tenacity==8.2.3
967
+ - tensorboard==2.15.1
968
+ - tensorboard-data-server==0.7.2
969
+ - tensorboard-plugin-profile==2.15.0
970
+ - tensorboardx==2.6.2.2
971
+ - tensorflow==2.15.0
972
+ - tensorflow-cloud==0.1.16
973
+ - tensorflow-datasets==4.9.4
974
+ - tensorflow-decision-forests==1.8.1
975
+ - tensorflow-estimator==2.15.0
976
+ - tensorflow-hub==0.16.1
977
+ - tensorflow-io==0.35.0
978
+ - tensorflow-io-gcs-filesystem==0.35.0
979
+ - tensorflow-metadata==0.14.0
980
+ - tensorflow-probability==0.23.0
981
+ - tensorflow-serving-api==2.14.1
982
+ - tensorflow-text==2.15.0
983
+ - tensorflow-transform==0.14.0
984
+ - tensorpack==0.11
985
+ - tensorstore==0.1.56
986
+ - termcolor==2.4.0
987
+ - testpath==0.6.0
988
+ - text-unidecode==1.3
989
+ - textblob==0.18.0.post0
990
+ - texttable==1.7.0
991
+ - tf-keras==2.15.1
992
+ - tfp-nightly==0.24.0.dev0
993
+ - theano==1.0.5
994
+ - theano-pymc==1.1.2
995
+ - threadpoolctl==3.2.0
996
+ - tifffile==2023.12.9
997
+ - timm==0.9.16
998
+ - tobler==0.11.2
999
+ - tokenizers==0.15.2
1000
+ - toml==0.10.2
1001
+ - tomli==2.0.1
1002
+ - tomlkit==0.12.4
1003
+ - torch==2.1.2
1004
+ - torchaudio==2.1.2
1005
+ - torchdata==0.7.1
1006
+ - torchinfo==1.8.0
1007
+ - torchmetrics==1.3.2
1008
+ - torchtext==0.16.2
1009
+ - torchvision==0.16.2
1010
+ - tpot==0.12.1
1011
+ - traceml==1.0.8
1012
+ - traittypes==0.2.1
1013
+ - transformers==4.38.2
1014
+ - treelite-runtime==3.2.0
1015
+ - trueskill==0.4.5
1016
+ - trx-python==0.2.9
1017
+ - tsfresh==0.20.2
1018
+ - typeguard==4.1.5
1019
+ - typing-inspect==0.9.0
1020
+ - tzdata==2023.4
1021
+ - uc-micro-py==1.0.3
1022
+ - ujson==5.9.0
1023
+ - umap-learn==0.5.5
1024
+ - unidecode==1.3.8
1025
+ - update-checker==0.18.0
1026
+ - uritemplate==3.0.1
1027
+ - urllib3==1.26.18
1028
+ - urwid==2.6.9
1029
+ - urwid-readline==0.14
1030
+ - uvicorn==0.25.0
1031
+ - uvloop==0.19.0
1032
+ - vaex==4.17.0
1033
+ - vaex-astro==0.9.3
1034
+ - vaex-core==4.17.1
1035
+ - vaex-hdf5==0.14.1
1036
+ - vaex-jupyter==0.8.2
1037
+ - vaex-ml==0.18.3
1038
+ - vaex-server==0.9.0
1039
+ - vaex-viz==0.5.4
1040
+ - vec-noise==1.1.4
1041
+ - vecstack==0.4.0
1042
+ - virtualenv==20.21.0
1043
+ - visions==0.7.5
1044
+ - vowpalwabbit==9.9.0
1045
+ - vtk==9.3.0
1046
+ - wand==0.6.13
1047
+ - wandb==0.16.4
1048
+ - watchfiles==0.21.0
1049
+ - wavio==0.0.8
1050
+ - websockets==12.0
1051
+ - werkzeug==3.0.1
1052
+ - wfdb==4.1.2
1053
+ - whatthepatch==1.0.5
1054
+ - widgetsnbextension==3.6.6
1055
+ - witwidget==1.8.1
1056
+ - woodwork==0.29.0
1057
+ - wordcloud==1.9.3
1058
+ - wordsegment==1.3.1
1059
+ - wrapt==1.14.1
1060
+ - xarray==2024.2.0
1061
+ - xarray-einstats==0.7.0
1062
+ - xgboost==2.0.3
1063
+ - xvfbwrapper==0.2.9
1064
+ - xxhash==3.4.1
1065
+ - y-py==0.6.2
1066
+ - yapf==0.40.2
1067
+ - yarl==1.9.4
1068
+ - ydata-profiling==4.6.4
1069
+ - yellowbrick==1.5
1070
+ - ypy-websocket==0.8.4
1071
+ prefix: /opt/conda
wandb/run-20240405_143245-z6ibr5j0/files/config.yaml ADDED
@@ -0,0 +1,734 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ wandb_version: 1
2
+
3
+ _wandb:
4
+ desc: null
5
+ value:
6
+ python_version: 3.10.13
7
+ cli_version: 0.16.4
8
+ framework: huggingface
9
+ huggingface_version: 4.38.2
10
+ is_jupyter_run: true
11
+ is_kaggle_kernel: true
12
+ start_time: 1712327565.0
13
+ t:
14
+ 1:
15
+ - 1
16
+ - 2
17
+ - 3
18
+ - 5
19
+ - 11
20
+ - 12
21
+ - 49
22
+ - 51
23
+ - 53
24
+ - 55
25
+ - 71
26
+ - 98
27
+ - 105
28
+ 2:
29
+ - 1
30
+ - 2
31
+ - 3
32
+ - 5
33
+ - 11
34
+ - 12
35
+ - 49
36
+ - 51
37
+ - 53
38
+ - 55
39
+ - 71
40
+ - 98
41
+ - 105
42
+ 3:
43
+ - 7
44
+ - 23
45
+ 4: 3.10.13
46
+ 5: 0.16.4
47
+ 6: 4.38.2
48
+ 8:
49
+ - 1
50
+ - 2
51
+ - 5
52
+ 9:
53
+ 1: transformers_trainer
54
+ 13: linux-x86_64
55
+ m:
56
+ - 1: train/global_step
57
+ 6:
58
+ - 3
59
+ - 1: train/loss
60
+ 5: 1
61
+ 6:
62
+ - 1
63
+ - 1: train/grad_norm
64
+ 5: 1
65
+ 6:
66
+ - 1
67
+ - 1: train/learning_rate
68
+ 5: 1
69
+ 6:
70
+ - 1
71
+ - 1: train/epoch
72
+ 5: 1
73
+ 6:
74
+ - 1
75
+ - 1: eval/loss
76
+ 5: 1
77
+ 6:
78
+ - 1
79
+ - 1: eval/runtime
80
+ 5: 1
81
+ 6:
82
+ - 1
83
+ - 1: eval/samples_per_second
84
+ 5: 1
85
+ 6:
86
+ - 1
87
+ - 1: eval/steps_per_second
88
+ 5: 1
89
+ 6:
90
+ - 1
91
+ - 1: train/train_runtime
92
+ 5: 1
93
+ 6:
94
+ - 1
95
+ - 1: train/train_samples_per_second
96
+ 5: 1
97
+ 6:
98
+ - 1
99
+ - 1: train/train_steps_per_second
100
+ 5: 1
101
+ 6:
102
+ - 1
103
+ - 1: train/total_flos
104
+ 5: 1
105
+ 6:
106
+ - 1
107
+ - 1: train/train_loss
108
+ 5: 1
109
+ 6:
110
+ - 1
111
+ vocab_size:
112
+ desc: null
113
+ value: 65024
114
+ hidden_size:
115
+ desc: null
116
+ value: 4544
117
+ num_hidden_layers:
118
+ desc: null
119
+ value: 32
120
+ num_attention_heads:
121
+ desc: null
122
+ value: 71
123
+ layer_norm_epsilon:
124
+ desc: null
125
+ value: 1.0e-05
126
+ initializer_range:
127
+ desc: null
128
+ value: 0.02
129
+ use_cache:
130
+ desc: null
131
+ value: false
132
+ hidden_dropout:
133
+ desc: null
134
+ value: 0.0
135
+ attention_dropout:
136
+ desc: null
137
+ value: 0.0
138
+ bos_token_id:
139
+ desc: null
140
+ value: 11
141
+ eos_token_id:
142
+ desc: null
143
+ value: 11
144
+ num_kv_heads:
145
+ desc: null
146
+ value: 71
147
+ alibi:
148
+ desc: null
149
+ value: false
150
+ new_decoder_architecture:
151
+ desc: null
152
+ value: false
153
+ multi_query:
154
+ desc: null
155
+ value: true
156
+ parallel_attn:
157
+ desc: null
158
+ value: true
159
+ bias:
160
+ desc: null
161
+ value: false
162
+ max_position_embeddings:
163
+ desc: null
164
+ value: 2048
165
+ rope_theta:
166
+ desc: null
167
+ value: 10000.0
168
+ rope_scaling:
169
+ desc: null
170
+ value: null
171
+ return_dict:
172
+ desc: null
173
+ value: true
174
+ output_hidden_states:
175
+ desc: null
176
+ value: false
177
+ output_attentions:
178
+ desc: null
179
+ value: false
180
+ torchscript:
181
+ desc: null
182
+ value: false
183
+ torch_dtype:
184
+ desc: null
185
+ value: bfloat16
186
+ use_bfloat16:
187
+ desc: null
188
+ value: false
189
+ tf_legacy_loss:
190
+ desc: null
191
+ value: false
192
+ pruned_heads:
193
+ desc: null
194
+ value: {}
195
+ tie_word_embeddings:
196
+ desc: null
197
+ value: true
198
+ chunk_size_feed_forward:
199
+ desc: null
200
+ value: 0
201
+ is_encoder_decoder:
202
+ desc: null
203
+ value: false
204
+ is_decoder:
205
+ desc: null
206
+ value: false
207
+ cross_attention_hidden_size:
208
+ desc: null
209
+ value: null
210
+ add_cross_attention:
211
+ desc: null
212
+ value: false
213
+ tie_encoder_decoder:
214
+ desc: null
215
+ value: false
216
+ max_length:
217
+ desc: null
218
+ value: 20
219
+ min_length:
220
+ desc: null
221
+ value: 0
222
+ do_sample:
223
+ desc: null
224
+ value: false
225
+ early_stopping:
226
+ desc: null
227
+ value: false
228
+ num_beams:
229
+ desc: null
230
+ value: 1
231
+ num_beam_groups:
232
+ desc: null
233
+ value: 1
234
+ diversity_penalty:
235
+ desc: null
236
+ value: 0.0
237
+ temperature:
238
+ desc: null
239
+ value: 1.0
240
+ top_k:
241
+ desc: null
242
+ value: 50
243
+ top_p:
244
+ desc: null
245
+ value: 1.0
246
+ typical_p:
247
+ desc: null
248
+ value: 1.0
249
+ repetition_penalty:
250
+ desc: null
251
+ value: 1.0
252
+ length_penalty:
253
+ desc: null
254
+ value: 1.0
255
+ no_repeat_ngram_size:
256
+ desc: null
257
+ value: 0
258
+ encoder_no_repeat_ngram_size:
259
+ desc: null
260
+ value: 0
261
+ bad_words_ids:
262
+ desc: null
263
+ value: null
264
+ num_return_sequences:
265
+ desc: null
266
+ value: 1
267
+ output_scores:
268
+ desc: null
269
+ value: false
270
+ return_dict_in_generate:
271
+ desc: null
272
+ value: false
273
+ forced_bos_token_id:
274
+ desc: null
275
+ value: null
276
+ forced_eos_token_id:
277
+ desc: null
278
+ value: null
279
+ remove_invalid_values:
280
+ desc: null
281
+ value: false
282
+ exponential_decay_length_penalty:
283
+ desc: null
284
+ value: null
285
+ suppress_tokens:
286
+ desc: null
287
+ value: null
288
+ begin_suppress_tokens:
289
+ desc: null
290
+ value: null
291
+ architectures:
292
+ desc: null
293
+ value:
294
+ - FalconForCausalLM
295
+ finetuning_task:
296
+ desc: null
297
+ value: null
298
+ id2label:
299
+ desc: null
300
+ value:
301
+ '0': LABEL_0
302
+ '1': LABEL_1
303
+ label2id:
304
+ desc: null
305
+ value:
306
+ LABEL_0: 0
307
+ LABEL_1: 1
308
+ tokenizer_class:
309
+ desc: null
310
+ value: null
311
+ prefix:
312
+ desc: null
313
+ value: null
314
+ pad_token_id:
315
+ desc: null
316
+ value: null
317
+ sep_token_id:
318
+ desc: null
319
+ value: null
320
+ decoder_start_token_id:
321
+ desc: null
322
+ value: null
323
+ task_specific_params:
324
+ desc: null
325
+ value: null
326
+ problem_type:
327
+ desc: null
328
+ value: null
329
+ _name_or_path:
330
+ desc: null
331
+ value: tiiuae/falcon-7b-instruct
332
+ transformers_version:
333
+ desc: null
334
+ value: 4.38.2
335
+ apply_residual_connection_post_layernorm:
336
+ desc: null
337
+ value: false
338
+ auto_map:
339
+ desc: null
340
+ value:
341
+ AutoConfig: tiiuae/falcon-7b-instruct--configuration_falcon.FalconConfig
342
+ AutoModel: tiiuae/falcon-7b-instruct--modeling_falcon.FalconModel
343
+ AutoModelForSequenceClassification: tiiuae/falcon-7b-instruct--modeling_falcon.FalconForSequenceClassification
344
+ AutoModelForTokenClassification: tiiuae/falcon-7b-instruct--modeling_falcon.FalconForTokenClassification
345
+ AutoModelForQuestionAnswering: tiiuae/falcon-7b-instruct--modeling_falcon.FalconForQuestionAnswering
346
+ AutoModelForCausalLM: tiiuae/falcon-7b-instruct--modeling_falcon.FalconForCausalLM
347
+ model_type:
348
+ desc: null
349
+ value: falcon
350
+ quantization_config:
351
+ desc: null
352
+ value:
353
+ quant_method: QuantizationMethod.BITS_AND_BYTES
354
+ _load_in_8bit: false
355
+ _load_in_4bit: true
356
+ llm_int8_threshold: 6.0
357
+ llm_int8_skip_modules: null
358
+ llm_int8_enable_fp32_cpu_offload: false
359
+ llm_int8_has_fp16_weight: false
360
+ bnb_4bit_quant_type: nf4
361
+ bnb_4bit_use_double_quant: true
362
+ bnb_4bit_compute_dtype: bfloat16
363
+ load_in_4bit: true
364
+ load_in_8bit: false
365
+ output_dir:
366
+ desc: null
367
+ value: /kaggle/working/
368
+ overwrite_output_dir:
369
+ desc: null
370
+ value: false
371
+ do_train:
372
+ desc: null
373
+ value: false
374
+ do_eval:
375
+ desc: null
376
+ value: true
377
+ do_predict:
378
+ desc: null
379
+ value: false
380
+ evaluation_strategy:
381
+ desc: null
382
+ value: epoch
383
+ prediction_loss_only:
384
+ desc: null
385
+ value: false
386
+ per_device_train_batch_size:
387
+ desc: null
388
+ value: 6
389
+ per_device_eval_batch_size:
390
+ desc: null
391
+ value: 6
392
+ per_gpu_train_batch_size:
393
+ desc: null
394
+ value: null
395
+ per_gpu_eval_batch_size:
396
+ desc: null
397
+ value: null
398
+ gradient_accumulation_steps:
399
+ desc: null
400
+ value: 4
401
+ eval_accumulation_steps:
402
+ desc: null
403
+ value: null
404
+ eval_delay:
405
+ desc: null
406
+ value: 0
407
+ learning_rate:
408
+ desc: null
409
+ value: 0.0002
410
+ weight_decay:
411
+ desc: null
412
+ value: 0.01
413
+ adam_beta1:
414
+ desc: null
415
+ value: 0.9
416
+ adam_beta2:
417
+ desc: null
418
+ value: 0.999
419
+ adam_epsilon:
420
+ desc: null
421
+ value: 1.0e-08
422
+ max_grad_norm:
423
+ desc: null
424
+ value: 1.0
425
+ num_train_epochs:
426
+ desc: null
427
+ value: 50
428
+ max_steps:
429
+ desc: null
430
+ value: -1
431
+ lr_scheduler_type:
432
+ desc: null
433
+ value: linear
434
+ lr_scheduler_kwargs:
435
+ desc: null
436
+ value: {}
437
+ warmup_ratio:
438
+ desc: null
439
+ value: 0.0
440
+ warmup_steps:
441
+ desc: null
442
+ value: 2
443
+ log_level:
444
+ desc: null
445
+ value: passive
446
+ log_level_replica:
447
+ desc: null
448
+ value: warning
449
+ log_on_each_node:
450
+ desc: null
451
+ value: true
452
+ logging_dir:
453
+ desc: null
454
+ value: /kaggle/working/runs/Apr05_14-32-29_351216fd69aa
455
+ logging_strategy:
456
+ desc: null
457
+ value: epoch
458
+ logging_first_step:
459
+ desc: null
460
+ value: false
461
+ logging_steps:
462
+ desc: null
463
+ value: 500
464
+ logging_nan_inf_filter:
465
+ desc: null
466
+ value: true
467
+ save_strategy:
468
+ desc: null
469
+ value: epoch
470
+ save_steps:
471
+ desc: null
472
+ value: 500
473
+ save_total_limit:
474
+ desc: null
475
+ value: null
476
+ save_safetensors:
477
+ desc: null
478
+ value: true
479
+ save_on_each_node:
480
+ desc: null
481
+ value: false
482
+ save_only_model:
483
+ desc: null
484
+ value: false
485
+ no_cuda:
486
+ desc: null
487
+ value: false
488
+ use_cpu:
489
+ desc: null
490
+ value: false
491
+ use_mps_device:
492
+ desc: null
493
+ value: false
494
+ seed:
495
+ desc: null
496
+ value: 42
497
+ data_seed:
498
+ desc: null
499
+ value: null
500
+ jit_mode_eval:
501
+ desc: null
502
+ value: false
503
+ use_ipex:
504
+ desc: null
505
+ value: false
506
+ bf16:
507
+ desc: null
508
+ value: false
509
+ fp16:
510
+ desc: null
511
+ value: true
512
+ fp16_opt_level:
513
+ desc: null
514
+ value: O1
515
+ half_precision_backend:
516
+ desc: null
517
+ value: auto
518
+ bf16_full_eval:
519
+ desc: null
520
+ value: false
521
+ fp16_full_eval:
522
+ desc: null
523
+ value: false
524
+ tf32:
525
+ desc: null
526
+ value: null
527
+ local_rank:
528
+ desc: null
529
+ value: 0
530
+ ddp_backend:
531
+ desc: null
532
+ value: null
533
+ tpu_num_cores:
534
+ desc: null
535
+ value: null
536
+ tpu_metrics_debug:
537
+ desc: null
538
+ value: false
539
+ debug:
540
+ desc: null
541
+ value: []
542
+ dataloader_drop_last:
543
+ desc: null
544
+ value: false
545
+ eval_steps:
546
+ desc: null
547
+ value: null
548
+ dataloader_num_workers:
549
+ desc: null
550
+ value: 0
551
+ dataloader_prefetch_factor:
552
+ desc: null
553
+ value: null
554
+ past_index:
555
+ desc: null
556
+ value: -1
557
+ run_name:
558
+ desc: null
559
+ value: /kaggle/working/
560
+ disable_tqdm:
561
+ desc: null
562
+ value: false
563
+ remove_unused_columns:
564
+ desc: null
565
+ value: true
566
+ label_names:
567
+ desc: null
568
+ value: null
569
+ load_best_model_at_end:
570
+ desc: null
571
+ value: true
572
+ metric_for_best_model:
573
+ desc: null
574
+ value: loss
575
+ greater_is_better:
576
+ desc: null
577
+ value: false
578
+ ignore_data_skip:
579
+ desc: null
580
+ value: false
581
+ fsdp:
582
+ desc: null
583
+ value: []
584
+ fsdp_min_num_params:
585
+ desc: null
586
+ value: 0
587
+ fsdp_config:
588
+ desc: null
589
+ value:
590
+ min_num_params: 0
591
+ xla: false
592
+ xla_fsdp_v2: false
593
+ xla_fsdp_grad_ckpt: false
594
+ fsdp_transformer_layer_cls_to_wrap:
595
+ desc: null
596
+ value: null
597
+ accelerator_config:
598
+ desc: null
599
+ value:
600
+ split_batches: false
601
+ dispatch_batches: null
602
+ even_batches: true
603
+ use_seedable_sampler: true
604
+ deepspeed:
605
+ desc: null
606
+ value: null
607
+ label_smoothing_factor:
608
+ desc: null
609
+ value: 0.0
610
+ optim:
611
+ desc: null
612
+ value: paged_adamw_8bit
613
+ optim_args:
614
+ desc: null
615
+ value: null
616
+ adafactor:
617
+ desc: null
618
+ value: false
619
+ group_by_length:
620
+ desc: null
621
+ value: false
622
+ length_column_name:
623
+ desc: null
624
+ value: length
625
+ report_to:
626
+ desc: null
627
+ value:
628
+ - tensorboard
629
+ - wandb
630
+ ddp_find_unused_parameters:
631
+ desc: null
632
+ value: null
633
+ ddp_bucket_cap_mb:
634
+ desc: null
635
+ value: null
636
+ ddp_broadcast_buffers:
637
+ desc: null
638
+ value: null
639
+ dataloader_pin_memory:
640
+ desc: null
641
+ value: true
642
+ dataloader_persistent_workers:
643
+ desc: null
644
+ value: false
645
+ skip_memory_metrics:
646
+ desc: null
647
+ value: true
648
+ use_legacy_prediction_loop:
649
+ desc: null
650
+ value: false
651
+ push_to_hub:
652
+ desc: null
653
+ value: false
654
+ resume_from_checkpoint:
655
+ desc: null
656
+ value: null
657
+ hub_model_id:
658
+ desc: null
659
+ value: null
660
+ hub_strategy:
661
+ desc: null
662
+ value: every_save
663
+ hub_token:
664
+ desc: null
665
+ value: <HUB_TOKEN>
666
+ hub_private_repo:
667
+ desc: null
668
+ value: false
669
+ hub_always_push:
670
+ desc: null
671
+ value: false
672
+ gradient_checkpointing:
673
+ desc: null
674
+ value: false
675
+ gradient_checkpointing_kwargs:
676
+ desc: null
677
+ value: null
678
+ include_inputs_for_metrics:
679
+ desc: null
680
+ value: false
681
+ fp16_backend:
682
+ desc: null
683
+ value: auto
684
+ push_to_hub_model_id:
685
+ desc: null
686
+ value: null
687
+ push_to_hub_organization:
688
+ desc: null
689
+ value: null
690
+ push_to_hub_token:
691
+ desc: null
692
+ value: <PUSH_TO_HUB_TOKEN>
693
+ mp_parameters:
694
+ desc: null
695
+ value: ''
696
+ auto_find_batch_size:
697
+ desc: null
698
+ value: false
699
+ full_determinism:
700
+ desc: null
701
+ value: false
702
+ torchdynamo:
703
+ desc: null
704
+ value: null
705
+ ray_scope:
706
+ desc: null
707
+ value: last
708
+ ddp_timeout:
709
+ desc: null
710
+ value: 1800
711
+ torch_compile:
712
+ desc: null
713
+ value: false
714
+ torch_compile_backend:
715
+ desc: null
716
+ value: null
717
+ torch_compile_mode:
718
+ desc: null
719
+ value: null
720
+ dispatch_batches:
721
+ desc: null
722
+ value: null
723
+ split_batches:
724
+ desc: null
725
+ value: null
726
+ include_tokens_per_second:
727
+ desc: null
728
+ value: false
729
+ include_num_input_tokens_seen:
730
+ desc: null
731
+ value: false
732
+ neftune_noise_alpha:
733
+ desc: null
734
+ value: null
wandb/run-20240405_143245-z6ibr5j0/files/output.log ADDED
@@ -0,0 +1,95 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ /opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
2
+ warnings.warn(
3
+ /opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
4
+ warnings.warn(
5
+ /opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
6
+ warnings.warn(
7
+ /opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
8
+ warnings.warn(
9
+ /opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
10
+ warnings.warn(
11
+ /opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
12
+ warnings.warn(
13
+ /opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
14
+ warnings.warn(
15
+ /opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
16
+ warnings.warn(
17
+ /opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
18
+ warnings.warn(
19
+ /opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
20
+ warnings.warn(
21
+ /opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
22
+ warnings.warn(
23
+ /opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
24
+ warnings.warn(
25
+ /opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
26
+ warnings.warn(
27
+ /opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
28
+ warnings.warn(
29
+ /opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
30
+ warnings.warn(
31
+ /opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
32
+ warnings.warn(
33
+ /opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
34
+ warnings.warn(
35
+ /opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
36
+ warnings.warn(
37
+ /opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
38
+ warnings.warn(
39
+ /opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
40
+ warnings.warn(
41
+ /opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
42
+ warnings.warn(
43
+ /opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
44
+ warnings.warn(
45
+ /opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
46
+ warnings.warn(
47
+ /opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
48
+ warnings.warn(
49
+ /opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
50
+ warnings.warn(
51
+ /opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
52
+ warnings.warn(
53
+ /opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
54
+ warnings.warn(
55
+ /opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
56
+ warnings.warn(
57
+ /opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
58
+ warnings.warn(
59
+ /opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
60
+ warnings.warn(
61
+ /opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
62
+ warnings.warn(
63
+ /opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
64
+ warnings.warn(
65
+ /opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
66
+ warnings.warn(
67
+ /opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
68
+ warnings.warn(
69
+ /opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
70
+ warnings.warn(
71
+ /opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
72
+ warnings.warn(
73
+ /opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
74
+ warnings.warn(
75
+ /opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
76
+ warnings.warn(
77
+ /opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
78
+ warnings.warn(
79
+ /opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
80
+ warnings.warn(
81
+ /opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
82
+ warnings.warn(
83
+ /opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
84
+ warnings.warn(
85
+ /opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
86
+ warnings.warn(
87
+ /opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
88
+ warnings.warn(
89
+ /opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
90
+ warnings.warn(
91
+ /opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
92
+ warnings.warn(
93
+ /opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
94
+ warnings.warn(
95
+ /opt/conda/lib/python3.10/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants.
wandb/run-20240405_143245-z6ibr5j0/files/requirements.txt ADDED
@@ -0,0 +1,881 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ Babel==2.14.0
2
+ Boruta==0.3
3
+ Brotli==1.0.9
4
+ CVXcanon==0.1.2
5
+ Cartopy==0.22.0
6
+ Cython==3.0.8
7
+ Deprecated==1.2.14
8
+ Farama-Notifications==0.0.4
9
+ Flask==3.0.2
10
+ Geohash==1.0
11
+ GitPython==3.1.41
12
+ ImageHash==4.3.1
13
+ Janome==0.5.0
14
+ Jinja2==3.1.2
15
+ Levenshtein==0.25.0
16
+ LunarCalendar==0.0.9
17
+ Mako==1.3.2
18
+ Markdown==3.5.2
19
+ MarkupSafe==2.1.3
20
+ MarkupSafe==2.1.5
21
+ Pillow==9.5.0
22
+ PuLP==2.8.0
23
+ PyArabic==0.6.15
24
+ PyAstronomy==0.21.0
25
+ PyJWT==2.8.0
26
+ PyMeeus==0.5.12
27
+ PySocks==1.7.1
28
+ PyUpSet==0.1.1.post7
29
+ PyWavelets==1.5.0
30
+ PyYAML==6.0.1
31
+ Pygments==2.17.2
32
+ Pympler==1.0.1
33
+ QtPy==2.4.1
34
+ Rtree==1.2.0
35
+ SQLAlchemy==2.0.25
36
+ SecretStorage==3.3.3
37
+ Send2Trash==1.8.2
38
+ Shapely==1.8.5.post1
39
+ Shimmy==1.3.0
40
+ SimpleITK==2.3.1
41
+ TPOT==0.12.1
42
+ Theano-PyMC==1.1.2
43
+ Theano==1.0.5
44
+ Unidecode==1.3.8
45
+ Wand==0.6.13
46
+ Werkzeug==3.0.1
47
+ absl-py==1.4.0
48
+ accelerate==0.28.0
49
+ access==1.1.9
50
+ affine==2.4.0
51
+ aiobotocore==2.12.1
52
+ aiofiles==22.1.0
53
+ aiohttp-cors==0.7.0
54
+ aiohttp==3.9.1
55
+ aioitertools==0.11.0
56
+ aiorwlock==1.3.0
57
+ aiosignal==1.3.1
58
+ aiosqlite==0.19.0
59
+ albumentations==1.4.0
60
+ alembic==1.13.1
61
+ altair==5.2.0
62
+ annotated-types==0.6.0
63
+ annoy==1.17.3
64
+ anyio==4.2.0
65
+ apache-beam==2.46.0
66
+ aplus==0.11.0
67
+ appdirs==1.4.4
68
+ archspec==0.2.2
69
+ argon2-cffi-bindings==21.2.0
70
+ argon2-cffi==23.1.0
71
+ array-record==0.5.0
72
+ arrow==1.3.0
73
+ arviz==0.17.1
74
+ astroid==3.0.3
75
+ astropy-iers-data==0.2024.3.18.0.29.47
76
+ astropy==6.0.0
77
+ asttokens==2.4.1
78
+ astunparse==1.6.3
79
+ async-lru==2.0.4
80
+ async-timeout==4.0.3
81
+ attrs==23.2.0
82
+ audioread==3.0.1
83
+ auto_gptq==0.7.1
84
+ autopep8==2.0.4
85
+ backoff==2.2.1
86
+ bayesian-optimization==1.4.3
87
+ beatrix_jupyterlab==2023.128.151533
88
+ beautifulsoup4==4.12.2
89
+ bidict==0.23.1
90
+ bitsandbytes==0.43.0
91
+ blake3==0.2.1
92
+ bleach==6.1.0
93
+ blessed==1.20.0
94
+ blinker==1.7.0
95
+ blis==0.7.10
96
+ blosc2==2.5.1
97
+ bokeh==3.3.4
98
+ boltons==23.1.1
99
+ boto3==1.26.100
100
+ botocore==1.34.51
101
+ bq_helper==0.4.1
102
+ bqplot==0.12.43
103
+ branca==0.7.1
104
+ brewer2mpl==1.4.1
105
+ brotlipy==0.7.0
106
+ cached-property==1.5.2
107
+ cachetools==4.2.4
108
+ cachetools==5.3.2
109
+ catalogue==2.0.10
110
+ catalyst==22.4
111
+ catboost==1.2.3
112
+ category-encoders==2.6.3
113
+ certifi==2024.2.2
114
+ cesium==0.12.1
115
+ cffi==1.16.0
116
+ charset-normalizer==3.3.2
117
+ chex==0.1.85
118
+ cleverhans==4.0.0
119
+ click-plugins==1.1.1
120
+ click==8.1.7
121
+ cligj==0.7.2
122
+ cloud-tpu-client==0.10
123
+ cloud-tpu-profiler==2.4.0
124
+ cloudpathlib==0.16.0
125
+ cloudpickle==2.2.1
126
+ cloudpickle==3.0.0
127
+ cmdstanpy==1.2.1
128
+ cmudict==1.0.21
129
+ colorama==0.4.6
130
+ colorcet==3.1.0
131
+ coloredlogs==15.0.1
132
+ colorful==0.5.6
133
+ colorlog==6.8.2
134
+ colorlover==0.3.0
135
+ comm==0.2.1
136
+ conda-libmamba-solver==23.7.0
137
+ conda-package-handling==2.2.0
138
+ conda==23.7.4
139
+ conda_package_streaming==0.9.0
140
+ confection==0.1.4
141
+ contextily==1.5.2
142
+ contourpy==1.2.0
143
+ convertdate==2.4.0
144
+ crcmod==1.7
145
+ cryptography==41.0.7
146
+ cuda-python==12.4.0
147
+ cudf==23.8.0
148
+ cufflinks==0.17.3
149
+ cuml==23.8.0
150
+ cupy==13.0.0
151
+ cycler==0.12.1
152
+ cymem==2.0.8
153
+ cytoolz==0.12.3
154
+ daal4py==2024.1.0
155
+ daal==2024.1.0
156
+ dacite==1.8.1
157
+ dask-cuda==23.8.0
158
+ dask-cudf==23.8.0
159
+ dask-expr==1.0.4
160
+ dask==2024.3.1
161
+ dataclasses-json==0.6.4
162
+ dataproc_jupyter_plugin==0.1.66
163
+ datasets==2.1.0
164
+ datashader==0.16.0
165
+ datatile==1.0.3
166
+ db-dtypes==1.2.0
167
+ deap==1.4.1
168
+ debugpy==1.8.0
169
+ decorator==5.1.1
170
+ deepdiff==6.7.1
171
+ defusedxml==0.7.1
172
+ deprecation==2.1.0
173
+ descartes==1.1.0
174
+ dill==0.3.8
175
+ dipy==1.9.0
176
+ distlib==0.3.8
177
+ distributed==2023.7.1
178
+ distro==1.9.0
179
+ dm-tree==0.1.8
180
+ docker-pycreds==0.4.0
181
+ docker==7.0.0
182
+ docopt==0.6.2
183
+ docstring-parser==0.15
184
+ docstring-to-markdown==0.15
185
+ docutils==0.20.1
186
+ earthengine-api==0.1.394
187
+ easydict==1.13
188
+ easyocr==1.7.1
189
+ ecos==2.0.13
190
+ einops==0.7.0
191
+ eli5==0.13.0
192
+ emoji==2.10.1
193
+ en-core-web-lg==3.7.1
194
+ en-core-web-sm==3.7.1
195
+ entrypoints==0.4
196
+ ephem==4.1.5
197
+ esda==2.5.1
198
+ essentia==2.1b6.dev1110
199
+ et-xmlfile==1.1.0
200
+ etils==1.6.0
201
+ exceptiongroup==1.2.0
202
+ executing==2.0.1
203
+ explainable-ai-sdk==1.3.3
204
+ fastai==2.7.14
205
+ fastapi==0.108.0
206
+ fastavro==1.9.3
207
+ fastcore==1.5.29
208
+ fastdownload==0.0.7
209
+ fasteners==0.19
210
+ fastjsonschema==2.19.1
211
+ fastprogress==1.0.3
212
+ fastrlock==0.8.2
213
+ fasttext==0.9.2
214
+ feather-format==0.4.1
215
+ featuretools==1.30.0
216
+ filelock==3.13.1
217
+ fiona==1.9.6
218
+ fitter==1.7.0
219
+ flake8==7.0.0
220
+ flashtext==2.7
221
+ flatbuffers==23.5.26
222
+ flax==0.8.2
223
+ folium==0.16.0
224
+ fonttools==4.47.0
225
+ fonttools==4.49.0
226
+ fqdn==1.5.1
227
+ frozendict==2.4.0
228
+ frozenlist==1.4.1
229
+ fsspec==2024.3.0
230
+ funcy==2.0
231
+ fury==0.10.0
232
+ future==1.0.0
233
+ fuzzywuzzy==0.18.0
234
+ gast==0.5.4
235
+ gatspy==0.3
236
+ gcsfs==2023.12.2.post1
237
+ gekko==1.1.0
238
+ gensim==4.3.2
239
+ geographiclib==2.0
240
+ geojson==3.1.0
241
+ geopandas==0.14.3
242
+ geoplot==0.5.1
243
+ geopy==2.4.1
244
+ geoviews==1.11.1
245
+ ggplot==0.11.5
246
+ giddy==2.3.5
247
+ gitdb==4.0.11
248
+ google-ai-generativelanguage==0.4.0
249
+ google-api-core==2.11.1
250
+ google-api-core==2.17.1
251
+ google-api-python-client==2.122.0
252
+ google-apitools==0.5.31
253
+ google-auth-httplib2==0.1.1
254
+ google-auth-oauthlib==1.2.0
255
+ google-auth==2.26.1
256
+ google-cloud-aiplatform==0.6.0a1
257
+ google-cloud-artifact-registry==1.10.0
258
+ google-cloud-automl==1.0.1
259
+ google-cloud-bigquery==2.34.4
260
+ google-cloud-bigtable==1.7.3
261
+ google-cloud-core==2.4.1
262
+ google-cloud-datastore==2.19.0
263
+ google-cloud-dlp==3.14.0
264
+ google-cloud-jupyter-config==0.0.5
265
+ google-cloud-language==2.13.3
266
+ google-cloud-monitoring==2.18.0
267
+ google-cloud-pubsub==2.19.0
268
+ google-cloud-pubsublite==1.9.0
269
+ google-cloud-recommendations-ai==0.7.1
270
+ google-cloud-resource-manager==1.11.0
271
+ google-cloud-spanner==3.40.1
272
+ google-cloud-storage==1.44.0
273
+ google-cloud-translate==3.12.1
274
+ google-cloud-videointelligence==2.13.3
275
+ google-cloud-vision==2.8.0
276
+ google-crc32c==1.5.0
277
+ google-generativeai==0.4.1
278
+ google-pasta==0.2.0
279
+ google-resumable-media==2.7.0
280
+ googleapis-common-protos==1.62.0
281
+ gplearn==0.4.2
282
+ gpustat==1.0.0
283
+ gpxpy==1.6.2
284
+ graphviz==0.20.2
285
+ greenlet==3.0.3
286
+ grpc-google-iam-v1==0.12.7
287
+ grpcio-status==1.48.1
288
+ grpcio-status==1.48.2
289
+ grpcio==1.51.1
290
+ grpcio==1.60.0
291
+ gviz-api==1.10.0
292
+ gym-notices==0.0.8
293
+ gym==0.26.2
294
+ gymnasium==0.29.0
295
+ h11==0.14.0
296
+ h2o==3.46.0.1
297
+ h5netcdf==1.3.0
298
+ h5py==3.10.0
299
+ haversine==2.8.1
300
+ hdfs==2.7.3
301
+ hep-ml==0.7.2
302
+ hijri-converter==2.3.1
303
+ hmmlearn==0.3.2
304
+ holidays==0.24
305
+ holoviews==1.18.3
306
+ hpsklearn==0.1.0
307
+ html5lib==1.1
308
+ htmlmin==0.1.12
309
+ httpcore==1.0.4
310
+ httplib2==0.21.0
311
+ httptools==0.6.1
312
+ httpx==0.27.0
313
+ huggingface-hub==0.21.4
314
+ humanfriendly==10.0
315
+ hunspell==0.5.5
316
+ husl==4.0.3
317
+ hydra-slayer==0.5.0
318
+ hyperopt==0.2.7
319
+ hypertools==0.8.0
320
+ idna==3.6
321
+ igraph==0.11.4
322
+ imagecodecs==2024.1.1
323
+ imageio==2.33.1
324
+ imbalanced-learn==0.12.0
325
+ imgaug==0.4.0
326
+ importlib-metadata==6.11.0
327
+ importlib-metadata==7.0.1
328
+ importlib-resources==6.1.1
329
+ inequality==1.0.1
330
+ iniconfig==2.0.0
331
+ ipydatawidgets==4.3.5
332
+ ipykernel==6.28.0
333
+ ipyleaflet==0.18.2
334
+ ipympl==0.7.0
335
+ ipython-genutils==0.2.0
336
+ ipython-genutils==0.2.0
337
+ ipython-sql==0.5.0
338
+ ipython==8.20.0
339
+ ipyvolume==0.6.3
340
+ ipyvue==1.10.2
341
+ ipyvuetify==1.9.2
342
+ ipywebrtc==0.6.0
343
+ ipywidgets==7.7.1
344
+ isoduration==20.11.0
345
+ isort==5.13.2
346
+ isoweek==1.3.3
347
+ itsdangerous==2.1.2
348
+ jaraco.classes==3.3.0
349
+ jax-jumpy==1.0.0
350
+ jax==0.4.23
351
+ jaxlib==0.4.23.dev20240116
352
+ jedi==0.19.1
353
+ jeepney==0.8.0
354
+ jieba==0.42.1
355
+ jmespath==1.0.1
356
+ joblib==1.3.2
357
+ json5==0.9.14
358
+ jsonpatch==1.33
359
+ jsonpointer==2.4
360
+ jsonschema-specifications==2023.12.1
361
+ jsonschema==4.20.0
362
+ jupyter-console==6.6.3
363
+ jupyter-events==0.9.0
364
+ jupyter-http-over-ws==0.0.8
365
+ jupyter-lsp==1.5.1
366
+ jupyter-server-mathjax==0.2.6
367
+ jupyter-ydoc==0.2.5
368
+ jupyter_client==7.4.9
369
+ jupyter_client==8.6.0
370
+ jupyter_core==5.7.1
371
+ jupyter_server==2.13.0
372
+ jupyter_server_fileid==0.9.1
373
+ jupyter_server_proxy==4.1.0
374
+ jupyter_server_terminals==0.5.1
375
+ jupyter_server_ydoc==0.8.0
376
+ jupyterlab-lsp==5.1.0
377
+ jupyterlab-widgets==3.0.9
378
+ jupyterlab==4.1.5
379
+ jupyterlab_git==0.44.0
380
+ jupyterlab_pygments==0.3.0
381
+ jupyterlab_server==2.25.2
382
+ jupytext==1.16.0
383
+ kaggle-environments==1.14.3
384
+ kaggle==1.6.6
385
+ kagglehub==0.2.0
386
+ keras-cv==0.8.2
387
+ keras-nlp==0.8.2
388
+ keras-tuner==1.4.6
389
+ keras==3.0.5
390
+ kernels-mixer==0.0.7
391
+ keyring==24.3.0
392
+ keyrings.google-artifactregistry-auth==1.1.2
393
+ kfp-pipeline-spec==0.2.2
394
+ kfp-server-api==2.0.5
395
+ kfp==2.5.0
396
+ kiwisolver==1.4.5
397
+ kmapper==2.0.1
398
+ kmodes==0.12.2
399
+ korean-lunar-calendar==0.3.1
400
+ kornia==0.7.2
401
+ kornia_rs==0.1.2
402
+ kt-legacy==1.0.5
403
+ kubernetes==26.1.0
404
+ langcodes==3.3.0
405
+ langid==1.1.6
406
+ lazy_loader==0.3
407
+ learntools==0.3.4
408
+ leven==1.0.4
409
+ libclang==16.0.6
410
+ libmambapy==1.5.0
411
+ libpysal==4.9.2
412
+ librosa==0.10.1
413
+ lightgbm==4.2.0
414
+ lightning-utilities==0.10.1
415
+ lime==0.2.0.1
416
+ line-profiler==4.1.2
417
+ linkify-it-py==2.0.3
418
+ llvmlite==0.41.1
419
+ llvmlite==0.42.0
420
+ lml==0.1.0
421
+ locket==1.0.0
422
+ loguru==0.7.2
423
+ lxml==5.1.0
424
+ lz4==4.3.3
425
+ mamba==1.5.0
426
+ mapclassify==2.6.1
427
+ markdown-it-py==3.0.0
428
+ markovify==0.9.4
429
+ marshmallow==3.21.1
430
+ matplotlib-inline==0.1.6
431
+ matplotlib-venn==0.11.10
432
+ matplotlib==3.7.5
433
+ matplotlib==3.8.3
434
+ mccabe==0.7.0
435
+ mdit-py-plugins==0.4.0
436
+ mdurl==0.1.2
437
+ memory-profiler==0.61.0
438
+ menuinst==2.0.1
439
+ mercantile==1.2.1
440
+ mgwr==2.2.1
441
+ missingno==0.5.2
442
+ mistune==0.8.4
443
+ mizani==0.11.0
444
+ ml-dtypes==0.2.0
445
+ mlcrate==0.2.0
446
+ mlens==0.2.3
447
+ mlxtend==0.23.1
448
+ mmh3==4.1.0
449
+ mne==1.6.1
450
+ mnist==0.2.2
451
+ mock==5.1.0
452
+ momepy==0.7.0
453
+ more-itertools==10.2.0
454
+ mpld3==0.5.10
455
+ mpmath==1.3.0
456
+ msgpack-numpy==0.4.8
457
+ msgpack==1.0.7
458
+ multidict==6.0.4
459
+ multimethod==1.10
460
+ multipledispatch==1.0.0
461
+ multiprocess==0.70.16
462
+ munkres==1.1.4
463
+ murmurhash==1.0.10
464
+ mypy-extensions==1.0.0
465
+ namex==0.0.7
466
+ nb-conda-kernels==2.3.1
467
+ nb_conda==2.2.1
468
+ nbclassic==1.0.0
469
+ nbclient==0.5.13
470
+ nbconvert==6.4.5
471
+ nbdime==3.2.0
472
+ nbformat==5.9.2
473
+ ndindex==1.8
474
+ nest-asyncio==1.5.8
475
+ networkx==3.2.1
476
+ nibabel==5.2.1
477
+ nilearn==0.10.3
478
+ ninja==1.11.1.1
479
+ nltk==3.2.4
480
+ nose==1.3.7
481
+ notebook==6.5.4
482
+ notebook==6.5.6
483
+ notebook_executor==0.2
484
+ notebook_shim==0.2.3
485
+ numba==0.58.1
486
+ numba==0.59.0
487
+ numexpr==2.9.0
488
+ numpy==1.26.4
489
+ nvidia-ml-py==11.495.46
490
+ nvtx==0.2.10
491
+ oauth2client==4.1.3
492
+ oauthlib==3.2.2
493
+ objsize==0.6.1
494
+ odfpy==1.4.1
495
+ olefile==0.47
496
+ onnx==1.15.0
497
+ opencensus-context==0.1.3
498
+ opencensus==0.11.4
499
+ opencv-contrib-python==4.9.0.80
500
+ opencv-python-headless==4.9.0.80
501
+ opencv-python==4.9.0.80
502
+ openpyxl==3.1.2
503
+ openslide-python==1.3.1
504
+ opentelemetry-api==1.22.0
505
+ opentelemetry-exporter-otlp-proto-common==1.22.0
506
+ opentelemetry-exporter-otlp-proto-grpc==1.22.0
507
+ opentelemetry-exporter-otlp-proto-http==1.22.0
508
+ opentelemetry-exporter-otlp==1.22.0
509
+ opentelemetry-proto==1.22.0
510
+ opentelemetry-sdk==1.22.0
511
+ opentelemetry-semantic-conventions==0.43b0
512
+ opt-einsum==3.3.0
513
+ optax==0.2.1
514
+ optimum==1.18.0
515
+ optuna==3.6.0
516
+ orbax-checkpoint==0.5.6
517
+ ordered-set==4.1.0
518
+ orjson==3.9.10
519
+ ortools==9.4.1874
520
+ osmnx==1.9.1
521
+ overrides==7.4.0
522
+ packaging==21.3
523
+ pandas-datareader==0.10.0
524
+ pandas-profiling==3.6.6
525
+ pandas-summary==0.2.0
526
+ pandas==2.1.4
527
+ pandas==2.2.1
528
+ pandasql==0.7.3
529
+ pandocfilters==1.5.0
530
+ panel==1.3.8
531
+ papermill==2.5.0
532
+ param==2.0.2
533
+ parso==0.8.3
534
+ partd==1.4.1
535
+ path.py==12.5.0
536
+ path==16.10.0
537
+ pathos==0.3.2
538
+ pathy==0.10.3
539
+ patsy==0.5.6
540
+ pdf2image==1.17.0
541
+ peft==0.10.0
542
+ pettingzoo==1.24.0
543
+ pexpect==4.8.0
544
+ pexpect==4.9.0
545
+ phik==0.12.4
546
+ pickleshare==0.7.5
547
+ pip==23.3.2
548
+ pkgutil_resolve_name==1.3.10
549
+ platformdirs==4.2.0
550
+ plotly-express==0.4.1
551
+ plotly==5.18.0
552
+ plotnine==0.13.2
553
+ pluggy==1.4.0
554
+ pointpats==2.4.0
555
+ polars==0.20.15
556
+ polyglot==16.7.4
557
+ pooch==1.8.1
558
+ pox==0.3.4
559
+ ppca==0.0.4
560
+ ppft==1.7.6.8
561
+ preprocessing==0.1.13
562
+ preshed==3.0.9
563
+ prettytable==3.9.0
564
+ progressbar2==4.4.2
565
+ prometheus-client==0.19.0
566
+ promise==2.3
567
+ prompt-toolkit==3.0.42
568
+ prompt-toolkit==3.0.43
569
+ pronouncing==0.2.0
570
+ prophet==1.1.1
571
+ proto-plus==1.23.0
572
+ protobuf==3.20.3
573
+ protobuf==4.21.12
574
+ psutil==5.9.3
575
+ psutil==5.9.7
576
+ ptyprocess==0.7.0
577
+ pudb==2024.1
578
+ pure-eval==0.2.2
579
+ py-cpuinfo==9.0.0
580
+ py-spy==0.3.14
581
+ py4j==0.10.9.7
582
+ pyLDAvis==3.4.1
583
+ pyOpenSSL==23.3.0
584
+ pyaml==23.12.0
585
+ pyarrow==11.0.0
586
+ pyasn1-modules==0.3.0
587
+ pyasn1==0.5.1
588
+ pybind11==2.11.1
589
+ pyclipper==1.3.0.post5
590
+ pycodestyle==2.11.1
591
+ pycosat==0.6.6
592
+ pycparser==2.21
593
+ pycryptodome==3.20.0
594
+ pyct==0.5.0
595
+ pycuda==2024.1
596
+ pydantic==2.5.3
597
+ pydantic==2.6.4
598
+ pydantic_core==2.14.6
599
+ pydantic_core==2.16.3
600
+ pydegensac==0.1.2
601
+ pydicom==2.4.4
602
+ pydocstyle==6.3.0
603
+ pydot==1.4.2
604
+ pydub==0.25.1
605
+ pyemd==1.0.0
606
+ pyerfa==2.0.1.1
607
+ pyexcel-io==0.6.6
608
+ pyexcel-ods==0.6.0
609
+ pyflakes==3.2.0
610
+ pygltflib==1.16.2
611
+ pykalman==0.9.5
612
+ pylibraft==23.8.0
613
+ pylint==3.0.4
614
+ pymc3==3.11.4
615
+ pymongo==3.13.0
616
+ pynndescent==0.5.11
617
+ pynvml==11.4.1
618
+ pynvrtc==9.2
619
+ pyocr==0.8.5
620
+ pyparsing==3.1.1
621
+ pyparsing==3.1.2
622
+ pypdf==4.1.0
623
+ pyproj==3.6.1
624
+ pysal==24.1
625
+ pyshp==2.3.1
626
+ pytesseract==0.3.10
627
+ pytest==8.1.1
628
+ python-Levenshtein==0.25.0
629
+ python-bidi==0.4.2
630
+ python-dateutil==2.9.0.post0
631
+ python-dotenv==1.0.0
632
+ python-json-logger==2.0.7
633
+ python-louvain==0.16
634
+ python-lsp-jsonrpc==1.1.2
635
+ python-lsp-server==1.10.1
636
+ python-slugify==8.0.4
637
+ python-utils==3.8.2
638
+ pythreejs==2.4.2
639
+ pytoolconfig==1.3.1
640
+ pytools==2023.1.1
641
+ pytorch-ignite==0.4.13
642
+ pytorch-lightning==2.2.1
643
+ pytz==2023.3.post1
644
+ pytz==2024.1
645
+ pyu2f==0.1.5
646
+ pyviz_comms==3.0.1
647
+ pyzmq==24.0.1
648
+ pyzmq==25.1.2
649
+ qgrid==1.3.1
650
+ qtconsole==5.5.1
651
+ quantecon==0.7.2
652
+ quantities==0.15.0
653
+ qudida==0.0.4
654
+ raft-dask==23.8.0
655
+ rapidfuzz==3.6.2
656
+ rasterio==1.3.9
657
+ rasterstats==0.19.0
658
+ ray-cpp==2.9.0
659
+ ray==2.9.0
660
+ referencing==0.32.1
661
+ regex==2023.12.25
662
+ requests-oauthlib==1.3.1
663
+ requests-toolbelt==0.10.1
664
+ requests==2.31.0
665
+ responses==0.18.0
666
+ retrying==1.3.3
667
+ retrying==1.3.4
668
+ rfc3339-validator==0.1.4
669
+ rfc3986-validator==0.1.1
670
+ rgf-python==3.12.0
671
+ rich-click==1.7.4
672
+ rich==13.7.0
673
+ rich==13.7.1
674
+ rmm==23.8.0
675
+ rope==1.12.0
676
+ rouge==1.0.1
677
+ rpds-py==0.16.2
678
+ rsa==4.9
679
+ ruamel-yaml-conda==0.15.100
680
+ ruamel.yaml.clib==0.2.7
681
+ ruamel.yaml==0.17.40
682
+ s2sphere==0.2.5
683
+ s3fs==2024.3.0
684
+ s3transfer==0.6.2
685
+ safetensors==0.4.2
686
+ scattertext==0.1.19
687
+ scikit-image==0.22.0
688
+ scikit-learn-intelex==2024.1.0
689
+ scikit-learn==1.2.2
690
+ scikit-multilearn==0.2.0
691
+ scikit-optimize==0.10.1
692
+ scikit-plot==0.3.7
693
+ scikit-surprise==1.1.3
694
+ scipy==1.11.4
695
+ scipy==1.12.0
696
+ seaborn==0.12.2
697
+ segment_anything==1.0
698
+ segregation==2.5
699
+ semver==3.0.2
700
+ sentencepiece==0.2.0
701
+ sentry-sdk==1.42.0
702
+ setproctitle==1.3.3
703
+ setuptools-git==1.2
704
+ setuptools-scm==8.0.4
705
+ setuptools==69.0.3
706
+ shap==0.44.1
707
+ shapely==2.0.3
708
+ shellingham==1.5.4
709
+ simpervisor==1.0.0
710
+ simplejson==3.19.2
711
+ six==1.16.0
712
+ sklearn-pandas==2.2.0
713
+ slicer==0.0.7
714
+ smart-open==6.4.0
715
+ smmap==5.0.1
716
+ sniffio==1.3.0
717
+ snowballstemmer==2.2.0
718
+ snuggs==1.4.7
719
+ sortedcontainers==2.4.0
720
+ soundfile==0.12.1
721
+ soupsieve==2.5
722
+ soxr==0.3.7
723
+ spacy-legacy==3.0.12
724
+ spacy-loggers==1.0.5
725
+ spacy==3.7.2
726
+ spaghetti==1.7.5.post1
727
+ spectral==0.23.1
728
+ spglm==1.1.0
729
+ sphinx-rtd-theme==0.2.4
730
+ spint==1.0.7
731
+ splot==1.1.5.post1
732
+ spopt==0.6.0
733
+ spreg==1.4.2
734
+ spvcm==0.3.0
735
+ sqlparse==0.4.4
736
+ squarify==0.4.3
737
+ srsly==2.4.8
738
+ stable-baselines3==2.1.0
739
+ stack-data==0.6.2
740
+ stack-data==0.6.3
741
+ stanio==0.3.0
742
+ starlette==0.32.0.post1
743
+ statsmodels==0.14.1
744
+ stemming==1.0.1
745
+ stop-words==2018.7.23
746
+ stopit==1.1.2
747
+ stumpy==1.12.0
748
+ sympy==1.12
749
+ tables==3.9.2
750
+ tabulate==0.9.0
751
+ tangled-up-in-unicode==0.2.0
752
+ tbb==2021.11.0
753
+ tblib==3.0.0
754
+ tenacity==8.2.3
755
+ tensorboard-data-server==0.7.2
756
+ tensorboard-plugin-profile==2.15.0
757
+ tensorboard==2.15.1
758
+ tensorboardX==2.6.2.2
759
+ tensorflow-cloud==0.1.16
760
+ tensorflow-datasets==4.9.4
761
+ tensorflow-decision-forests==1.8.1
762
+ tensorflow-estimator==2.15.0
763
+ tensorflow-hub==0.16.1
764
+ tensorflow-io-gcs-filesystem==0.35.0
765
+ tensorflow-io==0.35.0
766
+ tensorflow-metadata==0.14.0
767
+ tensorflow-probability==0.23.0
768
+ tensorflow-serving-api==2.14.1
769
+ tensorflow-text==2.15.0
770
+ tensorflow-transform==0.14.0
771
+ tensorflow==2.15.0
772
+ tensorpack==0.11
773
+ tensorstore==0.1.56
774
+ termcolor==2.4.0
775
+ terminado==0.18.0
776
+ testpath==0.6.0
777
+ text-unidecode==1.3
778
+ textblob==0.18.0.post0
779
+ texttable==1.7.0
780
+ tf_keras==2.15.1
781
+ tfp-nightly==0.24.0.dev0
782
+ thinc==8.2.2
783
+ threadpoolctl==3.2.0
784
+ tifffile==2023.12.9
785
+ timm==0.9.16
786
+ tinycss2==1.2.1
787
+ tobler==0.11.2
788
+ tokenizers==0.15.2
789
+ toml==0.10.2
790
+ tomli==2.0.1
791
+ tomlkit==0.12.4
792
+ toolz==0.12.1
793
+ torch==2.1.2
794
+ torchaudio==2.1.2
795
+ torchdata==0.7.1
796
+ torchinfo==1.8.0
797
+ torchmetrics==1.3.2
798
+ torchtext==0.16.2
799
+ torchvision==0.16.2
800
+ tornado==6.3.3
801
+ tqdm==4.66.1
802
+ traceml==1.0.8
803
+ traitlets==5.9.0
804
+ traittypes==0.2.1
805
+ transformers==4.38.2
806
+ treelite-runtime==3.2.0
807
+ treelite==3.2.0
808
+ trueskill==0.4.5
809
+ truststore==0.8.0
810
+ trx-python==0.2.9
811
+ tsfresh==0.20.2
812
+ typeguard==4.1.5
813
+ typer==0.9.0
814
+ types-python-dateutil==2.8.19.20240106
815
+ typing-inspect==0.9.0
816
+ typing-utils==0.1.0
817
+ typing_extensions==4.9.0
818
+ tzdata==2023.4
819
+ uc-micro-py==1.0.3
820
+ ucx-py==0.33.0
821
+ ujson==5.9.0
822
+ umap-learn==0.5.5
823
+ unicodedata2==15.1.0
824
+ update-checker==0.18.0
825
+ uri-template==1.3.0
826
+ uritemplate==3.0.1
827
+ urllib3==1.26.18
828
+ urllib3==2.1.0
829
+ urwid==2.6.9
830
+ urwid_readline==0.14
831
+ uvicorn==0.25.0
832
+ uvloop==0.19.0
833
+ vaex-astro==0.9.3
834
+ vaex-core==4.17.1
835
+ vaex-hdf5==0.14.1
836
+ vaex-jupyter==0.8.2
837
+ vaex-ml==0.18.3
838
+ vaex-server==0.9.0
839
+ vaex-viz==0.5.4
840
+ vaex==4.17.0
841
+ vec_noise==1.1.4
842
+ vecstack==0.4.0
843
+ virtualenv==20.21.0
844
+ visions==0.7.5
845
+ vowpalwabbit==9.9.0
846
+ vtk==9.3.0
847
+ wandb==0.16.4
848
+ wasabi==1.1.2
849
+ watchfiles==0.21.0
850
+ wavio==0.0.8
851
+ wcwidth==0.2.13
852
+ weasel==0.3.4
853
+ webcolors==1.13
854
+ webencodings==0.5.1
855
+ websocket-client==1.7.0
856
+ websockets==12.0
857
+ wfdb==4.1.2
858
+ whatthepatch==1.0.5
859
+ wheel==0.42.0
860
+ widgetsnbextension==3.6.6
861
+ witwidget==1.8.1
862
+ woodwork==0.29.0
863
+ wordcloud==1.9.3
864
+ wordsegment==1.3.1
865
+ wrapt==1.14.1
866
+ xarray-einstats==0.7.0
867
+ xarray==2024.2.0
868
+ xgboost==2.0.3
869
+ xvfbwrapper==0.2.9
870
+ xxhash==3.4.1
871
+ xyzservices==2023.10.1
872
+ y-py==0.6.2
873
+ yapf==0.40.2
874
+ yarl==1.9.3
875
+ yarl==1.9.4
876
+ ydata-profiling==4.6.4
877
+ yellowbrick==1.5
878
+ ypy-websocket==0.8.4
879
+ zict==3.0.0
880
+ zipp==3.17.0
881
+ zstandard==0.22.0
wandb/run-20240405_143245-z6ibr5j0/files/wandb-metadata.json ADDED
@@ -0,0 +1,66 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "os": "Linux-5.15.133+-x86_64-with-glibc2.31",
3
+ "python": "3.10.13",
4
+ "heartbeatAt": "2024-04-05T14:32:46.419706",
5
+ "startedAt": "2024-04-05T14:32:45.561691",
6
+ "docker": null,
7
+ "cuda": null,
8
+ "args": [],
9
+ "state": "running",
10
+ "program": "kaggle.ipynb",
11
+ "codePathLocal": null,
12
+ "root": "/kaggle/working",
13
+ "host": "351216fd69aa",
14
+ "username": "root",
15
+ "executable": "/opt/conda/bin/python3.10",
16
+ "cpu_count": 2,
17
+ "cpu_count_logical": 4,
18
+ "cpu_freq": {
19
+ "current": 2000.148,
20
+ "min": 0.0,
21
+ "max": 0.0
22
+ },
23
+ "cpu_freq_per_core": [
24
+ {
25
+ "current": 2000.148,
26
+ "min": 0.0,
27
+ "max": 0.0
28
+ },
29
+ {
30
+ "current": 2000.148,
31
+ "min": 0.0,
32
+ "max": 0.0
33
+ },
34
+ {
35
+ "current": 2000.148,
36
+ "min": 0.0,
37
+ "max": 0.0
38
+ },
39
+ {
40
+ "current": 2000.148,
41
+ "min": 0.0,
42
+ "max": 0.0
43
+ }
44
+ ],
45
+ "disk": {
46
+ "/": {
47
+ "total": 8062.387607574463,
48
+ "used": 5533.654884338379
49
+ }
50
+ },
51
+ "gpu": "Tesla T4",
52
+ "gpu_count": 2,
53
+ "gpu_devices": [
54
+ {
55
+ "name": "Tesla T4",
56
+ "memory_total": 16106127360
57
+ },
58
+ {
59
+ "name": "Tesla T4",
60
+ "memory_total": 16106127360
61
+ }
62
+ ],
63
+ "memory": {
64
+ "total": 31.357559204101562
65
+ }
66
+ }
wandb/run-20240405_143245-z6ibr5j0/files/wandb-summary.json ADDED
@@ -0,0 +1 @@
 
 
1
+ {"train/loss": 0.066, "train/grad_norm": 0.8965990543365479, "train/learning_rate": 0.0, "train/epoch": 47.62, "train/global_step": 250, "_timestamp": 1712330630.9422815, "_runtime": 3065.373146533966, "_step": 96, "eval/loss": 0.3189685344696045, "eval/runtime": 5.9485, "eval/samples_per_second": 5.211, "eval/steps_per_second": 1.009, "train/train_runtime": 3080.6119, "train/train_samples_per_second": 1.964, "train/train_steps_per_second": 0.081, "train/total_flos": 2.2984269045888e+16, "train/train_loss": 0.40684156107902525}
wandb/run-20240405_143245-z6ibr5j0/logs/debug-internal.log ADDED
The diff for this file is too large to render. See raw diff
 
wandb/run-20240405_143245-z6ibr5j0/logs/debug.log ADDED
@@ -0,0 +1,37 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ 2024-04-05 14:32:45,563 INFO MainThread:34 [wandb_setup.py:_flush():76] Current SDK version is 0.16.4
2
+ 2024-04-05 14:32:45,563 INFO MainThread:34 [wandb_setup.py:_flush():76] Configure stats pid to 34
3
+ 2024-04-05 14:32:45,563 INFO MainThread:34 [wandb_setup.py:_flush():76] Loading settings from /root/.config/wandb/settings
4
+ 2024-04-05 14:32:45,563 INFO MainThread:34 [wandb_setup.py:_flush():76] Loading settings from /kaggle/working/wandb/settings
5
+ 2024-04-05 14:32:45,563 INFO MainThread:34 [wandb_setup.py:_flush():76] Loading settings from environment variables: {}
6
+ 2024-04-05 14:32:45,563 INFO MainThread:34 [wandb_setup.py:_flush():76] Applying setup settings: {'_disable_service': False}
7
+ 2024-04-05 14:32:45,563 INFO MainThread:34 [wandb_setup.py:_flush():76] Inferring run settings from compute environment: {'program': '<python with no main file>'}
8
+ 2024-04-05 14:32:45,563 INFO MainThread:34 [wandb_setup.py:_flush():76] Applying login settings: {'api_key': '***REDACTED***'}
9
+ 2024-04-05 14:32:45,563 INFO MainThread:34 [wandb_init.py:_log_setup():526] Logging user logs to /kaggle/working/wandb/run-20240405_143245-z6ibr5j0/logs/debug.log
10
+ 2024-04-05 14:32:45,563 INFO MainThread:34 [wandb_init.py:_log_setup():527] Logging internal logs to /kaggle/working/wandb/run-20240405_143245-z6ibr5j0/logs/debug-internal.log
11
+ 2024-04-05 14:32:45,564 INFO MainThread:34 [wandb_init.py:_jupyter_setup():472] configuring jupyter hooks <wandb.sdk.wandb_init._WandbInit object at 0x7ac638f76e30>
12
+ 2024-04-05 14:32:45,564 INFO MainThread:34 [wandb_init.py:init():566] calling init triggers
13
+ 2024-04-05 14:32:45,564 INFO MainThread:34 [wandb_init.py:init():573] wandb.init called with sweep_config: {}
14
+ config: {}
15
+ 2024-04-05 14:32:45,564 INFO MainThread:34 [wandb_init.py:init():616] starting backend
16
+ 2024-04-05 14:32:45,564 INFO MainThread:34 [wandb_init.py:init():620] setting up manager
17
+ 2024-04-05 14:32:45,566 INFO MainThread:34 [backend.py:_multiprocessing_setup():105] multiprocessing start_methods=fork,spawn,forkserver, using: spawn
18
+ 2024-04-05 14:32:45,568 INFO MainThread:34 [wandb_init.py:init():628] backend started and connected
19
+ 2024-04-05 14:32:45,580 INFO MainThread:34 [wandb_run.py:_label_probe_notebook():1295] probe notebook
20
+ 2024-04-05 14:32:46,063 INFO MainThread:34 [wandb_init.py:init():720] updated telemetry
21
+ 2024-04-05 14:32:46,067 INFO MainThread:34 [wandb_init.py:init():753] communicating run to backend with 90.0 second timeout
22
+ 2024-04-05 14:32:46,313 INFO MainThread:34 [wandb_run.py:_on_init():2262] communicating current version
23
+ 2024-04-05 14:32:46,379 INFO MainThread:34 [wandb_run.py:_on_init():2271] got version response upgrade_message: "wandb version 0.16.6 is available! To upgrade, please run:\n $ pip install wandb --upgrade"
24
+
25
+ 2024-04-05 14:32:46,379 INFO MainThread:34 [wandb_init.py:init():804] starting run threads in backend
26
+ 2024-04-05 14:33:17,409 INFO MainThread:34 [wandb_run.py:_console_start():2241] atexit reg
27
+ 2024-04-05 14:33:17,409 INFO MainThread:34 [wandb_run.py:_redirect():2096] redirect: wrap_raw
28
+ 2024-04-05 14:33:17,410 INFO MainThread:34 [wandb_run.py:_redirect():2161] Wrapping output streams.
29
+ 2024-04-05 14:33:17,410 INFO MainThread:34 [wandb_run.py:_redirect():2186] Redirects installed.
30
+ 2024-04-05 14:33:17,411 INFO MainThread:34 [wandb_init.py:init():847] run started, returning control to user process
31
+ 2024-04-05 14:33:17,416 INFO MainThread:34 [wandb_run.py:_config_callback():1343] config_cb None None {'vocab_size': 65024, 'hidden_size': 4544, 'num_hidden_layers': 32, 'num_attention_heads': 71, 'layer_norm_epsilon': 1e-05, 'initializer_range': 0.02, 'use_cache': False, 'hidden_dropout': 0.0, 'attention_dropout': 0.0, 'bos_token_id': 11, 'eos_token_id': 11, 'num_kv_heads': 71, 'alibi': False, 'new_decoder_architecture': False, 'multi_query': True, 'parallel_attn': True, 'bias': False, 'max_position_embeddings': 2048, 'rope_theta': 10000.0, 'rope_scaling': None, 'return_dict': True, 'output_hidden_states': False, 'output_attentions': False, 'torchscript': False, 'torch_dtype': 'bfloat16', 'use_bfloat16': False, 'tf_legacy_loss': False, 'pruned_heads': {}, 'tie_word_embeddings': True, 'chunk_size_feed_forward': 0, 'is_encoder_decoder': False, 'is_decoder': False, 'cross_attention_hidden_size': None, 'add_cross_attention': False, 'tie_encoder_decoder': False, 'max_length': 20, 'min_length': 0, 'do_sample': False, 'early_stopping': False, 'num_beams': 1, 'num_beam_groups': 1, 'diversity_penalty': 0.0, 'temperature': 1.0, 'top_k': 50, 'top_p': 1.0, 'typical_p': 1.0, 'repetition_penalty': 1.0, 'length_penalty': 1.0, 'no_repeat_ngram_size': 0, 'encoder_no_repeat_ngram_size': 0, 'bad_words_ids': None, 'num_return_sequences': 1, 'output_scores': False, 'return_dict_in_generate': False, 'forced_bos_token_id': None, 'forced_eos_token_id': None, 'remove_invalid_values': False, 'exponential_decay_length_penalty': None, 'suppress_tokens': None, 'begin_suppress_tokens': None, 'architectures': ['FalconForCausalLM'], 'finetuning_task': None, 'id2label': {0: 'LABEL_0', 1: 'LABEL_1'}, 'label2id': {'LABEL_0': 0, 'LABEL_1': 1}, 'tokenizer_class': None, 'prefix': None, 'pad_token_id': None, 'sep_token_id': None, 'decoder_start_token_id': None, 'task_specific_params': None, 'problem_type': None, '_name_or_path': 'tiiuae/falcon-7b-instruct', 'transformers_version': '4.38.2', 'apply_residual_connection_post_layernorm': False, 'auto_map': {'AutoConfig': 'tiiuae/falcon-7b-instruct--configuration_falcon.FalconConfig', 'AutoModel': 'tiiuae/falcon-7b-instruct--modeling_falcon.FalconModel', 'AutoModelForSequenceClassification': 'tiiuae/falcon-7b-instruct--modeling_falcon.FalconForSequenceClassification', 'AutoModelForTokenClassification': 'tiiuae/falcon-7b-instruct--modeling_falcon.FalconForTokenClassification', 'AutoModelForQuestionAnswering': 'tiiuae/falcon-7b-instruct--modeling_falcon.FalconForQuestionAnswering', 'AutoModelForCausalLM': 'tiiuae/falcon-7b-instruct--modeling_falcon.FalconForCausalLM'}, 'model_type': 'falcon', 'quantization_config': {'quant_method': 'QuantizationMethod.BITS_AND_BYTES', '_load_in_8bit': False, '_load_in_4bit': True, 'llm_int8_threshold': 6.0, 'llm_int8_skip_modules': None, 'llm_int8_enable_fp32_cpu_offload': False, 'llm_int8_has_fp16_weight': False, 'bnb_4bit_quant_type': 'nf4', 'bnb_4bit_use_double_quant': True, 'bnb_4bit_compute_dtype': 'bfloat16', 'load_in_4bit': True, 'load_in_8bit': False}, 'output_dir': '/kaggle/working/', 'overwrite_output_dir': False, 'do_train': False, 'do_eval': True, 'do_predict': False, 'evaluation_strategy': 'epoch', 'prediction_loss_only': False, 'per_device_train_batch_size': 6, 'per_device_eval_batch_size': 6, 'per_gpu_train_batch_size': None, 'per_gpu_eval_batch_size': None, 'gradient_accumulation_steps': 4, 'eval_accumulation_steps': None, 'eval_delay': 0, 'learning_rate': 0.0002, 'weight_decay': 0.01, 'adam_beta1': 0.9, 'adam_beta2': 0.999, 'adam_epsilon': 1e-08, 'max_grad_norm': 1.0, 'num_train_epochs': 50, 'max_steps': -1, 'lr_scheduler_type': 'linear', 'lr_scheduler_kwargs': {}, 'warmup_ratio': 0.0, 'warmup_steps': 2, 'log_level': 'passive', 'log_level_replica': 'warning', 'log_on_each_node': True, 'logging_dir': '/kaggle/working/runs/Apr05_14-32-29_351216fd69aa', 'logging_strategy': 'epoch', 'logging_first_step': False, 'logging_steps': 500, 'logging_nan_inf_filter': True, 'save_strategy': 'epoch', 'save_steps': 500, 'save_total_limit': None, 'save_safetensors': True, 'save_on_each_node': False, 'save_only_model': False, 'no_cuda': False, 'use_cpu': False, 'use_mps_device': False, 'seed': 42, 'data_seed': None, 'jit_mode_eval': False, 'use_ipex': False, 'bf16': False, 'fp16': True, 'fp16_opt_level': 'O1', 'half_precision_backend': 'auto', 'bf16_full_eval': False, 'fp16_full_eval': False, 'tf32': None, 'local_rank': 0, 'ddp_backend': None, 'tpu_num_cores': None, 'tpu_metrics_debug': False, 'debug': [], 'dataloader_drop_last': False, 'eval_steps': None, 'dataloader_num_workers': 0, 'dataloader_prefetch_factor': None, 'past_index': -1, 'run_name': '/kaggle/working/', 'disable_tqdm': False, 'remove_unused_columns': True, 'label_names': None, 'load_best_model_at_end': True, 'metric_for_best_model': 'loss', 'greater_is_better': False, 'ignore_data_skip': False, 'fsdp': [], 'fsdp_min_num_params': 0, 'fsdp_config': {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}, 'fsdp_transformer_layer_cls_to_wrap': None, 'accelerator_config': {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True}, 'deepspeed': None, 'label_smoothing_factor': 0.0, 'optim': 'paged_adamw_8bit', 'optim_args': None, 'adafactor': False, 'group_by_length': False, 'length_column_name': 'length', 'report_to': ['tensorboard', 'wandb'], 'ddp_find_unused_parameters': None, 'ddp_bucket_cap_mb': None, 'ddp_broadcast_buffers': None, 'dataloader_pin_memory': True, 'dataloader_persistent_workers': False, 'skip_memory_metrics': True, 'use_legacy_prediction_loop': False, 'push_to_hub': False, 'resume_from_checkpoint': None, 'hub_model_id': None, 'hub_strategy': 'every_save', 'hub_token': '<HUB_TOKEN>', 'hub_private_repo': False, 'hub_always_push': False, 'gradient_checkpointing': False, 'gradient_checkpointing_kwargs': None, 'include_inputs_for_metrics': False, 'fp16_backend': 'auto', 'push_to_hub_model_id': None, 'push_to_hub_organization': None, 'push_to_hub_token': '<PUSH_TO_HUB_TOKEN>', 'mp_parameters': '', 'auto_find_batch_size': False, 'full_determinism': False, 'torchdynamo': None, 'ray_scope': 'last', 'ddp_timeout': 1800, 'torch_compile': False, 'torch_compile_backend': None, 'torch_compile_mode': None, 'dispatch_batches': None, 'split_batches': None, 'include_tokens_per_second': False, 'include_num_input_tokens_seen': False, 'neftune_noise_alpha': None}
32
+ 2024-04-05 15:23:50,947 INFO MainThread:34 [jupyter.py:save_ipynb():373] not saving jupyter notebook
33
+ 2024-04-05 15:23:50,947 INFO MainThread:34 [wandb_init.py:_pause_backend():437] pausing backend
34
+ 2024-04-05 15:23:50,955 INFO MainThread:34 [wandb_init.py:_resume_backend():442] resuming backend
35
+ 2024-04-05 15:23:52,718 INFO MainThread:34 [jupyter.py:save_ipynb():373] not saving jupyter notebook
36
+ 2024-04-05 15:23:52,718 INFO MainThread:34 [wandb_init.py:_pause_backend():437] pausing backend
37
+ 2024-04-05 15:35:51,534 INFO MainThread:34 [wandb_init.py:_resume_backend():442] resuming backend
wandb/run-20240405_143245-z6ibr5j0/run-z6ibr5j0.wandb ADDED
Binary file (146 kB). View file