Commit History

chore: update README and REPORT with performance insights and dataset changes
0dbb356

Sarthak committed on

chore: update dependencies and configuration for improved training
7837959

Sarthak committed on

chore: updated eval charts
37196da

Sarthak committed on

chore: moved tokenlearn in as an internal package
4255a26

Sarthak committed on

chore: moved model2vec in as an internal package
473c3a0

Sarthak committed on

feat(distiller): add option to skip post-training re-regularization
72121b3

Sarthak committed on

feat(analyze): enhance model type identification in analysis
fba41e9

Sarthak committed on

refactor(report): reflect distillation experiments data in report
d8a91f9

Sarthak committed on

chore(analysis): update image files
4afb5eb

Sarthak committed on

fix(pretrain): clamp input IDs to prevent out of bounds error
acc7e7d

Sarthak committed on

chore(distiller): remove unused GPU_NAME variable
293cdb2

Sarthak committed on

feat(config): allow multiple GPU types for training and simplify GPU handling
d820ac9

Sarthak committed on

refactor(patch-utils): improve patch application check for tokenlearn
c9e9334

Sarthak committed on

refactor(distiller): improve beam distillation and tokenlearn integration
729d700

Sarthak committed on

docs(readme): refine dataset identifiers in README
93151b9

Sarthak committed on

docs: rename model to codemalt and update evaluation instructions
6742590

Sarthak committed on

refactor(beam-utils): use direct file operations for beam volumes
12d70ca

Sarthak committed on

feat(distiller): add checkpointing and refactor analyze.py
8083c06

Sarthak committed on

chore(model): update model weights
ff551a2

Sarthak committed on

chore: add initial CodeMap configuration file
a4beaf0

Sarthak committed on

chore: remove MTEB results analysis script
7b072e5

Sarthak committed on

chore: add ignore patterns for temp and env files
aaaf803

Sarthak committed on

chore: configure git lfs for mteb and evaluation files
2879afb

Sarthak committed on

build: add taskfile for common commands
b7f5dc4

Sarthak committed on

chore(analysis-charts): add batch size and memory scaling charts
038fbb2

Sarthak committed on

chore(model): remove metadata file for distilled model
9e12501

Sarthak committed on

chore(analysis-charts): add peer comparison chart
7c0ebdd

Sarthak committed on

chore(analysis-charts): add benchmark performance chart
3d41a07

Sarthak committed on

chore(assets): add language heatmap image
e3760e9

Sarthak committed on

fix(main): reduce maximum queries per language to 100
da2f1e0

Sarthak committed on

chore(deps): update dependencies in uv.lock
44ddb30

Sarthak committed on

chore(analysis-charts): add new radar charts for code model embeddings
432dc37

Sarthak committed on

feat(distiller): configure beam functions with resource settings
ef6935e

Sarthak committed on

feat: introduce distiller package and update README
ee673cb

Sarthak committed on

feat(config): add new model type for vector embeddings
53a6528

Sarthak committed on

chore(analysis-charts): add new image assets
bba24e6

Sarthak committed on

chore: remove unused scripts and update dependencies
1bc7e54

Sarthak committed on

feat: overhaul distiller package with unified CLI, enhanced evaluation, and modular structure
454e47c

Sarthak committed on

feat: created a cli to manage the complete generation process
ea0b2a0

Sarthak committed on

feat: 4 stage training, refinement failed for first 3
0b74f1f

Sarthak committed on

feat: coding stage training completed
0b87653

Sarthak committed on

feat: potion-based trained model
07b2cb1

Sarthak committed on

chore: add env vars
850cf20

Sarthak committed on

feat: added MTEB evaluation scripts
eb5363b

Sarthak committed on

fix: migrate binary files to LFS tracking
b82c1c9

Sarthak committed on

chore: set lfs tracking
a49dfe1

Sarthak committed on

docs: Enhance README.md with detailed project information for gte-Qwen2-7B-instruct-M2V-Distilled, including model optimization benefits, installation instructions, usage examples, and performance results.
4a1942c

Sarthak committed on

initial commit
ecfceb8

Sarthak committed on

initial commit
7bb46ce
unverified

sarthak1 committed on