Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Aravind Pai's picture
3

Aravind Pai

aravindpai

AI & ML interests

None yet

Recent Activity

replied to tomaarsen's post 16 days ago
🏅 Quantized Embeddings are here! Unlike model quantization, embedding quantization is a post-processing step for embeddings that converts e.g. `float32` embeddings to binary or `int8` embeddings. This saves 32x or 4x memory & disk space, and these embeddings are much easier to compare! Our results show 25-45x speedups in retrieval compared to full-size embeddings, while keeping 96% of the performance! Learn more about it in our blogpost in collaboration with mixedbread.ai: https://huggingface.co/blog/embedding-quantization Or try out our demo where we use quantized embeddings to let you search all of Wikipedia (yes, 41,000,000 texts) in 1 second on a CPU Space: https://huggingface.co/spaces/sentence-transformers/quantized-retrieval
new activity about 1 year ago
microsoft/Phi-3-mini-128k-instruct:Irrelvant response
new activity about 1 year ago
microsoft/Phi-3-mini-128k-instruct:Irrelvant response
View all activity

Organizations

None yet

models 0

None public yet

datasets 0

None public yet
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs