Vectara

company

https://vectara.com

vectara

Activity Feed Request to join this org

AI & ML interests

retrieval augmented generation, grounded generation, large language models, LLMs, question answering, chatbot

Recent Activity

ofermend updated a model about 3 hours ago

vectara/hallucination_evaluation_model

stsui96 published a dataset 3 days ago

vectara/hhem_leaderboard_datasets

stsui96 updated a dataset 3 days ago

vectara/hhem_leaderboard_datasets

View all activity

ofermend

updated a model about 3 hours ago

vectara/hallucination_evaluation_model

Text Classification • 0.1B • Updated Jul 8 • 201k • 318

stsui96

published a dataset 3 days ago

vectara/hhem_leaderboard_datasets

Viewer • Updated 3 days ago • 1.97k • 11

stsui96

updated a dataset 3 days ago

vectara/hhem_leaderboard_datasets

Viewer • Updated 3 days ago • 1.97k • 11

nthakur

authored a paper 9 days ago

BrowseComp-Plus: A More Fair and Transparent Evaluation Benchmark of Deep-Research Agent

Paper • 2508.06600 • Published 13 days ago • 36

ahmed-d8k

updated a dataset 9 days ago

vectara/results

Preview • Updated 9 days ago • 364 • 1

ofermend

updated 9 Spaces about 1 month ago

HMC Demo

🐨

Ask questions about Harvard Management

Hacker News chat

🐨

chatbot with HN data using vectara-agentic

Justice Harvard

🐨

Teacher Assistant for Justice Harvard using vectara-agentic

SuperMicro Demo

🐨

Ask questions about Supermicro documents

UCSF Ortho Demo

🐨

Ask questions about UCSF Orthopedics

CFPB Assistant

🐨

CFPB Assistant using vectara-agentic

Legal Assistant

🐨

Legal Assistant using vectara-agentic

Finance assistant

🐨

Finance chatbot using vectara-agentic

EV Assistant

🐨

EV Assistant using vectara-agentic

forrest-vectara

updated a model about 1 month ago

vectara/hallucination_evaluation_model

Text Classification • 0.1B • Updated Jul 8 • 201k • 318

nthakur

authored 3 papers 3 months ago

FreshStack: Building Realistic Benchmarks for Evaluating Retrieval on Technical Documents

Paper • 2504.13128 • Published Apr 17 • 8

Chatbot Arena Meets Nuggets: Towards Explanations and Diagnostics in the Evaluation of LLM Responses

Paper • 2504.20006 • Published Apr 28

Fixing Data That Hurts Performance: Cascading LLMs to Relabel Hard Negatives for Robust Information Retrieval

Paper • 2505.16967 • Published May 22 • 24

clefourrier

posted an update 3 months ago

Post

1258

Always surprised that so few people actually read the FineTasks blog, on
✨how to select training evals with the highest signal✨

If you're serious about training models without wasting compute on shitty runs, you absolutely should read it!!

An high signal eval actually tells you precisely, during training, how wel & what your model is learning, allowing you to discard the bad runs/bad samplings/...!

The blog covers in depth prompt choice, metrics, dataset, across languages/capabilities, and my fave section is "which properties should evals have"👌
(to know on your use case how to select the best evals for you)

Blog: HuggingFaceFW/blogpost-fine-tasks

2 replies

ofermend

posted an update 4 months ago

Post

355

Excited to share open-rag-eval (https://github.com/vectara/open-rag-eval) a new open source project to help scale RAG evaluation. The key benefit: it does not require golden answers so much more scalable.
Would love any thoughts or feedback (or even better - if you want to contribute a PR that would be great).

AI & ML interests

Recent Activity

Team members 34

vectara's activity

HMC Demo

Hacker News chat

Justice Harvard

SuperMicro Demo

UCSF Ortho Demo

CFPB Assistant

Legal Assistant

Finance assistant

EV Assistant