Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
Innovatix
Innovatix
Follow
0 followers
ยท
2 following
AI & ML interests
None yet
Recent Activity
replied
to
jasoncorkill
's
post
about 1 month ago
"Why did the bee get married?" "Because he found his honey!" This was the "funniest" joke out of 10'000 jokes we generated with LLMs. With 68% of respondents rating it as "funny". Original jokes are particularly hard for LLMs, as jokes are very nuanced and a lot of context is needed to understand if something is "funny". Something that can only reliably be measured using humans. LLMs are not equally good at generating jokes in every language. Generated English jokes turned out to be way funnier than the Japanese ones. 46% of English-speaking voters on average found the generated joke funny. The same statistic for other languages: Vietnamese: 44% Portuguese: 40% Arabic: 37% Japanese: 28% There is not much variance in generation quality among models for any fixed language. But still Claude Sonnet 4 slightly outperforms others in Vietnamese, Arabic and Japanese and Gemini 2.5 Flash in Portuguese and English We have release the 1 Million (!) native speaker ratings and the 10'000 jokes as a dataset for anyone to use: https://huggingface.co/datasets/Rapidata/multilingual-llm-jokes-4o-claude-gemini
reacted
to
singhsidhukuldeep
's
post
with ๐
9 months ago
Sorry judge, my lawyer hallucinated? ๐ If you get an AI lawyer, you would want it to be hallucination-free! New @Stanford-@Yale research reveals surprising findings about leading AI legal research tools. Here's what you need to know: >> Key Findings The study tested LexisNexis (Lexis+ AI), Thomson Reuters (Westlaw AI & Ask Practical Law AI), and GPT-4, finding hallucination rates between 17-33% despite claims of being "hallucination-free". >> Technical Deep Dive The research evaluated these tools using Retrieval-Augmented Generation (RAG) architecture, which operates in two crucial steps: 1. Retrieval System: - Uses neural text embeddings to capture semantic meaning - Employs both lexical and semantic search mechanisms - Implements document filtering and extraction - Retrieves relevant legal documents from vast databases 2. Generation Pipeline: - Processes retrieved documents alongside original queries - Synthesizes information from multiple legal sources - Generates responses based on retrieved context - Includes citation verification mechanisms >> Performance Breakdown: - Lexis+ AI: 65% accuracy rate - Westlaw AI: 42% accuracy rate - Ask Practical Law AI: Over 60% incomplete answers >> Why This Matters This research exposes critical vulnerabilities in AI legal tools that lawyers increasingly rely on. It's essential for legal professionals to understand these limitations when incorporating AI into their practice.
reacted
to
LukeNeumann
's
post
with ๐
9 months ago
Nine years ago, I uploaded the first 8K resolution video to YouTube and I've been stockpiling 8K footage ever since: https://www.youtube.com/watch?v=sLprVF6d7Ug&t Should @Overlaiapp release the first open-source 8K video dataset? Could anyone even fine tune a model with this?๐
View all activity
Organizations
spaces
1
Runtime error
AutoTrain Advanced
๐
models
0
None public yet
datasets
0
None public yet