SmolLM3 pretraining datasets Collection Datasets used in SmolLM3 pretraining • 15 items • Updated 3 days ago • 27
GPT-OSS Harmful (4.2B to 20B) Collection Research-oriented GPT-OSS models with inverted safety patterns for red-teaming & adversarial analysis to understand failure modes and vulnerabilities. • 29 items • Updated 2 days ago • 1
GPT-OSS Instruction Following (4.2B to 20B) Collection GPT-OSS models optimized for precise instruction adherence and constraint satisfaction from Tulu3 Persona Instruction Following dataset. • 29 items • Updated 2 days ago • 2
GPT-OSS Safety (4.2B to 20B) Collection Safety-focused GPT-OSS models specialized in identifying and responding to harmful content while maintaining helpful capabilities from SORRY-Bench. • 29 items • Updated 2 days ago • 1
GPT-OSS Law (4.2B to 20B) Collection Legal domain GPT-OSS models excelling at legal reasoning, jurisprudence, and understanding legal frameworks from MMLU legal subjects. • 29 items • Updated 2 days ago • 1
GPT-OSS Pruned Experts (4.2B-20B) [IF, Science, Math, etc.] Collection Complete collection of domain-specialized GPT-OSS models (1-32 experts) optimized for science, math, medicine, law, safety, and instruction following. • 8 items • Updated 2 days ago • 4
GPT-OSS Health / Medicine (4.2B to 20B) Collection Medical domain GPT-OSS models specializing in clinical knowledge, anatomy, medical procedures, & health-related reasoning from MMLU medical subjects. • 29 items • Updated 2 days ago • 1
GPT-OSS Math (4.2B to 20B) Collection Mathematics-focused GPT-OSS models excelling at mathematical computation, proof strategies, and logical reasoning from MMLU mathematics subjects. • 29 items • Updated 2 days ago • 1
GPT-OSS Science (4.2B to 20B) Collection Specialized GPT-OSS models optimized for scientific reasoning tasks in physics, chemistry, biology, and STEM from GPQA and MMLU science domains. • 29 items • Updated 2 days ago • 1
GPT-OSS General (4.2B to 20B) Collection Collection of pruned GPT-OSS models spanning 1-32 experts, maintaining general capabilities across domains while reducing computational requirements. • 29 items • Updated 2 days ago • 4
Llama-3.1-FoundationAI-SecurityLLM-8B-Instruct Technical Report Paper • 2508.01059 • Published 14 days ago • 30
Article SmolLM3: smol, multilingual, long-context reasoner By loubnabnl and 22 others • Jul 8 • 625
Llama-3.1-FoundationAI-SecurityLLM-Base-8B Technical Report Paper • 2504.21039 • Published Apr 28 • 15
LlamaForTokenClassification Collection Fine-tuned Llama variants for token classification • 6 items • Updated Aug 8, 2024 • 3
Article Dynamic Topic Modeling with RedPajama: A New Approach to Hierarchical Content Understanding By AmanPriyanshu • Nov 23, 2024 • 1
Article Model2Vec: Distill a Small Fast Model from any Sentence Transformer By Pringled and 1 other • Oct 14, 2024 • 96
AI Governance and Accountability: An Analysis of Anthropic's Claude Paper • 2407.01557 • Published May 2, 2024 • 2
FRACTURED-SORRY-Bench: Framework for Revealing Attacks in Conversational Turns Undermining Refusal Efficacy and Defenses over SORRY-Bench Paper • 2408.16163 • Published Aug 28, 2024 • 1