Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Self-Optimizing Synthetic Data for Verified Code

non-profit
https://brando90.github.io/brandomiranda/
BrandoHablando
https://github.com/brando90
Activity Feed

AI & ML interests

None defined yet.

Brando Miranda's profile picture

brando 
authored 3 papers 9 months ago

Are Emergent Abilities of Large Language Models a Mirage?

Paper • 2304.15004 • Published Apr 28, 2023 • 7

ZIP-FIT: Embedding-Free Data Selection via Compression-Based Alignment

Paper • 2410.18194 • Published Oct 23, 2024 • 6

Pantograph: A Machine-to-Machine Interaction Interface for Advanced Theorem Proving, High Level Reasoning, and Data Extraction in Lean 4

Paper • 2410.16429 • Published Oct 21, 2024 • 5
brando 
authored a paper about 1 year ago

Why Has Predicting Downstream Capabilities of Frontier AI Models with Scale Remained Elusive?

Paper • 2406.04391 • Published Jun 6, 2024 • 9
brando 
authored a paper about 2 years ago

Beyond Scale: the Diversity Coefficient as a Data Quality Metric Demonstrates LLMs are Pre-trained on Formally Diverse Data

Paper • 2306.13840 • Published Jun 24, 2023 • 11
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs