Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Lee Gao's picture
27 85 1

Lee Gao

leegao19
BK-Lee's profile picture Gargaz's profile picture Ksgk-fy's profile picture
·

AI & ML interests

None yet

Organizations

Google's profile picture Social Post Explorers's profile picture

Collections 3

Lee's RoPE Tricks / Context Extension Reads
Set of Long Context (RoPE or otherwise) I'm collecting off of HF
  • LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens

    Paper • 2402.13753 • Published Feb 21, 2024 • 117
  • Data Engineering for Scaling Language Models to 128K Context

    Paper • 2402.10171 • Published Feb 15, 2024 • 26
  • LongAgent: Scaling Language Models to 128k Context through Multi-Agent Collaboration

    Paper • 2402.11550 • Published Feb 18, 2024 • 18
  • The What, Why, and How of Context Length Extension Techniques in Large Language Models -- A Detailed Survey

    Paper • 2401.07872 • Published Jan 15, 2024 • 2
Papers I Like
  • MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs

    Paper • 2402.15627 • Published Feb 23, 2024 • 39
  • Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models

    Paper • 2402.17177 • Published Feb 27, 2024 • 89
  • Beyond Language Models: Byte Models are Digital World Simulators

    Paper • 2402.19155 • Published Feb 29, 2024 • 54
  • Hydragen: High-Throughput LLM Inference with Shared Prefixes

    Paper • 2402.05099 • Published Feb 7, 2024 • 20

models 0

None public yet

datasets 0

None public yet
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs