Hugging Face
Simran

simarora
  • simran-arora

AI & ML interests

None yet

Recent Activity

published a model 6 days ago
simarora/cylon-nocumsum-k16-v256-sep-wide-dtype32-p1b-t10b-re0
published a model 6 days ago
simarora/cylon-nocumsum-k8-v256-sep-wide-dtype32-p1b-t10b-re0
published a model 6 days ago
simarora/cylon-nocumsum-fd1024-v256-sep-wide-dtype32-p1b-t10b-re0

Organizations

  • Stanford NLP
  • HazyResearch
  • Together
  • Stanford University

Papers 11

arxiv:2506.06266
arxiv:2410.10254
arxiv:2407.05483
arxiv:2402.18668

Models 27

simarora/cylon-nocumsum-fd1024-v256-sep-wide-dtype32-p1b-t10b-re0

Updated 6 days ago

simarora/cylon-nocumsum-k8-v256-sep-wide-dtype32-p1b-t10b-re0

Updated 6 days ago

simarora/cylon-nocumsum-k16-v256-sep-wide-dtype32-p1b-t10b-re0

Updated 6 days ago

simarora/gdn-paper-120m-hdim512-lr0.0005-layers12-dmodel768-n2048-head12-numQK1-v02

Updated 17 days ago

simarora/gdn-paper-120m-hdim1024-lr_0.0005-layers_12-dmodel_768-n_2048-head_12-numQK_1-sim

Updated 17 days ago

simarora/gdn-paper-noswiglu-120m-h6-numQK1-rerun

Updated May 6

simarora/gdn-paper-noswiglu-120m-h12-numQK1-rerun

Updated May 6

simarora/bilinear-gdn-paper-initstd1-120m-h6-fd128-rerun

Updated May 6

simarora/bilinear-gdn-paper-initstd1-120m-h12-fd64-rerun

Updated May 6

simarora/bilinear-gdn-paper-initstd1-120m-h12-fd64

Updated May 6

Datasets 0

None public yet