Stalin16
's Collections
Inference
updated
The Impact of Hyperparameters on Large Language Model Inference
Performance: An Evaluation of vLLM and HuggingFace Pipelines
Paper
•
2408.01050
•
Published
•
8
Scaling LLM Test-Time Compute Optimally can be More Effective than
Scaling Model Parameters
Paper
•
2408.03314
•
Published
•
51
Towards a Unified View of Preference Learning for Large Language Models:
A Survey
Paper
•
2409.02795
•
Published
•
71
Paper Copilot: A Self-Evolving and Efficient LLM System for Personalized
Academic Assistance
Paper
•
2409.04593
•
Published
•
23
From MOOC to MAIC: Reshaping Online Teaching and Learning through
LLM-driven Agents
Paper
•
2409.03512
•
Published
•
26
Political DEBATE: Efficient Zero-shot and Few-shot Classifiers for
Political Text
Paper
•
2409.02078
•
Published
•
9
Mini-Omni: Language Models Can Hear, Talk While Thinking in Streaming
Paper
•
2408.16725
•
Published
•
52
TextBoost: Towards One-Shot Personalization of Text-to-Image Models via
Fine-tuning Text Encoder
Paper
•
2409.08248
•
Published
•
13
GroUSE: A Benchmark to Evaluate Evaluators in Grounded Question
Answering
Paper
•
2409.06595
•
Published
•
37
To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic
reasoning
Paper
•
2409.12183
•
Published
•
36
Preference Tuning with Human Feedback on Language, Speech, and Vision
Tasks: A Survey
Paper
•
2409.11564
•
Published
•
19
Enhancing Structured-Data Retrieval with GraphRAG: Soccer Data Case
Study
Paper
•
2409.17580
•
Published
•
7
Law of the Weakest Link: Cross Capabilities of Large Language Models
Paper
•
2409.19951
•
Published
•
53
Illustrious: an Open Advanced Illustration Model
Paper
•
2409.19946
•
Published
•
13
Ruler: A Model-Agnostic Method to Control Generated Length for Large
Language Models
Paper
•
2409.18943
•
Published
•
27
SLM: Bridge the thin gap between speech and text foundation models
Paper
•
2310.00230
•
Published
ComfyGen: Prompt-Adaptive Workflows for Text-to-Image Generation
Paper
•
2410.01731
•
Published
•
16
OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models
Paper
•
2411.04905
•
Published
•
111
Agent-as-a-Judge: Evaluate Agents with Agents
Paper
•
2410.10934
•
Published
•
18
LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large
Multimodal Models
Paper
•
2410.09732
•
Published
•
54
Analyzing The Language of Visual Tokens
Paper
•
2411.05001
•
Published
•
22
LiveXiv -- A Multi-Modal Live Benchmark Based on Arxiv Papers Content
Paper
•
2410.10783
•
Published
•
26
Intriguing Properties of Large Language and Vision Models
Paper
•
2410.04751
•
Published
•
16
Everything Everywhere All at Once: LLMs can In-Context Learn Multiple
Tasks in Superposition
Paper
•
2410.05603
•
Published
•
11
Large Language Models Orchestrating Structured Reasoning Achieve Kaggle
Grandmaster Level
Paper
•
2411.03562
•
Published
•
63
"Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM
Quantization
Paper
•
2411.02355
•
Published
•
46
Survey of Cultural Awareness in Language Models: Text and Beyond
Paper
•
2411.00860
•
Published
•
23
Multi-expert Prompting Improves Reliability, Safety, and Usefulness of
Large Language Models
Paper
•
2411.00492
•
Published
•
6
Personalization of Large Language Models: A Survey
Paper
•
2411.00027
•
Published
•
31
Survey of User Interface Design and Interaction Techniques in Generative
AI Applications
Paper
•
2410.22370
•
Published
•
11
Navigating the Unknown: A Chat-Based Collaborative Interface for
Personalized Exploratory Tasks
Paper
•
2410.24032
•
Published
•
9
ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM
Inference
Paper
•
2410.21465
•
Published
•
11
Document Parsing Unveiled: Techniques, Challenges, and Prospects for
Structured Information Extraction
Paper
•
2410.21169
•
Published
•
30
Are LLMs Better than Reported? Detecting Label Errors and Mitigating
Their Effect on Model Performance
Paper
•
2410.18889
•
Published
•
15
Counting Ability of Large Language Models and Impact of Tokenization
Paper
•
2410.19730
•
Published
•
10
Can Knowledge Editing Really Correct Hallucinations?
Paper
•
2410.16251
•
Published
•
54
Looking Inward: Language Models Can Learn About Themselves by
Introspection
Paper
•
2410.13787
•
Published
•
6
JudgeBench: A Benchmark for Evaluating LLM-based Judges
Paper
•
2410.12784
•
Published
•
42
WorldCuisines: A Massive-Scale Benchmark for Multilingual and
Multicultural Visual Question Answering on Global Cuisines
Paper
•
2410.12705
•
Published
•
29
A Comparative Study on Reasoning Patterns of OpenAI's o1 Model
Paper
•
2410.13639
•
Published
•
16
Remember, Retrieve and Generate: Understanding Infinite Visual Concepts
as Your Personalized Assistant
Paper
•
2410.13360
•
Published
•
8
The Curse of Multi-Modalities: Evaluating Hallucinations of Large
Multimodal Models across Language, Visual, and Audio
Paper
•
2410.12787
•
Published
•
30
DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse
Synthetic Data and Global-to-Local Adaptive Perception
Paper
•
2410.12628
•
Published
•
29
Needle Threading: Can LLMs Follow Threads through Near-Million-Scale
Haystacks?
Paper
•
2411.05000
•
Published
•
21
Cut Your Losses in Large-Vocabulary Language Models
Paper
•
2411.09009
•
Published
•
43
BlueLM-V-3B: Algorithm and System Co-Design for Multimodal Large
Language Models on Mobile Devices
Paper
•
2411.10640
•
Published
•
44
Generative World Explorer
Paper
•
2411.11844
•
Published
•
75
VideoAutoArena: An Automated Arena for Evaluating Large Multimodal
Models in Video Analysis through User Simulation
Paper
•
2411.13281
•
Published
•
17
When Precision Meets Position: BFloat16 Breaks Down RoPE in Long-Context
Training
Paper
•
2411.13476
•
Published
•
15
Evaluating Tokenizer Performance of Large Language Models Across
Official Indian Languages
Paper
•
2411.12240
•
Published
•
6
SageAttention2 Technical Report: Accurate 4 Bit Attention for
Plug-and-play Inference Acceleration
Paper
•
2411.10958
•
Published
•
50
Personalized Multimodal Large Language Models: A Survey
Paper
•
2412.02142
•
Published
•
12
Towards Universal Soccer Video Understanding
Paper
•
2412.01820
•
Published
•
9
Truth or Mirage? Towards End-to-End Factuality Evaluation with LLM-OASIS
Paper
•
2411.19655
•
Published
•
20
Training Large Language Models to Reason in a Continuous Latent Space
Paper
•
2412.06769
•
Published
•
62
Evaluating Language Models as Synthetic Data Generators
Paper
•
2412.03679
•
Published
•
43
Paper
•
2412.04315
•
Published
•
16
Paper
•
2412.07724
•
Published
•
18