-
nvidia/Llama-3_3-Nemotron-Super-49B-v1
Text Generation • 50B • Updated • 69.4k • • 303 -
nvidia/Llama-3.1-Nemotron-Nano-8B-v1
Text Generation • 8B • Updated • 340k • • 188 -
google/gemma-3-1b-it
Text Generation • 1.0B • Updated • 2.22M • 495 -
microsoft/Phi-4-multimodal-instruct
Automatic Speech Recognition • 6B • Updated • 551k • 1.44k
Gain.Energy
company
Verified
AI & ML interests
At Gain Energy, we are committed to harnessing the power of Artificial Intelligence (AI) and Machine Learning (ML) to revolutionize the oil and gas industry. Our focus spans a wide range of AI and ML applications aimed at enhancing efficiency, safety, and sustainability.
Sparse Mixture of Experts datasets for mathematical reasoning and complex calculations.
-
On Domain-Specific Post-Training for Multimodal Large Language Models
Paper • 2411.19930 • Published • 30 -
START: Self-taught Reasoner with Tools
Paper • 2503.04625 • Published • 114 -
Union of Experts: Adapting Hierarchical Routing to Equivalently Decomposed Transformer
Paper • 2503.02495 • Published • 8 -
Fine-Tuning Small Language Models for Domain-Specific AI: An Edge AI Perspective
Paper • 2503.01933 • Published • 12
-
Xkev/Llama-3.2V-11B-cot
Image-Text-to-Text • 11B • Updated • 5.55k • 153 -
meta-llama/Llama-3.2-3B-Instruct
Text Generation • 3B • Updated • 1.45M • • 1.56k -
microsoft/Phi-3.5-mini-instruct
Text Generation • 4B • Updated • 258k • • 878 -
meta-llama/Llama-3.2-11B-Vision-Instruct
Image-Text-to-Text • 11B • Updated • 527k • • 1.47k
-
Stream of Search (SoS): Learning to Search in Language
Paper • 2404.03683 • Published • 32 -
Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization
Paper • 2411.10442 • Published • 80 -
Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions
Paper • 2411.14405 • Published • 62 -
Hymba: A Hybrid-head Architecture for Small Language Models
Paper • 2411.13676 • Published • 46
-
nvidia/Llama-3_3-Nemotron-Super-49B-v1
Text Generation • 50B • Updated • 69.4k • • 303 -
nvidia/Llama-3.1-Nemotron-Nano-8B-v1
Text Generation • 8B • Updated • 340k • • 188 -
google/gemma-3-1b-it
Text Generation • 1.0B • Updated • 2.22M • 495 -
microsoft/Phi-4-multimodal-instruct
Automatic Speech Recognition • 6B • Updated • 551k • 1.44k
Sparse Mixture of Experts datasets for mathematical reasoning and complex calculations.
-
On Domain-Specific Post-Training for Multimodal Large Language Models
Paper • 2411.19930 • Published • 30 -
START: Self-taught Reasoner with Tools
Paper • 2503.04625 • Published • 114 -
Union of Experts: Adapting Hierarchical Routing to Equivalently Decomposed Transformer
Paper • 2503.02495 • Published • 8 -
Fine-Tuning Small Language Models for Domain-Specific AI: An Edge AI Perspective
Paper • 2503.01933 • Published • 12
-
Xkev/Llama-3.2V-11B-cot
Image-Text-to-Text • 11B • Updated • 5.55k • 153 -
meta-llama/Llama-3.2-3B-Instruct
Text Generation • 3B • Updated • 1.45M • • 1.56k -
microsoft/Phi-3.5-mini-instruct
Text Generation • 4B • Updated • 258k • • 878 -
meta-llama/Llama-3.2-11B-Vision-Instruct
Image-Text-to-Text • 11B • Updated • 527k • • 1.47k
-
Stream of Search (SoS): Learning to Search in Language
Paper • 2404.03683 • Published • 32 -
Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization
Paper • 2411.10442 • Published • 80 -
Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions
Paper • 2411.14405 • Published • 62 -
Hymba: A Hybrid-head Architecture for Small Language Models
Paper • 2411.13676 • Published • 46