-
nvidia/Llama-3_3-Nemotron-Super-49B-v1
Text Generation • 50B • Updated • 18.5k • • 319 -
nvidia/Llama-3.1-Nemotron-Nano-8B-v1
Text Generation • 8B • Updated • 205k • • 196 -
google/gemma-3-1b-it
Text Generation • 1.0B • Updated • 2.83M • 570 -
microsoft/Phi-4-multimodal-instruct
Automatic Speech Recognition • 6B • Updated • 414k • 1.47k
Gain.Energy
company
Verified
AI & ML interests
At Gain Energy, we are committed to harnessing the power of Artificial Intelligence (AI) and Machine Learning (ML) to revolutionize the oil and gas industry. Our focus spans a wide range of AI and ML applications aimed at enhancing efficiency, safety, and sustainability.
Sparse Mixture of Experts datasets for mathematical reasoning and complex calculations.
-
On Domain-Specific Post-Training for Multimodal Large Language Models
Paper • 2411.19930 • Published • 30 -
START: Self-taught Reasoner with Tools
Paper • 2503.04625 • Published • 114 -
Union of Experts: Adapting Hierarchical Routing to Equivalently Decomposed Transformer
Paper • 2503.02495 • Published • 8 -
Fine-Tuning Small Language Models for Domain-Specific AI: An Edge AI Perspective
Paper • 2503.01933 • Published • 12
-
Xkev/Llama-3.2V-11B-cot
Image-Text-to-Text • 11B • Updated • 4.63k • 153 -
meta-llama/Llama-3.2-3B-Instruct
Text Generation • 3B • Updated • 1.79M • • 1.66k -
microsoft/Phi-3.5-mini-instruct
Text Generation • 4B • Updated • 226k • 901 -
meta-llama/Llama-3.2-11B-Vision-Instruct
Image-Text-to-Text • 11B • Updated • 821k • • 1.5k
-
Stream of Search (SoS): Learning to Search in Language
Paper • 2404.03683 • Published • 32 -
Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization
Paper • 2411.10442 • Published • 87 -
Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions
Paper • 2411.14405 • Published • 62 -
Hymba: A Hybrid-head Architecture for Small Language Models
Paper • 2411.13676 • Published • 46
-
nvidia/Llama-3_3-Nemotron-Super-49B-v1
Text Generation • 50B • Updated • 18.5k • • 319 -
nvidia/Llama-3.1-Nemotron-Nano-8B-v1
Text Generation • 8B • Updated • 205k • • 196 -
google/gemma-3-1b-it
Text Generation • 1.0B • Updated • 2.83M • 570 -
microsoft/Phi-4-multimodal-instruct
Automatic Speech Recognition • 6B • Updated • 414k • 1.47k
Sparse Mixture of Experts datasets for mathematical reasoning and complex calculations.
-
On Domain-Specific Post-Training for Multimodal Large Language Models
Paper • 2411.19930 • Published • 30 -
START: Self-taught Reasoner with Tools
Paper • 2503.04625 • Published • 114 -
Union of Experts: Adapting Hierarchical Routing to Equivalently Decomposed Transformer
Paper • 2503.02495 • Published • 8 -
Fine-Tuning Small Language Models for Domain-Specific AI: An Edge AI Perspective
Paper • 2503.01933 • Published • 12
-
Xkev/Llama-3.2V-11B-cot
Image-Text-to-Text • 11B • Updated • 4.63k • 153 -
meta-llama/Llama-3.2-3B-Instruct
Text Generation • 3B • Updated • 1.79M • • 1.66k -
microsoft/Phi-3.5-mini-instruct
Text Generation • 4B • Updated • 226k • 901 -
meta-llama/Llama-3.2-11B-Vision-Instruct
Image-Text-to-Text • 11B • Updated • 821k • • 1.5k
-
Stream of Search (SoS): Learning to Search in Language
Paper • 2404.03683 • Published • 32 -
Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization
Paper • 2411.10442 • Published • 87 -
Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions
Paper • 2411.14405 • Published • 62 -
Hymba: A Hybrid-head Architecture for Small Language Models
Paper • 2411.13676 • Published • 46