LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token Paper • 2501.03895 • Published 12 days ago • 48
jp1924/DevelopmentandDataofLLMswithEnhancedKoreanLanguagePerformance Viewer • Updated Dec 12, 2024 • 70.2k • 65
jp1924/DevelopmentandDataofLLMswithEnhancedKoreanLanguagePerformance Viewer • Updated Dec 12, 2024 • 70.2k • 65
jp1924/KoreanVisionDataforImageDescriptionSentenceExtractionandGeneration Viewer • Updated Dec 5, 2024 • 2.68M • 31 • 2
jp1924/KoreanVisionDataforImageDescriptionSentenceExtractionandGeneration Viewer • Updated Dec 5, 2024 • 2.68M • 31 • 2