Feynman Innovations's picture

Feynman Innovations

ajibawa-2023

·

AjinkyaBawase

AI & ML interests

LLM, RL, DL, ML, AGI. Developing LLMs (preferably fully fine tuned ) for various use cases.

Recent Activity

reacted to singhsidhukuldeep's post with 🔥 22 days ago

Exciting Research Alert: Revolutionizing Complex Information Retrieval! A groundbreaking paper from researchers at MIT, AWS AI, and UPenn introduces ARM (Alignment-Oriented LLM-based Retrieval Method), a novel approach to tackle complex information retrieval challenges. >> Key Innovations Information Alignment The method first decomposes queries into keywords and aligns them with available data using both BM25 and embedding similarity, ensuring comprehensive coverage of information needs. Structure Alignment ARM employs a sophisticated mixed-integer programming solver to identify connections between data objects, exploring relationships beyond simple semantic matching. Self-Verification The system includes a unique self-verification mechanism where the LLM evaluates and aggregates results from multiple retrieval paths, ensuring accuracy and completeness. >> Performance Highlights The results are impressive: - Outperforms standard RAG by up to 5.2 points in execution accuracy on Bird dataset - Achieves 19.3 points higher F1 scores compared to existing approaches on OTT-QA - Reduces the number of required LLM calls while maintaining superior retrieval quality >> Technical Implementation The system uses a three-step process: 1. N-gram indexing and embedding computation for all data objects 2. Constrained beam decoding for information alignment 3. Mixed-integer programming optimization for structure exploration This research represents a significant step forward in making complex information retrieval more efficient and accurate. The team's work demonstrates how combining traditional optimization techniques with modern LLM capabilities can solve challenging retrieval problems.

reacted to Tonic's post with 🔥 22 days ago

🙋🏻‍♂️hey there folks , Goedel's Theorem Prover is now being demo'ed on huggingface : https://huggingface.co/spaces/Tonic/Math give it a try !

reacted to hba123's post with 🔥 22 days ago

We developed a method that ensures almost-sure safety (i.e., safety with probability approaching 1). We proved this result. We then, present a practical implementation which we call InferenceGuard. InferenceGuard has impressive practical results: 91.04% on Alpaca-7B and 100% safety results on Beaver 7B-v3. Now, it is easy to get high safety results like those if we want a dumb model, e.g., just don't answer or answer with EOS and so on. However, our goal is not to only have safe results, but also to make sure that the rewards are high - we want a good trade-off between safety and rewards! That's exactly, what we show. InferenceGuard achieves that! Check it out: https://huggingface.co/papers/2502.01208

View all activity

Organizations

ajibawa-2023's activity

upvoted a collection 29 days ago

Llama 3.3

This collection hosts the transformers and original repos of the Llama 3.3 • 1 item • Updated Dec 6, 2024 • 136

upvoted 12 papers 2 months ago

AutoKaggle: A Multi-Agent Framework for Autonomous Data Science Competitions

Paper • 2410.20424 • Published Oct 27, 2024 • 40

Bel Esprit: Multi-Agent Framework for Building AI Model Pipelines

Paper • 2412.14684 • Published Dec 19, 2024 • 1

Generative World Explorer

Paper • 2411.11844 • Published Nov 18, 2024 • 76

PRefLexOR: Preference-based Recursive Language Modeling for Exploratory Optimization of Reasoning and Agentic Thinking

Paper • 2410.12375 • Published Oct 16, 2024 • 4

DynaSaur: Large Language Agents Beyond Predefined Actions

Paper • 2411.01747 • Published Nov 4, 2024 • 28

Generative Agent Simulations of 1,000 People

Paper • 2411.10109 • Published Nov 15, 2024 • 4

AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant

Paper • 2410.18603 • Published Oct 24, 2024 • 32

AgentInstruct: Toward Generative Teaching with Agentic Flows

Paper • 2407.03502 • Published Jul 3, 2024 • 50

MALT: Improving Reasoning with Multi-Agent LLM Training

Paper • 2412.01928 • Published Dec 2, 2024 • 42

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery

Paper • 2408.06292 • Published Aug 12, 2024 • 119

Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level

Paper • 2411.03562 • Published Nov 5, 2024 • 66

GUI Agents: A Survey

Paper • 2412.13501 • Published Dec 18, 2024 • 25

upvoted an article 4 months ago

Article

🇮🇹🇯🇵🇧🇷 Generating multilingual instruction datasets with Magpie 🐦‍⬛

By

•

Oct 21, 2024

• 19

upvoted 2 papers 5 months ago

Were RNNs All We Needed?

Paper • 2410.01201 • Published Oct 2, 2024 • 51

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models

Paper • 2409.17146 • Published Sep 25, 2024 • 108

upvoted 2 collections 5 months ago

Molmo

Artifacts for open multimodal language models. • 5 items • Updated 16 days ago • 296

Moshi v0.1 Release

MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 13 items • Updated Sep 18, 2024 • 227

upvoted 2 papers 6 months ago

MEDIC: Towards a Comprehensive Framework for Evaluating LLMs in Clinical Applications

Paper • 2409.07314 • Published Sep 11, 2024 • 54

τ-bench: A Benchmark for Tool-Agent-User Interaction in Real-World Domains

Paper • 2406.12045 • Published Jun 17, 2024 • 7