DatologyAI

Team

company

Verified

https://www.datologyai.com

datologyai

AI & ML interests

None defined yet.

Recent Activity

sjoshi804-datologyai authored a paper 3 days ago

Understanding the Robustness of Multi-modal Contrastive Learning to Distribution Shift

sjoshi804-datologyai authored a paper 3 days ago

Which Features are Learnt by Contrastive Learning? On the Role of Simplicity Bias in Class Collapse and Feature Suppression

sjoshi804-datologyai authored a paper 3 days ago

Investigating the Benefits of Projection Head for Representation Learning

View all activity

sjoshi804-datologyai

authored 8 papers 3 days ago

Understanding the Robustness of Multi-modal Contrastive Learning to Distribution Shift

Paper • 2310.04971 • Published Oct 8, 2023

Which Features are Learnt by Contrastive Learning? On the Role of Simplicity Bias in Class Collapse and Feature Suppression

Paper • 2305.16536 • Published May 25, 2023

Investigating the Benefits of Projection Head for Representation Learning

Paper • 2403.11391 • Published Mar 18, 2024

Data-Efficient Contrastive Language-Image Pretraining: Prioritizing Data Quality over Quantity

Paper • 2403.12267 • Published Mar 18, 2024

MM-GEN: Enhancing Task Performance Through Targeted Multimodal Data Curation

Paper • 2501.04155 • Published Jan 7, 2025

BeyondWeb: Lessons from Scaling Synthetic Data for Trillion-scale Pretraining

Paper • 2508.10975 • Published Aug 14, 2025 • 60

Luxical: High-Speed Lexical-Dense Text Embeddings

Paper • 2512.09015 • Published Dec 9, 2025

DatBench: Discriminative, Faithful, and Efficient VLM Evaluations

Paper • 2601.02316 • Published 5 days ago • 9

rads101

updated 2 datasets 4 days ago

DatologyAI/DatBench-Full

Viewer • Updated 4 days ago • 195k • 176 • 18

DatologyAI/DatBench

Viewer • Updated 4 days ago • 43.5k • 189 • 40

mleavitt

published 2 datasets 4 days ago

DatologyAI/DatBench-Full

Viewer • Updated 4 days ago • 195k • 176 • 18

DatologyAI/DatBench

Viewer • Updated 4 days ago • 43.5k • 189 • 40

pratyushmaini

authored 5 papers 5 months ago

Model-tuning Via Prompts Makes NLP Models Adversarially Robust

Paper • 2303.07320 • Published Mar 13, 2023

Scaling Laws for Data Filtering -- Data Curation cannot be Compute Agnostic

Paper • 2404.07177 • Published Apr 10, 2024 • 1

Rethinking LLM Memorization through the Lens of Adversarial Compression

Paper • 2404.15146 • Published Apr 23, 2024

OpenUnlearning: Accelerating LLM Unlearning via Unified Benchmarking of Methods and Metrics

Paper • 2506.12618 • Published Jun 14, 2025

BeyondWeb: Lessons from Scaling Synthetic Data for Trillion-scale Pretraining

Paper • 2508.10975 • Published Aug 14, 2025 • 60

ricardomonti08

authored a paper 5 months ago

BeyondWeb: Lessons from Scaling Synthetic Data for Trillion-scale Pretraining

Paper • 2508.10975 • Published Aug 14, 2025 • 60

pratyushmaini

authored a paper over 1 year ago

Understanding Hallucinations in Diffusion Models through Mode Interpolation

Paper • 2406.09358 • Published Jun 13, 2024 • 5

mleavitt

authored a paper over 1 year ago

Perplexed by Perplexity: Perplexity-Based Data Pruning With Small Reference Models

Paper • 2405.20541 • Published May 30, 2024 • 24