Peaceful Data

non-profit

Activity Feed Request to join this org

AI & ML interests

Peacefully Open Source Post-Processing Speech and Language Resources Toward Research Community.

Recent Activity

JinchuanTian updated a dataset 13 days ago

PeacefulData/chi_me_3_4

JinchuanTian published a dataset 13 days ago

PeacefulData/chi_me_3_4

huckiyang published a model 26 days ago

PeacefulData/NeKo-v0-post-correction

View all activity

JinchuanTian

updated a dataset 13 days ago

PeacefulData/chi_me_3_4

Updated 13 days ago • 30

JinchuanTian

published a dataset 13 days ago

PeacefulData/chi_me_3_4

Updated 13 days ago • 30

huckiyang

published a model 26 days ago

PeacefulData/NeKo-v0-post-correction

Text Generation • 47B • Updated Jun 30, 2024 • 24

huckiyang

published a dataset about 1 month ago

PeacefulData/Neko-v1

Viewer • Updated Apr 1 • 593k • 14

sungfengh

updated a dataset 2 months ago

PeacefulData/SINE_v2

Viewer • Updated Jun 26 • 350k • 4

sungfengh

published a dataset 2 months ago

PeacefulData/SINE_v2

Viewer • Updated Jun 26 • 350k • 4

jaeyeonkim99

authored a paper 5 months ago

Efficient LLaMA-3.2-Vision by Trimming Cross-attended Visual Features

Paper • 2504.00557 • Published Apr 1 • 15

SreyanG-NVIDIA

authored a paper 6 months ago

Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities

Paper • 2503.03983 • Published Mar 6 • 25

rhachiuma

authored a paper 7 months ago

Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks

Paper • 2501.08326 • Published Jan 14 • 34

rhachiuma

authored a paper 9 months ago

VLsI: Verbalized Layers-to-Interactions from Large to Small Vision Language Models

Paper • 2412.01822 • Published Dec 2, 2024 • 15

ZhifengKong

authored a paper 11 months ago

Synthio: Augmenting Small-Scale Audio Classification Datasets with Synthetic Data

Paper • 2410.02056 • Published Oct 2, 2024 • 6

SreyanG-NVIDIA

authored a paper 11 months ago

Synthio: Augmenting Small-Scale Audio Classification Datasets with Synthetic Data

Paper • 2410.02056 • Published Oct 2, 2024 • 6

JinchuanTian

authored a paper about 1 year ago

Towards Robust Speech Representation Learning for Thousands of Languages

Paper • 2407.00837 • Published Jun 30, 2024 • 11

wanchichen

authored 6 papers about 1 year ago

ML-SUPERB: Multilingual Speech Universal PERformance Benchmark

Paper • 2305.10615 • Published May 18, 2023 • 1

Joint Prediction and Denoising for Large-scale Multilingual Self-supervised Learning

Paper • 2309.15317 • Published Sep 26, 2023

Reproducing Whisper-Style Training Using an Open-Source Toolkit and Publicly Available Data

Paper • 2309.13876 • Published Sep 25, 2023 • 1

Improving Massively Multilingual ASR With Auxiliary CTC Objectives

Paper • 2302.12829 • Published Feb 24, 2023

OWSM v3.1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer

Paper • 2401.16658 • Published Jan 30, 2024 • 14

YODAS: Youtube-Oriented Dataset for Audio and Speech

Paper • 2406.00899 • Published Jun 2, 2024 • 3

mingj

authored a paper over 1 year ago

A Survey on Graph Neural Networks for Time Series: Forecasting, Classification, Imputation, and Anomaly Detection

Paper • 2307.03759 • Published Jul 7, 2023 • 1