Michael Bommarito PRO

mjbommar

https://linkedin.com/in/bommarito

AI & ML interests

NLP, image classification, audio classification, image synthesis

Recent Activity

updated a dataset 26 days ago

mjbommar/SHELF

updated a dataset 27 days ago

mjbommar/ogbert-v1-mlm

updated a dataset 27 days ago

mjbommar/opengloss-v1.1-drafting

authored a paper about 1 month ago

OpenGloss: A Synthetic Encyclopedic Dictionary and Semantic Knowledge Graph

Paper • 2511.18622 • Published Nov 23, 2025

authored 9 papers 9 months ago

LexGLUE: A Benchmark Dataset for Legal Language Understanding in English

Paper • 2110.00976 • Published Oct 3, 2021

Precise Legal Sentence Boundary Detection for Retrieval at Scale: NUPunkt and CharBoundary

Paper • 2504.04131 • Published Apr 5, 2025

KL3M Tokenizers: A Family of Domain-Specific and Character-Level Tokenizers for Legal, Financial, and Preprocessing Applications

Paper • 2503.17247 • Published Mar 21, 2025

The KL3M Data Project: Copyright-Clean Training Resources for Large Language Models

Paper • 2504.07854 • Published Apr 10, 2025

GPT Takes the Bar Exam

Paper • 2212.14402 • Published Dec 29, 2022

Natural Language Processing in the Legal Domain

Paper • 2302.12039 • Published Feb 23, 2023

GPT as Knowledge Worker: A Zero-Shot Evaluation of (AI)CPA Capabilities

Paper • 2301.04408 • Published Jan 11, 2023

Crowdsourcing accurately and robustly predicts Supreme Court decisions

Paper • 1712.03846 • Published Dec 11, 2017

A General Approach for Predicting the Behavior of the Supreme Court of the United States

Paper • 1612.03473 • Published Dec 11, 2016