File size: 732 Bytes
6bd59f7
df490c1
 
 
 
6bd59f7
df490c1
6bd59f7
df490c1
6bd59f7
df490c1
 
 
6bd59f7
 
df490c1
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
---
title: Shopping MMLU Leaderboard
emoji: 🌎
colorFrom: blue
colorTo: green
sdk: gradio
sdk_version: 4.44.1
app_file: app.py
pinned: true
license: apache-2.0
tags:
- leaderboard
short_description: 'Massive Multi-Task LLM Benchmark for Online Shopping'
---


In this leaderboard, we display evaluation results obtained with Shopping MMLU. The space provides an overall leaderboard, consisting of 4 main online shopping skills:
- Shopping Concept Understanding
- Shopping Knowledge Reasoning
- User Behavior Alignment
- Multi-lingual Abilities  

Github: https://github.com/KL4805/ShoppingMMLU
Report: https://arxiv.org/abs/2410.20745

Please consider to cite the report if the resource is useful to your research:

```BibTex

```