fenglinliu commited on
Commit
2437deb
·
verified ·
1 Parent(s): fe7f326

Update meta_data.py

Browse files
Files changed (1) hide show
  1. meta_data.py +1 -111
meta_data.py CHANGED
@@ -60,114 +60,4 @@ LEADERBOARD_MD['MAIN'] = f"""
60
 
61
  - Metrics:
62
  - Accuracy, ROUGE-L, and F1.
63
- """
64
-
65
-
66
-
67
- LEADERBOARD_MD['Shopping Concept Understanding'] = """
68
- ## Shopping Concept Understanding Evaluation Results
69
-
70
- Online shopping concepts such as brands and product models are domain-specific and not often seen in pre-training. Moreover, they often appear in short texts (e.g. queries, attribute-value pairs) and thus no sufficient contexts are given to help understand them. Hence, failing to understand these concepts compromises the performance of LLMs on downstream tasks.
71
-
72
- The included sub-skills and tasks include:
73
- - **Concept Normalization**:
74
- - Product Category Synonym
75
- - Attribute Value Synonym
76
- - **Elaboration**:
77
- - Attribute Explanation
78
- - Product Category Explanation
79
- - **Relational Inference**:
80
- - Applicable Attribute to Product Category
81
- - Applicable Product Category to Attribute
82
- - Inapplicable Attributes
83
- - Valid Attribute Value Given Attribute and Product Category
84
- - Valid Attribute Given Attribute Value and Product Category
85
- - Product Category Classification
86
- - Product Category Generation
87
- - **Sentiment Analysis**:
88
- - Aspect-based Sentiment Classification
89
- - Aspect-based Review Retrieval
90
- - Aspect-based Review Selection
91
- - Aspect-based Reviews Overall Sentiment Classification
92
- - **Information Extraction**:
93
- - Attribute Value Extraction
94
- - Query Named Entity Recognition
95
- - Aspect-based Review Keyphrase Selection
96
- - Aspect-based Review Keyphrase Extraction
97
- - **Summarization**:
98
- - Attribute Naming from Decription
99
- - Product Category Naming from Description
100
- - Review Aspect Retrieval
101
- - Single Conversation Topic Selection
102
- - Multi-Conversation Topic Retrieval
103
- - Product Keyphrase Selection
104
- - Product Keyphrase Retrieval
105
- - Product Title Generation
106
- """
107
-
108
-
109
- LEADERBOARD_MD['Shopping Knowledge Reasoning'] = """
110
- ## Shopping Knowledge Reasoning Evaluation Results
111
-
112
- This skill focuses on understanding and applying various implicit knowledge to perform reasoning over products and their attributes. For example, calculations such as the total volume of a product pack require numeric reasoning, and finding compatible products requires multi-hop reasoning among various products over a product knowledge graph.
113
-
114
- The included sub-skills and tasks include:
115
- - **Numeric Reasoning**:
116
- - Unit Conversation
117
- - Product Numeric Reasoning
118
- - **Commonsense Reasoning**
119
- - **Implicit Multi-Hop Reasoning**:
120
- - Product Compatibility
121
- - Complementary Product Categories
122
- - Implicit Attribute Reasoning
123
- - Related Brands Selection
124
- - Related Brands Retrieval
125
- """
126
-
127
- LEADERBOARD_MD['User Behavior Alignment'] = """
128
- ## User Behavior Alignment Evaluation Results
129
-
130
- Accurately modeling user behaviors is a crucial skill in online shopping. A large variety of user behaviors exist in online shopping, including queries, clicks, add-to-carts, purchases, etc. Moreover, these behaviors are generally implicit and not expressed in text.
131
-
132
- Consequently, LLMs trained with general texts encounter challenges in aligning with the heterogeneous and implicit user behaviors as they rarely observe such inputs during pre-training.
133
-
134
- The included sub-skills and tasks include:
135
- - **Query-Query Relations**:
136
- - Query Re-Writing
137
- - Query-Query Intention Selection
138
- - Intention-Based Related Query Retrieval
139
- - **Query-Product Relations**:
140
- - Product Category Selection for Query
141
- - Query-Product Relation Selection
142
- - Query-Product Ranking
143
- - **Sessions**:
144
- - Session-based Query Recommendation
145
- - Session-based Next Query Selection
146
- - Session-based Next Product Selection
147
- - **Purchases**:
148
- - Product Co-Purchase Selection
149
- - Product Co-Purchase Retrieval
150
- - **Reviews and QA**:
151
- - Review Rating Prediction
152
- - Aspect-Sentiment-Based Review Generation
153
- - Review Helpfulness Selection
154
- - Product-Based Question Answering
155
- """
156
-
157
- LEADERBOARD_MD['Multi-lingual Abilities'] = """
158
- ## Multi-lingual Abilities Evaluation Results
159
-
160
- Multi-lingual models are desired in online shopping as they can be deployed in multiple marketplaces without re-training.
161
-
162
- The included sub-skills and tasks include:
163
- - **Multi-lingual Shopping Concept Understanding**:
164
- - Multi-lingual Product Title Generation
165
- - Multi-lingual Product Keyphrase Selection
166
- - Cross-lingual Product Title Translation
167
- - Cross-lingual Product Entity Alignment
168
- - **Multi-lingual User Behavior Alignment**:
169
- - Multi-lingual Query-product Relation Selection
170
- - Multi-lingual Query-product Ranking
171
- - Multi-lingual Session-based Product Recommendation
172
- """
173
-
 
60
 
61
  - Metrics:
62
  - Accuracy, ROUGE-L, and F1.
63
+ """