NER-Luxury / README.md
AkimfromParis's picture
Update README.md
5e543cf verified
|
raw
history blame
10.3 kB
---
extra_gated_prompt: >-
You agree to not use the model to conduct experiments that cause harm to human
subjects.
extra_gated_fields:
Name: text
AI-Lab/Company: text
Email: text
I agree to use this model for academic research (non-commercial use ONLY): checkbox
license: bigscience-openrail-m
language: en
base_model: xlm-roberta-base
tags:
- NER
- token-classification
- Fashion
- Luxury
library_name: transformers
model-index:
- name: AkimfromParis/Bert-Luxury
results:
- task:
type: token-classification
name: Token Classification
dataset:
name: Private
type: private
metrics:
- name: Loss
type: Loss
value: 0.4079
verified: true
- name: Precision
type: Precision
value: 0.7652
verified: true
- name: Recall
type: Recall
value: 0.8033
verified: true
- name: F1
type: F1
value: 0.7838
verified: true
- name: Accuracy
type: Accuracy
value: 0.9403
verified: true
pipeline_tag: token-classification
widget:
- text: >-
According to Bloomberg, the market cap of LVMH surpassed $500 billion
becoming the first European company to reach that milestone. As of July
2023, Hermès has a market cap of $213.80 Billion, bigger than Nike at
$161.80 Billion.
example_title: Finance
- text: >-
During Milan Fashion Week, Raf Simons and Miuccia Prada showcased their
latest Prada collection at the Fondazione Prada in Milano.
example_title: Fashion
- text: >-
On 3 April 2023, L'Oréal acquired for $2.5 Billion the cosmetic label Aēsop
from Australia. And on 26 June 2023, the French luxury group Kering
acquired 100% of the perfume house, Creed from a fund of BlackRock
example_title: Beauty
- text: >-
French house Hermès and British department store Selfridges are leaving the
Fashion Pact after the appointment of CEO Helena Helmersson from Swedish
fast-fashion company H&M as the new co-chair
example_title: Sustainability
---
# ***NER-Luxury***
# A fine-tuned XLM-Roberta model for NER in the fashion and luxury industry
## . Goal
- **NER-Luxury** is a fine-tuned XLM-Roberta model for the subtask N.E.R (Named Entity Recognition) in English. **NER-Luxury** is **domain-specific for the fashion and luxury industry** with bespoke labels. **NER-Luxury** is trying to be a bridge between the **aesthetic side** and the **quantitative side** of the fashion and luxury industry.
- As a downstream task, **NER-Luxury** is able to identify major fashion houses, artistic directors, fragrances, models, or influential artists on the website of a fashion magazine. And **NER-Luxury** is also able to identify companies, listed groups, executives, financial analysts, and investment companies inside a 200-page quarterly financial report.
- The goal of **NER-Luxury** is to create a clear **hierarchical classification** of luxury houses, fine watchmakers, beauty brands, sportswear labels, and fast fashion brands with respect of temporality, context, and sustainability. **NER-Luxury** is trying to solve the **"entity disambiguation"** between the founder, his eponymous label, the company designation, the names of products, and the intellectual property rights for corporate lawyers, M&A bankers, and financial analysts.
For example, the disambiguation of **Louis Vuitton**:
- The visionary founder, **Louis Vuitton** (1821-1892)
- The luxury house, **Louis Vuitton**
- The giant luxury group **LVMH Moët Hennessy Louis Vuitton SE**
- The collection with Japanese artist, **Louis Vuitton x Yayoi Kusama**
## . NER bespoke labels
**Entities are evolving according to temporality, and context.**
**Label** | **Description and example**
- | -
**O** | **Outside** (of a text segment)
**Date** | **Temporal expressions** (1854, Q2 2023, Nineties, September 21)
**Location** | **Physical location and area** (Paris, Japan, Europe, Champs-Elysées)
**Event** | **Critical events** (WW II, Olympics, IPO, Covid pandemic, Paris Fashion Week)
**MonetaryValue** | **Currency, price, sales, revenue** ($2.65 billion, 4.6 million euros, CHF 400,000, etc.)
**House** | **Fashion and luxury houses** (Louis Vuitton, Cartier, Gucci, Chanel)
**Brand** | **Sportswear, beauty and labels** (Nike, Lululemon, Clinique)
**FastFashion** | **Mass-market retailers** (Zara, H&M, Uniqlo, Shein)
**PrivateCompany** | **Unlisted companies** (Chanel SA, Stella McCartney Ltd, Valentino S.p.A)
**ListedGroup** | **Listed groups** (LVMH, Hermès International SCA, Kering)
**HoldingTrust** | **Holding and family office** (Agache, H51, Mousse Partners, Artèmis)
**InvestmentFirm** | **Investment banks, PE funds, M&A firms** (KKR, L Catterton, Mayhoola, Bernstein)
**MediaPublisher** | **Media outlets** (Bloomberg, Vogue, Business of Fashion, NYT)
**Hospitality** | **Luxury hospitality** (Ritz Paris, Belmond hotel Cipriani,Venetian Macao)
**MuseumGallery** | **Exhibition spaces** (Louvre, MET, Victoria & Albert, Pinault Collection)
**Retailer** | **POS, department stores, and select shops** (Bergdorf, Le Bon Marché, Takashimaya)
**Education** | **Business and fashion schools** (Polytechnic, Harvard, LSE, ESCP, Central Saint Martins, IFM)
**Organization** | **Legal, scientific, and cultural entities** (CFDA, European Union, UNESCO, SEC)
**ArtisticDirector** | **Lead creative of houses** (Karl Lagerfeld, Daniel Lee, Sarah Burton, Alessandro Michele)
**Executive** | **C-level, board members** (Jérôme Lambert, Sue Nabi, Pietro Beccari)
**Founder** | **Founder, creative, and owner** (Ralph Lauren, Rei Kawakubo, Michael Kors)
**Chairperson** | **Chairman/Chairwoman** (Bernard Arnault, Patrizio Bertelli, François-Henri Pinault)
**AnalystBanker** | **Equity analysts, M&A bankers** (Luca Solca, Pierre Mallevays, Louise Singlehurst)
**KOL** | **Artists, celebrities, historical figures** (Audrey Hepburn, BTS, Kanye West, Emma Watson)
**AthleteTeam** | **Professional athletes and teams** (David Beckham, Maria Sharapova, Luna Rossa, Scuderia Ferrari)
**Model** | **Fashion models** (Iman, Kate Moss, Adriana Lima, Naomi Campbell, Mariacarla Boscono)
**CreativeInsider** | **Photographers, make-up artists, watchmakers** (Steven Meisel, Dominique Ropion, Gérald Genta)
**EditorJournalist** | **Editor-in-chief, fashion editors, journalists** (Suzy Menkes, Anna Wintour, Carine Roitfeld)
**GarmCollection** | **Iconic garment and collections** (Haute Couture, Bar suit, No.13 of McQueen, Green Jungle Dress)
**Cosmetic** | **Cosmetic products** (Tilbury Glow palette, Crème de La Mer, YSL Nu, Viva Glam)
**Fragrance** | **Perfumes and EdT** (Chanel No.5, Dior Sauvage, Terre d'Hermès, Tom Ford Black Orchid)
**BagTrvlGoods** | **Bags, handbags, and leather goods** (Hermès Birkin bag, Louis Vuitton Speedy bag, Chanel 2.55)
**Jewelry** | **Fine jewellery, and gems** (Alhambra of Van Cleef & Arpels, Juste un Clou Cartier, The Winston Blue)
**Timepiece** | **Fine watches** (Nautilus Patek Philippe, Reverso Jaeger-Lecoultre, Rolex Oyster)
**Footwear** | **High heels to sneakers** (Rainbow of Ferragamo, Armadillo of McQueen, Air Force1)
**WineSpirit** | **Wine and spirit** (Château d'Yquem, Clos de Tart, Château Matras, Hennessy, Moet, Belvedere)
**Sustainability** | **Relevant ESG factors and entities** (Ethical Fashion Initiative, decoupling, biodiversity loss)
**CulturalArtifact** | **Songs, books, movies** (The Devil wears Prada, American Gigolo, Poker Face, The College Dropout)
***Paper address and cite information: https://arxiv.org/abs/2409.15804***
### Citation info
```
@misc{mousterou2024nerluxurynamedentityrecognition,
title={NER-Luxury: Named entity recognition for the fashion and luxury domain},
author={Akim Mousterou},
year={2024},
eprint={2409.15804},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2409.15804},
}
```
## How to use NER-Luxury with HuggingFace?
#### Load NER-Luxury and its sub-word tokenizer :
```python
from transformers import AutoTokenizer, AutoModelForTokenClassification
from transformers import pipeline
tokenizer = AutoTokenizer.from_pretrained("AkimfromParis/NER-Luxury")
model = AutoModelForTokenClassification.from_pretrained("AkimfromParis/NER-Luxury")
nlp = pipeline("ner", model=model, tokenizer=tokenizer, aggregation_strategy="simple")
example = "CEO Leena Nair dismisses IPO rumours for Chanel."
ner_results = nlp(example)
print(ner_results)
```
# NER-Luxury
This model is a fine-tuned version of [xlm-roberta-base](https://huggingface.co/xlm-roberta-base) on the None dataset.
It achieves the following results on the evaluation set:
- Loss: 0.4079
- Precision: 0.7652
- Recall: 0.8033
- F1: 0.7838
- Accuracy: 0.9403
## Model description
More information needed
## Intended uses & limitations
More information needed
## Training and evaluation data
More information needed
## Training procedure
### Training hyperparameters
The following hyperparameters were used during training:
- learning_rate: 2e-05
- train_batch_size: 32
- eval_batch_size: 32
- seed: 42
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: linear
- num_epochs: 8
- mixed_precision_training: Native AMP
### Training results
| Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1 | Accuracy |
|:-------------:|:-----:|:----:|:---------------:|:---------:|:------:|:------:|:--------:|
| 1.1269 | 1.0 | 1155 | 0.6237 | 0.6085 | 0.6716 | 0.6385 | 0.9005 |
| 0.5871 | 2.0 | 2310 | 0.4933 | 0.6857 | 0.7367 | 0.7103 | 0.9208 |
| 0.4517 | 3.0 | 3465 | 0.4470 | 0.7115 | 0.7639 | 0.7368 | 0.9273 |
| 0.3692 | 4.0 | 4620 | 0.4271 | 0.7298 | 0.7797 | 0.7539 | 0.9322 |
| 0.3121 | 5.0 | 5775 | 0.4103 | 0.7422 | 0.7906 | 0.7656 | 0.9362 |
| 0.2726 | 6.0 | 6930 | 0.4109 | 0.7531 | 0.7940 | 0.7730 | 0.9381 |
| 0.2138 | 7.0 | 8085 | 0.4088 | 0.7632 | 0.8005 | 0.7814 | 0.9397 |
| 0.1962 | 8.0 | 9240 | 0.4079 | 0.7652 | 0.8033 | 0.7838 | 0.9403 |
### Framework versions
- Transformers 4.44.2
- Pytorch 2.4.0+cu121
- Datasets 2.21.0
- Tokenizers 0.19.1