metadata
extra_gated_prompt: >-
You agree to not use the model to conduct experiments that cause harm to human
subjects.
extra_gated_fields:
Name: text
AI-Lab/Company: text
Email: text
I agree to use this model for academic research (non-commercial use ONLY): checkbox
license: bigscience-openrail-m
language: en
base_model: xlm-roberta-base
tags:
- NER
- token-classification
- Fashion
- Luxury
library_name: transformers
model-index:
- name: AkimfromParis/Bert-Luxury
results:
- task:
type: token-classification
name: Token Classification
dataset:
name: Private
type: private
metrics:
- name: Loss
type: Loss
value: 0.4079
verified: true
- name: Precision
type: Precision
value: 0.7652
verified: true
- name: Recall
type: Recall
value: 0.8033
verified: true
- name: F1
type: F1
value: 0.7838
verified: true
- name: Accuracy
type: Accuracy
value: 0.9403
verified: true
pipeline_tag: token-classification
widget:
- text: >-
According to Bloomberg, the market cap of LVMH surpassed $500 billion
becoming the first European company to reach that milestone. As of July
2023, Hermès has a market cap of $213.80 Billion, bigger than Nike at
$161.80 Billion.
example_title: Finance
- text: >-
During Milan Fashion Week, Raf Simons and Miuccia Prada showcased their
latest Prada collection at the Fondazione Prada in Milano.
example_title: Fashion
- text: >-
On 3 April 2023, L'Oréal acquired for $2.5 Billion the cosmetic label
Aēsop from Australia. And on 26 June 2023, the French luxury group Kering
acquired 100% of the perfume house, Creed from a fund of BlackRock
example_title: Beauty
- text: >-
French house Hermès and British department store Selfridges are leaving
the Fashion Pact after the appointment of CEO Helena Helmersson from
Swedish fast-fashion company H&M as the new co-chair
example_title: Sustainability
NER-Luxury
A fine-tuned XLM-Roberta model for NER in the fashion and luxury industry
. Goal
- NER-Luxury is a fine-tuned XLM-Roberta model for the subtask N.E.R (Named Entity Recognition) in English. NER-Luxury is domain-specific for the fashion and luxury industry with bespoke labels. NER-Luxury is trying to be a bridge between the aesthetic side and the quantitative side of the fashion and luxury industry.
- As a downstream task, NER-Luxury is able to identify major fashion houses, artistic directors, fragrances, models, or influential artists on the website of a fashion magazine. And NER-Luxury is also able to identify companies, listed groups, executives, financial analysts, and investment companies inside a 200-page quarterly financial report.
- The goal of NER-Luxury is to create a clear hierarchical classification of luxury houses, fine watchmakers, beauty brands, sportswear labels, and fast fashion brands with respect of temporality, context, and sustainability. NER-Luxury is trying to solve the "entity disambiguation" between the founder, his eponymous label, the company designation, the names of products, and the intellectual property rights for corporate lawyers, M&A bankers, and financial analysts.
For example, the disambiguation of Louis Vuitton:
- The visionary founder, Louis Vuitton (1821-1892)
- The luxury house, Louis Vuitton
- The giant luxury group LVMH Moët Hennessy Louis Vuitton SE
- The collection with Japanese artist, Louis Vuitton x Yayoi Kusama
. NER bespoke labels
Entities are evolving according to temporality, and context.
Label | Description and example |
---|---|
O | Outside (of a text segment) |
Date | Temporal expressions (1854, Q2 2023, Nineties, September 21) |
Location | Physical location and area (Paris, Japan, Europe, Champs-Elysées) |
Event | Critical events (WW II, Olympics, IPO, Covid pandemic, Paris Fashion Week) |
MonetaryValue | Currency, price, sales, revenue ($2.65 billion, 4.6 million euros, CHF 400,000, etc.) |
House | Fashion and luxury houses (Louis Vuitton, Cartier, Gucci, Chanel) |
Brand | Sportswear, beauty and labels (Nike, Lululemon, Clinique) |
FastFashion | Mass-market retailers (Zara, H&M, Uniqlo, Shein) |
PrivateCompany | Unlisted companies (Chanel SA, Stella McCartney Ltd, Valentino S.p.A) |
ListedGroup | Listed groups (LVMH, Hermès International SCA, Kering) |
HoldingTrust | Holding and family office (Agache, H51, Mousse Partners, Artèmis) |
InvestmentFirm | Investment banks, PE funds, M&A firms (KKR, L Catterton, Mayhoola, Bernstein) |
MediaPublisher | Media outlets (Bloomberg, Vogue, Business of Fashion, NYT) |
Hospitality | Luxury hospitality (Ritz Paris, Belmond hotel Cipriani,Venetian Macao) |
MuseumGallery | Exhibition spaces (Louvre, MET, Victoria & Albert, Pinault Collection) |
Retailer | POS, department stores, and select shops (Bergdorf, Le Bon Marché, Takashimaya) |
Education | Business and fashion schools (Polytechnic, Harvard, LSE, ESCP, Central Saint Martins, IFM) |
Organization | Legal, scientific, and cultural entities (CFDA, European Union, UNESCO, SEC) |
ArtisticDirector | Lead creative of houses (Karl Lagerfeld, Daniel Lee, Sarah Burton, Alessandro Michele) |
Executive | C-level, board members (Jérôme Lambert, Sue Nabi, Pietro Beccari) |
Founder | Founder, creative, and owner (Ralph Lauren, Rei Kawakubo, Michael Kors) |
Chairperson | Chairman/Chairwoman (Bernard Arnault, Patrizio Bertelli, François-Henri Pinault) |
AnalystBanker | Equity analysts, M&A bankers (Luca Solca, Pierre Mallevays, Louise Singlehurst) |
KOL | Artists, celebrities, historical figures (Audrey Hepburn, BTS, Kanye West, Emma Watson) |
AthleteTeam | Professional athletes and teams (David Beckham, Maria Sharapova, Luna Rossa, Scuderia Ferrari) |
Model | Fashion models (Iman, Kate Moss, Adriana Lima, Naomi Campbell, Mariacarla Boscono) |
CreativeInsider | Photographers, make-up artists, watchmakers (Steven Meisel, Dominique Ropion, Gérald Genta) |
EditorJournalist | Editor-in-chief, fashion editors, journalists (Suzy Menkes, Anna Wintour, Carine Roitfeld) |
GarmCollection | Iconic garment and collections (Haute Couture, Bar suit, No.13 of McQueen, Green Jungle Dress) |
Cosmetic | Cosmetic products (Tilbury Glow palette, Crème de La Mer, YSL Nu, Viva Glam) |
Fragrance | Perfumes and EdT (Chanel No.5, Dior Sauvage, Terre d'Hermès, Tom Ford Black Orchid) |
BagTrvlGoods | Bags, handbags, and leather goods (Hermès Birkin bag, Louis Vuitton Speedy bag, Chanel 2.55) |
Jewelry | Fine jewellery, and gems (Alhambra of Van Cleef & Arpels, Juste un Clou Cartier, The Winston Blue) |
Timepiece | Fine watches (Nautilus Patek Philippe, Reverso Jaeger-Lecoultre, Rolex Oyster) |
Footwear | High heels to sneakers (Rainbow of Ferragamo, Armadillo of McQueen, Air Force1) |
WineSpirit | Wine and spirit (Château d'Yquem, Clos de Tart, Château Matras, Hennessy, Moet, Belvedere) |
Sustainability | Relevant ESG factors and entities (Ethical Fashion Initiative, decoupling, biodiversity loss) |
CulturalArtifact | Songs, books, movies (The Devil wears Prada, American Gigolo, Poker Face, The College Dropout) |
Paper address and cite information: https://arxiv.org/abs/2409.15804
Citation info
@misc{mousterou2024nerluxurynamedentityrecognition,
title={NER-Luxury: Named entity recognition for the fashion and luxury domain},
author={Akim Mousterou},
year={2024},
eprint={2409.15804},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2409.15804},
}
How to use NER-Luxury with HuggingFace?
Load NER-Luxury and its sub-word tokenizer :
from transformers import AutoTokenizer, AutoModelForTokenClassification
from transformers import pipeline
tokenizer = AutoTokenizer.from_pretrained("AkimfromParis/NER-Luxury")
model = AutoModelForTokenClassification.from_pretrained("AkimfromParis/NER-Luxury")
nlp = pipeline("ner", model=model, tokenizer=tokenizer, aggregation_strategy="simple")
example = "CEO Leena Nair dismisses IPO rumours for Chanel."
ner_results = nlp(example)
print(ner_results)
NER-Luxury
This model is a fine-tuned version of xlm-roberta-base on the None dataset. It achieves the following results on the evaluation set:
- Loss: 0.4079
- Precision: 0.7652
- Recall: 0.8033
- F1: 0.7838
- Accuracy: 0.9403
Model description
More information needed
Intended uses & limitations
More information needed
Training and evaluation data
More information needed
Training procedure
Training hyperparameters
The following hyperparameters were used during training:
- learning_rate: 2e-05
- train_batch_size: 32
- eval_batch_size: 32
- seed: 42
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: linear
- num_epochs: 8
- mixed_precision_training: Native AMP
Training results
Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1 | Accuracy |
---|---|---|---|---|---|---|---|
1.1269 | 1.0 | 1155 | 0.6237 | 0.6085 | 0.6716 | 0.6385 | 0.9005 |
0.5871 | 2.0 | 2310 | 0.4933 | 0.6857 | 0.7367 | 0.7103 | 0.9208 |
0.4517 | 3.0 | 3465 | 0.4470 | 0.7115 | 0.7639 | 0.7368 | 0.9273 |
0.3692 | 4.0 | 4620 | 0.4271 | 0.7298 | 0.7797 | 0.7539 | 0.9322 |
0.3121 | 5.0 | 5775 | 0.4103 | 0.7422 | 0.7906 | 0.7656 | 0.9362 |
0.2726 | 6.0 | 6930 | 0.4109 | 0.7531 | 0.7940 | 0.7730 | 0.9381 |
0.2138 | 7.0 | 8085 | 0.4088 | 0.7632 | 0.8005 | 0.7814 | 0.9397 |
0.1962 | 8.0 | 9240 | 0.4079 | 0.7652 | 0.8033 | 0.7838 | 0.9403 |
Framework versions
- Transformers 4.44.2
- Pytorch 2.4.0+cu121
- Datasets 2.21.0
- Tokenizers 0.19.1