parthesh111
/

layoutlmv3-finetune-bioes-new

@@ -24,17 +24,16 @@ This model is a fine-tuned version of `microsoft/layoutlmv3-base` designed for t
 ### Model Description
 * **Developed by:** Parthesh Ingale
-* **Funded by \[optional]:** Academic Research
-* **Shared by \[optional]:** [parthesh111](https://huggingface.co/parthesh111)
 * **Model type:** Token Classification (NER)
 * **Language(s) (NLP):** English
 * **License:** Apache-2.0
-* **Finetuned from model \[optional]:** `microsoft/layoutlmv3-base`
-### Model Sources \[optional]
 * **Repository:** [https://huggingface.co/parthesh111/layoutlmv3-finetune-bioes-new](https://huggingface.co/parthesh111/layoutlmv3-finetune-bioes-new)
-* **Paper \[optional]:** N/A
 ## Uses
@@ -43,7 +42,7 @@ This model is a fine-tuned version of `microsoft/layoutlmv3-base` designed for t
 * Extract named entities from medical lab reports (scanned images).
 * Automate structured data extraction from semi-structured medical documents.
-### Downstream Use \[optional]
 * Preprocessing step in EHR (Electronic Health Records).
 * PII-aware document processing.
@@ -81,7 +80,7 @@ import numpy as np
 import os
 from huggingface_hub import login
-# Login to Hugging Face using environment variable token
 HF_TOKEN = os.environ.get("HF_TOKEN")
 if not HF_TOKEN:
     st.error("Hugging Face token not found. Please set 'HF_TOKEN' as an environment variable.")
@@ -320,7 +319,7 @@ st.markdown("""
 ### Training Procedure
-#### Preprocessing \[optional]
 * Images were preprocessed using PaddleOCR.
 * Bounding boxes normalized to 1000-scale.
@@ -330,10 +329,10 @@ st.markdown("""
 * **Training regime:** fp16 mixed precision
 * **Epochs:** 20
-* **Batch size:** 8
 * **Learning rate:** 5e-5
-#### Speeds, Sizes, Times \[optional]
 * **Checkpoint size:** \~435 MB
 * **Training time:** \~2 hours on RTX 3060
@@ -367,7 +366,7 @@ LayoutLMv3 with token classification head using OCR input (image, text, and layo
 * PyTorch, Hugging Face Transformers, PaddleOCR, Streamlit
-## Citation \[optional]
 **BibTeX:**
@@ -379,18 +378,10 @@ LayoutLMv3 with token classification head using OCR input (image, text, and layo
   howpublished = {\url{https://huggingface.co/parthesh111/layoutlmv3-finetune-bioes-new}},
 }
 ```
-## Glossary \[optional]
 * **BIOES:** Beginning, Inside, Outside, End, Single tagging scheme used for NER.
-## More Information \[optional]
-For demo, Streamlit app, or usage questions, contact below.
-## Model Card Authors \[optional]
-* Parthesh Ingale
 ## Model Card Contact
 * **GitHub/HF:** [parthesh111](https://huggingface.co/parthesh111)

 ### Model Description
 * **Developed by:** Parthesh Ingale
+* **Shared by:** [parthesh111](https://huggingface.co/parthesh111)
 * **Model type:** Token Classification (NER)
 * **Language(s) (NLP):** English
 * **License:** Apache-2.0
+* **Finetuned from model:** `microsoft/layoutlmv3-base`
+### Model Sources
 * **Repository:** [https://huggingface.co/parthesh111/layoutlmv3-finetune-bioes-new](https://huggingface.co/parthesh111/layoutlmv3-finetune-bioes-new)
+* **Paper:** N/A
 ## Uses
 * Extract named entities from medical lab reports (scanned images).
 * Automate structured data extraction from semi-structured medical documents.
+### Downstream Use
 * Preprocessing step in EHR (Electronic Health Records).
 * PII-aware document processing.
 import os
 from huggingface_hub import login
+# Login to Hugging Face using the environment variable token
 HF_TOKEN = os.environ.get("HF_TOKEN")
 if not HF_TOKEN:
     st.error("Hugging Face token not found. Please set 'HF_TOKEN' as an environment variable.")
 ### Training Procedure
+#### Preprocessing
 * Images were preprocessed using PaddleOCR.
 * Bounding boxes normalized to 1000-scale.
 * **Training regime:** fp16 mixed precision
 * **Epochs:** 20
+* **Batch size:** 1
 * **Learning rate:** 5e-5
+#### Speeds, Sizes, Times
 * **Checkpoint size:** \~435 MB
 * **Training time:** \~2 hours on RTX 3060
 * PyTorch, Hugging Face Transformers, PaddleOCR, Streamlit
+## Citation
 **BibTeX:**
   howpublished = {\url{https://huggingface.co/parthesh111/layoutlmv3-finetune-bioes-new}},
 }
 ```
+## Glossary
 * **BIOES:** Beginning, Inside, Outside, End, Single tagging scheme used for NER.
 ## Model Card Contact
 * **GitHub/HF:** [parthesh111](https://huggingface.co/parthesh111)