Upload 3 files
- app.py +12 -5
- generated_image.jpg +0 -0
- real_image.jpg +0 -0
app.py
CHANGED
@@ -27,11 +27,10 @@ def main():
     st.markdown(
         """
         <h3 style='text-align: center;'>Jewellery Font Type Detection - Proposed Solution Demo</h3>
-
-        <p>The challenge is to identify the correct order ID from a list by using both the OCR detected text and font type of custom-made jewelry.</p>
+

         <h4>Proposed Solution</h4>
-        <p>
+        <p>One solution involves bolstering an OCR engine with a custom-trained CNN for font type classification. In this demo, I have trained two custom CNNs to classify 21 font types using a synthetic dataset of 4000 images for each font, generated using NumPy, PIL, and OpenCV. The dataset consists of text images rendered with different fonts, utilizing variations in font size and positioning to create diversity. However, training an accurate custom CNN for the given problem requires thousands of images due to the similar nature of the font types used in custom jewelry.</p>

         <p>There are two potential solutions to overcome this challenge:</p>

@@ -41,6 +40,7 @@ def main():
         <h5>Solution 2</h5>
         <p>Use Photoshop batch actions to create thousands of realistic images.</p>

+        <p>Alternately, use a feature matching algorithm, as implemented in the FLANN matching tab.</p>
         """, unsafe_allow_html=True
     )

@@ -50,7 +50,7 @@ def main():
         st.image('otsu.PNG', use_column_width=True)

     with col3:
-        st.markdown("""<p>
+        st.markdown("""<p>Otsu thresholding can pre-process images to a level similar to the synthetic dataset. See the image:</p>
         <p>Otsu's method assumes that the image contains two distinct intensity distributions, corresponding to the foreground and background regions.
         It calculates the threshold that minimizes the intra-class variance or maximizes the inter-class variance.
         By choosing the threshold that maximizes the inter-class variance, Otsu's thresholding effectively separates the two classes, resulting in a binary image.</p> """, unsafe_allow_html=True)
@@ -68,7 +68,14 @@ def main():
         By using RANSAC to estimate the homography between the matched keypoints, we can eliminate outliers and improve the accuracy of the registration process.</P>
         """, unsafe_allow_html=True)

-
+    col5, col6 = st.columns(2)
+    with col5:
+        st.image('real_image.jpg', caption='A sample image with Otsu thresholding applied', width=100)
+
+    with col6:
+        st.image('generated_image.jpg', caption='A generated image', width=100)
+
+    colab_link = '[<img src="https://colab.research.google.com/assets/colab-badge.svg">](https://colab.research.google.com/drive/11aO-QNRl2qMK0tgJ03RvcRLUuUSPvWMc?usp=sharing)'
     st.markdown(colab_link, unsafe_allow_html=True)


generated_image.jpg
ADDED
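generated_image.jpg is one of the synthetic renders that the new "Proposed Solution" paragraph in app.py describes. The commit does not include the generation script itself, so the snippet below is only a minimal sketch of how such a dataset could be produced with PIL, NumPy, and OpenCV; the font files, sample text, canvas size, and output folder are all illustrative assumptions.

# Hypothetical sketch of the synthetic font dataset described in the new app.py text.
# Font files, sample text, canvas size, and output paths are assumptions; the actual
# generation script is not part of this commit.
import os
import random

import cv2
import numpy as np
from PIL import Image, ImageDraw, ImageFont

FONT_FILES = ["fonts/font_01.ttf", "fonts/font_02.ttf"]  # stand-ins for the 21 fonts
SAMPLES_PER_FONT = 4000
CANVAS = (128, 64)  # (width, height), assumed

def render_sample(font_path, text="Amelia"):
    """Render one text image with a random font size and position."""
    img = Image.new("L", CANVAS, color=255)      # white background, grayscale
    draw = ImageDraw.Draw(img)
    size = random.randint(20, 40)                # vary the font size
    font = ImageFont.truetype(font_path, size)
    x, y = random.randint(0, 20), random.randint(0, 15)  # vary the position
    draw.text((x, y), text, fill=0, font=font)   # black text
    arr = np.array(img)
    # Binarise with Otsu so the synthetic images resemble the pre-processed photos
    _, arr = cv2.threshold(arr, 0, 255, cv2.THRESH_BINARY + cv2.THRESH_OTSU)
    return arr

if __name__ == "__main__":
    os.makedirs("dataset", exist_ok=True)
    for font_path in FONT_FILES:
        name = os.path.splitext(os.path.basename(font_path))[0]
        for i in range(SAMPLES_PER_FONT):
            cv2.imwrite(f"dataset/{name}_{i}.png", render_sample(font_path))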
real_image.jpg
ADDED
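real_image.jpg is captioned in the new app.py code as "A sample image with Otsu thresholding applied". The Otsu step described in the col3 text can be reproduced with OpenCV in a few lines; the input filename below is an assumption, since only the already-thresholded result is committed.

# Minimal sketch of the Otsu pre-processing described in the col3 markdown.
# 'jewellery_photo.jpg' is a hypothetical input image name.
import cv2

img = cv2.imread("jewellery_photo.jpg", cv2.IMREAD_GRAYSCALE)
# Passing 0 as the threshold and adding THRESH_OTSU lets OpenCV choose the value
# that maximises the inter-class variance (equivalently, minimises the intra-class
# variance) between foreground and background, yielding a binary image.
otsu_value, binary = cv2.threshold(img, 0, 255, cv2.THRESH_BINARY + cv2.THRESH_OTSU)
print("Otsu threshold:", otsu_value)
cv2.imwrite("real_image.jpg", binary)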
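The "FLANN matching tab" and the RANSAC paragraph in app.py refer to matching code that is not part of this commit (only the explanatory text and the Colab badge are). The following is a hedged sketch of that pipeline, assuming SIFT features, a 0.7 ratio test, and the two committed images as stand-in inputs.

# Hedged sketch of FLANN matching followed by RANSAC homography estimation, as
# described in the RANSAC paragraph of app.py. SIFT features, the ratio-test
# threshold, and the input images are assumptions; the actual matching code lives
# in the app's FLANN matching tab and the linked Colab notebook.
import cv2
import numpy as np

query = cv2.imread("real_image.jpg", cv2.IMREAD_GRAYSCALE)          # photo after Otsu
template = cv2.imread("generated_image.jpg", cv2.IMREAD_GRAYSCALE)  # synthetic render

sift = cv2.SIFT_create()
kp1, des1 = sift.detectAndCompute(query, None)
kp2, des2 = sift.detectAndCompute(template, None)

# FLANN matcher with a KD-tree index (algorithm=1) for float SIFT descriptors
flann = cv2.FlannBasedMatcher(dict(algorithm=1, trees=5), dict(checks=50))
matches = flann.knnMatch(des1, des2, k=2)

# Lowe's ratio test keeps only distinctive matches
good = [m for m, n in matches if m.distance < 0.7 * n.distance]

if len(good) >= 4:  # findHomography needs at least 4 correspondences
    src = np.float32([kp1[m.queryIdx].pt for m in good]).reshape(-1, 1, 2)
    dst = np.float32([kp2[m.trainIdx].pt for m in good]).reshape(-1, 1, 2)
    # RANSAC rejects outlier correspondences while fitting the homography,
    # which is what "eliminate outliers" refers to in the app text.
    H, inlier_mask = cv2.findHomography(src, dst, cv2.RANSAC, 5.0)
    print(f"{int(inlier_mask.sum())} of {len(good)} matches kept as inliers")

A full font-identification pass would presumably repeat this against one template per font and rank fonts by inlier count, but that ranking logic is not shown in this commit.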