Spaces:

LunaticMaestro
/

you_might_speak

Sleeping

App Files Files Community

you_might_speak / README.md

Deepak Sahu

update readme

a207710 4 months ago

preview code

raw

history blame contribute delete

1.69 kB

	---
	title: You Might Speak
	emoji: 🌍
	colorFrom: pink
	colorTo: gray
	sdk: gradio
	sdk_version: 5.9.1
	app_file: app.py
	pinned: false
	short_description: I guess you might speak <Language>
	---

	# Language Guesser based on Name

	![preview](.resources/preview.png)

	## Data Source

	https://pytorch.org/tutorials/intermediate/char_rnn_classification_tutorial.html
	Last Accessed: 30th Dec 2024

	## Code Information

	The code is partially inspired by https://pytorch.org/tutorials/intermediate/char_rnn_classification_tutorial.html

	Changes I Introduced
	- NamesDataset is separated from transformation, useful for transformation during inference
	- target is made integer instead of one-hot encoding;
	- changed the loss from combination of LogSoftmax + NLLoss to CrossEntropy (EXACTLY THE SAME STUFF); which further required removing the softmax layer from the architecture.
	- DataLoader is added
	- Input made batch first > Corresponding RNN is also made batch first.

	## Evaluation
	Although the code is mostly replicated. However, I changed the dataloader to use apply lowercase transformation to data, and it confused the model.

	> Notice that the diagonal brightness for the without lowercase, which can be said that the actual class being arabic (for example) is guess as arabic. This is not the case with with lowercase.

	- Confusion matrix with with lowercase transformation
	![Click here to view the image](model/lowercase_evaluate.png)
	![Click here to view the image](model/lowercase_loss.png)

	- Confusion matrix without lowercase transformation
	![Click here to view the image](model/evaluate.png)
	![Click here to view the image](model/loss.png)