VarsaGupta/NLP_Based_Chatbot

Model Card: NLP-Based Chatbot

Overview

The NLP-Based Chatbot is designed to explore Science & Technology topics. It utilizes a combination of semantic search and summarization techniques to provide relevant and concise responses to user queries.

Model Details

Model Name: NLP-Based Chatbot
Model Type: Natural Language Processing (NLP) Chatbot
Framework: Gradio Blocks Interface, spaCy, Transformers

Components

Semantic Search

The chatbot employs semantic search to retrieve relevant information from a preprocessed dataset (Dronealexa.csv). The search is based on a TF-IDF vectorizer and cosine similarity calculations.

Summarization

A summarization pipeline is used to generate concise summaries of the retrieved information. The Hugging Face Transformers library is utilized for summarization tasks.

Custom Embeddings

The model incorporates custom text embeddings using spaCy and pre-trained word embeddings. These embeddings enhance the understanding of user queries and contribute to the semantic search.

Gradio Blocks Interface

The chatbot's frontend is built using Gradio Blocks Interface, providing an interactive and user-friendly platform for users to input queries and receive responses.

Model Card Generation

The model card generation involves constructing prompts based on search results and utilizing a summarization pipeline to produce model card content.

Intended Use

The NLP-Based Chatbot is intended for users interested in exploring Science & Technology topics. It can be used to obtain information from the provided dataset, and users are encouraged to provide feedback for continuous improvement.

Training Data

The model is trained on a custom dataset (Dronealexa.csv) containing Science & Technology-related information. The dataset has been preprocessed to handle missing values and ensure efficient semantic search.

Evaluation Metrics

Semantic Search: TF-IDF Vectorizer, Cosine Similarity
Summarization: Hugging Face Transformers Pipeline

Ethical Considerations

The chatbot aims to provide accurate and relevant information. However, users are advised to critically evaluate the responses and understand that the model's knowledge is based on the training data.

Usage Instructions

Input your query in the provided textbox.
Click the "Send" button to receive a response.
Optionally, submit feedback using the "Submit Feedback" button.

License

This model is released under the Apache 2.0 License.

Contact Information

For inquiries or issues, please contact [email protected].