streamlit transformers sentence-transformers faiss-cpu PyMuPDF python-docx beautifulsoup4 requests langdetect