streamlit pandas nltk torch transformers docx2txt python-docx