Florian Grässle
holehan
AI & ML interests
None yet
Recent Activity
reacted
to
juhoinkinen's
post
with 😎
20 days ago
Annif is a subject indexing toolkit developed by the National Library of Finland: https://github.com/NatLibFi/Annif
Last November we organized a survey for Annif users, and now the results have been published: https://www.doria.fi/bitstream/handle/10024/190930/Annif%20Users%20Survey.pdf
The report includes an overview of:
• The vocabularies and datasets that are used with Annif
• The workflows that Annif is integrated with
• The problems Annif users are facing
The average ratings for various aspects and features of Annif given by users are shown. In short, in a scale from 1 to 5, the ratings are:
• Overall: 4.4
• Features and functions: 4.1
• Documentation: 4.5
• Smoothness of initial setup: 4.2
• Usability: 4.4
• Achieved quality of subject suggestions: 3.6
The survey also gathered user views on the improvements and new features, which are briefly discussed in the report.
reacted
to
juhoinkinen's
post
with 🚀
6 months ago
Annif 1.2 has been released!
https://github.com/NatLibFi/Annif/releases/tag/v1.2.0
This release introduces language detection capabilities in the REST API and CLI, improves 🤗 Hugging Face Hub integration, and also includes the usual maintenance work and minor bug fixes.
The new REST API endpoint `/v1/detect-language` expects POST requests that contain a JSON object with the text whose language is to be analyzed and a list of candidate languages. Similarly, the CLI has a new command `annif detect-language`. Annif projects are typically language specific, so a text of a given language needs to be processed with a project intended for that language; the language detection feature can help in this. For details see this [Wiki page](https://github.com/NatLibFi/Annif/wiki/Language-detection). The language detection is performed with the Simplemma library by [@adbar](https://github.com/adbar) et al.
The `annif download` command has a new `--trust-repo` option, which needs to be used if the repository to download from has not been used previously (that is if the repository does not appear in the local Hugging Face Hub cache). This option is introduced to raise awareness of the risks of downloading projects from the internet; the project downloads should only be done from trusted sources. For more information see the [Hugging Face Hub documentation](https://huggingface.co/docs/hub/en/security-pickle).
This release also includes automation of downloading the NLTK datapackage used for tokenization to simplify Annif installation. Maintenance tasks include upgrading dependencies, including a new version of Simplemma that allows better control over memory usage. The bug fixes include restoring the `--host` option of the `annif run` command.
Python 3.12 is now fully supported (previously NN-ensemble and STWFSA backends were not supported on Python 3.12).
https://huggingface.co/spaces/NatLibFi/Annif
liked
a Space
12 months ago
NatLibFi/Annif
Organizations
models
None public yet
datasets
None public yet