MusicAI (MusicAI)

asigalov61

posted an update about 1 month ago

Post

2272

Check out new symbolic music AI front end and CLI training app

https://webchatappai.github.io/midi-gen/

https://github.com/WebChatAppAi/Orpheus-Midi-Model-Maker

@Timzoid @Csplk @not-lain @victor @bartowski @John6666

Nymbo

posted an update about 1 month ago

Post

2350

Anyone know how to reset Claude web's MCP config? I connected mine when the HF MCP first released with just the default example spaces added. I added lots of other MCP spaces but Claude.ai doesn't update the available tools... "Disconnecting" the HF integration does nothing, deleting it and adding it again does nothing.

Refreshing tools works fine in VS Code because I can manually restart it in mcp.json, but claude.ai has no such option. Anyone got any ideas?

3 replies

·

asigalov61

posted an update about 1 month ago

Post

462

Hey guys!

I wanted to invite all of you who are interested in symbolic music AI to check out my Orpheus Music Transformer

IMHO the model turned out very well and it plays very well too.

I would really appreciate any feedback and likes. It helps a lot.

Here are the links for your convenience:

1) Orpheus Music Transformer main demo space asigalov61/Orpheus-Music-Transformer

2) Orpheus Music Transformer Collection asigalov61/orpheus-music-transformer-685c3c8e59ed1414c02bb8cd

3) Orpheus Music Transformer Models Repo asigalov61/Orpheus-Music-Transformer

I hope you will enjoy it :)

Sincerely,

Alex

1024m

authored a paper 2 months ago

Uncovering Cultural Representation Disparities in Vision-Language Models

Paper • 2505.14729 • Published May 20 • 1

Felguk

posted an update 3 months ago

Post

2143

Where gone streamlit in huggingface?

3 replies

·

Nymbo

posted an update 3 months ago

Post

3840

Haven't seen this posted anywhere - Llama-3.3-8B-Instruct is available on the new Llama API. Is this a new model or did someone mislabel Llama-3.1-8B?

1 reply

·

Nymbo

posted an update 3 months ago

Post

2752

PSA for anyone using Nymbo/Nymbo_Theme or Nymbo/Nymbo_Theme_5 in a Gradio space ~

Both of these themes have been updated to fix some of the long-standing inconsistencies ever since the transition to Gradio v5. Textboxes are no longer bright green and in-line code is readable now! Both themes are now visually identical across versions.

If your space is already using one of these themes, you just need to restart your space to get the latest version. No code changes needed.

1024m

authored 3 papers 4 months ago

Robust and Fine-Grained Detection of AI Generated Texts

Paper • 2504.11952 • Published Apr 16 • 12

Improving Multilingual Capabilities with Cultural and Local Knowledge in Large Language Models While Enhancing Native Performance

Paper • 2504.09753 • Published Apr 13 • 5

Kaleidoscope: In-language Exams for Massively Multilingual Vision Evaluation

Paper • 2504.07072 • Published Apr 9 • 9

not-lain

posted an update 5 months ago

Post

4245

🚀AraClip is now fully integrated with Hugging Face 🤗

AraClip is a specialized CLIP model that was created by @pain and optimized for Arabic text-image retrieval tasks🔥

🔗 Try it out 🔗
🤖 model: Arabic-Clip/araclip
🧩 Gradio demo: Arabic-Clip/Araclip-Simplified
🌐 website: https://arabic-clip.github.io/Arabic-CLIP/

2 replies

·

not-lain

posted an update 6 months ago

Post

4524

I have just released a new blogpost about kv caching and its role in inference speedup 🚀
🔗 https://huggingface.co/blog/not-lain/kv-caching/
some takeaways :

4 replies

·

not-lain

posted an update 7 months ago

Post

1790

we now have more than 2000 public AI models using ModelHubMixin🤗

not-lain

posted an update 7 months ago

Post

4136

Published a new blogpost 📖
In this blogpost I have gone through the transformers' architecture emphasizing how shapes propagate throughout each layer.
🔗 https://huggingface.co/blog/not-lain/tensor-dims
some interesting takeaways :

Sri-Vigneshwar-DJ

posted an update 7 months ago

Post

835

Checkout phi-4 from Microsoft, dropped a day ago... If you ❤️ the Phi series, then here is the GGUF - Sri-Vigneshwar-DJ/phi-4-GGUF. phi-4 is a 14B highly efficient open LLM that beats much larger models at math and reasoning - check out evaluations on the Open LLM.

Technical paper - https://arxiv.org/pdf/2412.08905 ; The Data Synthesis approach is interesting

Sri-Vigneshwar-DJ

posted an update 7 months ago

Post

2104

Just sharing a thought: I started using DeepSeek V3 a lot, and an idea struck me about agents "orchestrating during inference" on a test-time compute model like DeepSeek V3 or the O1 series.

Agents (Instruction + Function Calls + Memory) execute during inference, and based on the output decision, a decision is made to scale the time to reason or perform other tasks.

Sri-Vigneshwar-DJ

posted an update 7 months ago

Post

2364

Combining smolagents with Anthropic’s best practices simplifies building powerful AI agents:

1. Code-Based Agents: Write actions as Python code, reducing steps by 30%.
2. Prompt Chaining: Break tasks into sequential subtasks with validation gates.
3. Routing: Classify inputs and direct them to specialized handlers.
4. Fallback: Handle tasks even if classification fails.

https://huggingface.co/blog/Sri-Vigneshwar-DJ/building-effective-agents-with-anthropics-best-pra

not-lain

posted an update 9 months ago

Post

2449

ever wondered how you can make an API call to a visual-question-answering model without sending an image url 👀

you can do that by converting your local image to base64 and sending it to the API.

recently I made some changes to my library "loadimg" that allows you to make converting images to base64 a breeze.
🔗 https://github.com/not-lain/loadimg

API request example 🛠️:

from loadimg import load_img
from huggingface_hub import InferenceClient

# or load a local image
my_b64_img = load_img(imgPath_url_pillow_or_numpy ,output_type="base64" ) 

client = InferenceClient(api_key="hf_xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx")

messages = [
	{
		"role": "user",
		"content": [
			{
				"type": "text",
				"text": "Describe this image in one sentence."
			},
			{
				"type": "image_url",
				"image_url": {
					"url": my_b64_img # base64 allows using images without uploading them to the web
				}
			}
		]
	}
]

stream = client.chat.completions.create(
    model="meta-llama/Llama-3.2-11B-Vision-Instruct", 
	messages=messages, 
	max_tokens=500,
	stream=True
)

for chunk in stream:
    print(chunk.choices[0].delta.content, end="")

1024m

authored 2 papers 10 months ago

RKadiyala at SemEval-2024 Task 8: Black-Box Word-Level Text Boundary Detection in Partially Machine Generated Texts

Paper • 2410.16659 • Published Oct 22, 2024

Large Language Models for Cross-lingual Emotion Detection

Paper • 2410.15974 • Published Oct 21, 2024 • 1

AI & ML interests

Team members 108

MusicAI's activity