Evaluation datasets

community

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

hynky updated a dataset 4 days ago

lighteval/treb_table_retrieval

hynky updated a dataset 5 days ago

lighteval/squad_v2

hynky published a dataset 5 days ago

lighteval/squad_v2

View all activity

hynky

updated a dataset 4 days ago

lighteval/treb_table_retrieval

Viewer • Updated 4 days ago • 500 • 202

hynky

updated a dataset 5 days ago

lighteval/squad_v2

Viewer • Updated 5 days ago • 142k • 59

hynky

published a dataset 5 days ago

lighteval/squad_v2

Viewer • Updated 5 days ago • 142k • 59

hynky

updated a dataset 5 days ago

lighteval/wikitablequestions

Viewer • Updated 5 days ago • 18.5k • 190

hynky

published 2 datasets 5 days ago

lighteval/wikitablequestions

Viewer • Updated 5 days ago • 18.5k • 190

lighteval/treb_table_retrieval

Viewer • Updated 4 days ago • 500 • 202

albertvillanova

posted an update 15 days ago

Post

434

🚀 New in smolagents v1.20.0: Remote Python Execution via WebAssembly (Wasm)

We've just merged a major new capability into the smolagents framework: the CodeAgent can now execute Python code remotely in a secure, sandboxed WebAssembly environment!

🔧 Powered by Pyodide and Deno, this new WasmExecutor lets your agent-generated Python code run safely: without relying on Docker or local execution.

Why this matters:
✅ Isolated execution = no host access
✅ No need for Python on the user's machine
✅ Safer evaluation of arbitrary code
✅ Compatible with serverless / edge agent workloads
✅ Ideal for constrained or untrusted environments

This is just the beginning: a focused initial implementation with known limitations. A solid MVP designed for secure, sandboxed use cases. 💡

💡 We're inviting the open-source community to help evolve this executor:
• Tackle more advanced Python features
• Expand compatibility
• Add test coverage
• Shape the next-gen secure agent runtime

🔗 Check out the PR: https://github.com/huggingface/smolagents/pull/1261

Let's reimagine what agent-driven Python execution can look like: remote-first, wasm-secure, and community-built.

This feature is live in smolagents v1.20.0!
Try it out.
Break things. Extend it. Give us feedback.
Let's build safer, smarter agents; together 🧠⚙️

👉 https://github.com/huggingface/smolagents/releases/tag/v1.20.0

#smolagents #WebAssembly #Python #AIagents #Pyodide #Deno #OpenSource #HuggingFace #AgenticAI