Akihito Miyazaki's picture
7

Akihito Miyazaki PRO

Akjava

AI & ML interests

I'm developing a user-friendly, browser-based platform that allows users to connect various AI services like ChatGPT, Gemini, local LLMs, Hugging Face models, and more. Our goal is to empower users to build custom AI tools by seamlessly combining these services, similar to Langchain or ComfyUI.

Recent Activity

reacted to m-ric's post with ๐Ÿ‘ about 7 hours ago
We now have a Deep Research for academia: SurveyX automatically writes academic surveys nearly indistinguishable from human-written ones ๐Ÿ”ฅ Researchers from Beijing and Shanghai just published the first application of a deep research system to academia: their algorithm, given a question, can give you a survey of all papers on the subject. To make a research survey, you generally follow two steps, preparation (collect and organize papers) and writing (outline creation, writing, polishing). Researchers followed the same two steps and automated them. ๐ŸŽฏ For the preparation part, a key part is find all the important references on the given subject. Researchers first cast a wide net of all relevant papers. But then finding the really important ones is like distilling knowledge from a haystack of information. To solve this challenge, they built an โ€œAttributeTreeโ€ object that structures key information from citations. Ablating these AttributeTrees significantly decreased structure and synthesis scores, so they were really useful! ๐Ÿ“ For the writing part, key was to get a synthesis that's both short and true. This is not easy to get with LLMs! So they used methods like LLM-based deduplication to shorten the too verbose listings made by LLMs, and RAG to grab original quotes instead of made-up ones. As a result, their system outperforms previous approaches by far! As assessed by LLM-judges, the quality score os SurveyX even approaches this of human experts, with 4.59/5 vs 4.75/5 ๐Ÿ† I advise you to read the paper, it's a great overview of the kind of assistants that we'll get in the short future! ๐Ÿ‘‰ https://huggingface.co/papers/2502.14776 Their website shows examples of generated surveys ๐Ÿ‘‰ http://www.surveyx.cn/
published a Space 10 days ago
Akjava/open_Deep-Research-DuckDuckGo
View all activity

Organizations

None yet

Akjava's activity

reacted to m-ric's post with ๐Ÿ‘ about 7 hours ago
view post
Post
3844
We now have a Deep Research for academia: SurveyX automatically writes academic surveys nearly indistinguishable from human-written ones ๐Ÿ”ฅ

Researchers from Beijing and Shanghai just published the first application of a deep research system to academia: their algorithm, given a question, can give you a survey of all papers on the subject.

To make a research survey, you generally follow two steps, preparation (collect and organize papers) and writing (outline creation, writing, polishing). Researchers followed the same two steps and automated them.

๐ŸŽฏ For the preparation part, a key part is find all the important references on the given subject.
Researchers first cast a wide net of all relevant papers. But then finding the really important ones is like distilling knowledge from a haystack of information. To solve this challenge, they built an โ€œAttributeTreeโ€ object that structures key information from citations. Ablating these AttributeTrees significantly decreased structure and synthesis scores, so they were really useful!

๐Ÿ“ For the writing part, key was to get a synthesis that's both short and true. This is not easy to get with LLMs! So they used methods like LLM-based deduplication to shorten the too verbose listings made by LLMs, and RAG to grab original quotes instead of made-up ones.

As a result, their system outperforms previous approaches by far!

As assessed by LLM-judges, the quality score os SurveyX even approaches this of human experts, with 4.59/5 vs 4.75/5 ๐Ÿ†

I advise you to read the paper, it's a great overview of the kind of assistants that we'll get in the short future! ๐Ÿ‘‰ SurveyX: Academic Survey Automation via Large Language Models (2502.14776)
Their website shows examples of generated surveys ๐Ÿ‘‰ http://www.surveyx.cn/
posted an update 10 days ago
reacted to nyuuzyou's post with โค๏ธ about 1 month ago
view post
Post
1693
๐ŸŽจ Artfol Dataset - nyuuzyou/artfol

A collection of 1,892,816 artwork posts featuring:
- High-quality art pieces with various styles and techniques
- Complete metadata including artist IDs, titles, and moderation flags
- Content from Artfol social media platform

The dataset contains:
- Public domain artwork posts
- Artist attribution and identifiers
- Direct image URLs and web page links
- Content safety flags (NSFW, gore)
- Post titles and descriptions

All content is available under CC0 license, allowing unrestricted use including commercial applications.
posted an update about 2 months ago
view post
Post
654
I've released some spaces that demonstrates more advanced uses of MediaPipe-landmarks.

Head-pose-estimate
original mediapipeโ€™s detection is good on short angles,trained-model seems work and there are more improve space
Akjava/mediapipe-head-pose-estimation

generate-3d-head:gltf
this is simple and initial
Akjava/mediapipe-face-mesh-3d
Akjava/mediapipe-head-2d-spinning

prototype-faceswap
color adjust and transform soso work,need find a way to keep face features.
Akjava/mediapipe-face-skin-transform
reacted to davidberenstein1957's post with ๐Ÿ”ฅ 3 months ago
view post
Post
1718
Letโ€™s make a generation of amazing image-generation models

The best image generation models are trained on human preference datasets, where annotators have selected the best image from a choice of two. Unfortunately, many of these datasets are closed source so the community cannot train open models on them. Letโ€™s change that!

The community can contribute image preferences for an open-source dataset that could be used for building AI models that convert text to image, like the flux or stable diffusion families. The dataset will be open source so everyone can use it to train models that we can all use.

Blog: https://huggingface.co/blog/burtenshaw/image-preferences
posted an update 3 months ago
posted an update 3 months ago
view post
Post
539
Finaly I realesed mediapipe-face animation space.

Mediapipe 68-points Eyes-Closed and Mouth-Opened
Akjava/mediapipe-68-facial-guide-eyes-closed-mouth-opened

[Article]Results: Converted Guide Images(eyes-closed and mouth-opened) with Flux.1 schenll img2img/inpaint
https://huggingface.co/blog/Akjava/result-guide-image-eyes-mouth

All the other tools listed are designed to support Mediapipe Face Animation

Akjava/mediapipe-tools-672ffe8ee7b62763c31b70c7

Akjava/webp-3-frame-talking-animation-tools-672819ce4989f354cdbcc739
posted an update 3 months ago
view post
Post
561
hi All I just shared Spaces and Article.

This key feature is Mediapipe face landmarker
Apache Licensed and trained with own dataset.
Good licensed model who use Flux.1 schnell instead of Dev

[Spaces]
Mediapipe Face detect
Akjava/mediapipe-face-detect

Face crop and replace
Akjava/mediapipe-face-crop-and-replace

Mediapipe 68 landmark
Akjava/mediapipe-68-points-facial-landmark

Mediapipe 68 Face Mask
Akjava/mediapipe-68-points-facial-mask

[Articles]
Better img2img results with Flux.1 schnell by using ScaleUp or Sharpen or FillColor pre-processing
https://huggingface.co/blog/Akjava/img2img-pre-processing
posted an update 4 months ago
view post
Post
707
I've released several new Hugging Face Spaces.

My primary objective is to create consistent character facial animation using image-to-image techniques:

Akjava/CreateConsistentCharacterFacialAnimationWithImg2Img

A short-term goal is create simple talk-head animation.

WebP-3-Frame-Talking-Animation
Akjava/AIDiagramChatWithVoice-FaceCharacter

[Space]

- GPU tools
Flux1-schnell img2img
Akjava/flux1-schnell-img2img

Flux1-schnell Inpaint with mask-file
Akjava/flux1-schnell-img2img

- Tiny CPU tools
WebP-3F-TH - create webp animation from 3 images
OpenCV-Inapint - classic inpaint
Whitebalance - simple white balance
Paste Image - just paste image with mask
WebP Resize Convert - resize and convert webp-animation
posted an update 5 months ago
view post
Post
1424
Streaming Text-to-Speech Chat Demo (CPU Inference Client)

Akjava/mistral-7b-v0.3-matcha-tts-en

Please be patient, as it may take over a minute to load the ONNX model.

This demo utilizes an inference client, which may occasionally become unresponsive.