1 13

alpik

Salvor

AI & ML interests

Just chilling and AI

Recent Activity

liked a model about 2 months ago

stabilityai/stable-audio-open-small

liked a model about 2 months ago

ACE-Step/Ace-Step1.5

reacted to as-cle-bert's post with 👍 about 1 year ago

𝐒𝐜𝐢𝐍𝐞𝐰𝐬𝐁𝐨𝐭 - 𝐑𝐞𝐩𝐨𝐫𝐭 𝐝𝐚𝐢𝐥𝐲 𝐒𝐜𝐢𝐞𝐧𝐜𝐞 𝐧𝐞𝐰𝐬 𝐨𝐧 𝐁𝐥𝐮𝐞𝐒𝐤𝐲 GitHub 👉 https://github.com/AstraBert/SciNewsBot BlueSky 👉 https://bsky.app/profile/sci-news-bot.bsky.social Hi there HF Community!🤗 I just created a very simple AI-powered bot that shares fact-checked news about Science, Environment, Energy and Technology on BlueSky :) The bot takes news from Google News, filters out the sources that are not represented in the Media Bias Fact Check database, and then evaluates the reliability of the source based on the MBFC metrics. After that, it creates a catchy headline for the article and publishes the post on BlueSky📰 The cool thing? SciNewsBot is open-source and is cheap to maintain, as it is based on https://huggingface.co/mistralai/Mistral-Small-24B-Instruct-2501 (via Mistral API). You can reproduce it locally, spinning it up on your machine, and even launch it on cloud through a comfy Docker setup🐋 Have fun and spread Science!✨

View all activity

Organizations

None yet

liked 2 models about 2 months ago

stabilityai/stable-audio-open-small

Text-to-Audio • Updated May 27, 2025 • 3.37k • 254

ACE-Step/Ace-Step1.5

Text-to-Audio • Updated Feb 3 • 47.5k • 714

reactedto as-cle-bert's post with 👍 about 1 year ago

Post

1410

𝐒𝐜𝐢𝐍𝐞𝐰𝐬𝐁𝐨𝐭 - 𝐑𝐞𝐩𝐨𝐫𝐭 𝐝𝐚𝐢𝐥𝐲 𝐒𝐜𝐢𝐞𝐧𝐜𝐞 𝐧𝐞𝐰𝐬 𝐨𝐧 𝐁𝐥𝐮𝐞𝐒𝐤𝐲

GitHub 👉 https://github.com/AstraBert/SciNewsBot
BlueSky 👉 https://bsky.app/profile/sci-news-bot.bsky.social

Hi there HF Community!🤗
I just created a very simple AI-powered bot that shares fact-checked news about Science, Environment, Energy and Technology on BlueSky :)

The bot takes news from Google News, filters out the sources that are not represented in the Media Bias Fact Check database, and then evaluates the reliability of the source based on the MBFC metrics. After that, it creates a catchy headline for the article and publishes the post on BlueSky📰

The cool thing? SciNewsBot is open-source and is cheap to maintain, as it is based on mistralai/Mistral-Small-24B-Instruct-2501 (via Mistral API). You can reproduce it locally, spinning it up on your machine, and even launch it on cloud through a comfy Docker setup🐋

Have fun and spread Science!✨

liked 2 models over 1 year ago

black-forest-labs/FLUX.1-dev

Text-to-Image • Updated Jun 27, 2025 • 676k • • 12.6k

genmo/mochi-1-preview

Text-to-Video • Updated Sep 4, 2025 • 9.67k • • 1.32k

liked a Space over 1 year ago

SDXL

🌍

385

Generate images from text prompts using SDXL

reactedto fdaudens's post with 🔥 over 1 year ago

Post

3081

The Nobel Prize background for Hopfield and Hinton's work on neural networks is pure gold. It's a masterclass in explaining AI basics.

Key takeaways from the conclusion:
- ML applications are expanding rapidly. We're still figuring out which will stick.
- Ethical discussions are crucial as the tech develops.
- Physics 🤝 AI: A two-way street of innovation.

Some mind-blowing AI applications in physics:
- Discovering the Higgs particle
- Cleaning up gravitational wave data
- Hunting exoplanets
- Predicting molecular structures
- Designing better solar cells

We're just scratching the surface. The interplay between AI and physics is reshaping both fields.

Bonus: The illustrations accompanying the background document are really neat. (Credit: Johan Jarnestad/The Royal Swedish Academy of Sciences)

#AI #MachineLearning #Physics #Ethics #Innovation

1 reply

reactedto merve's post with 🔥 over 1 year ago

Post

3825

Meta AI vision has been cooking @facebook
They shipped multiple models and demos for their papers at @ECCV 🤗

Here's a compilation of my top picks:
- Sapiens is family of foundation models for human-centric depth estimation, segmentation and more, all models have open weights and demos 👏

All models have their demos and even torchscript checkpoints!
A collection of models and demos: facebook/sapiens-66d22047daa6402d565cb2fc
- VFusion3D is state-of-the-art consistent 3D generation model from images

Model: facebook/vfusion3d
Demo: facebook/VFusion3D

- CoTracker is the state-of-the-art point (pixel) tracking model

Demo: facebook/cotracker
Model: facebook/cotracker

liked 2 Spaces over 1 year ago

Depth Pro

🚀

223

Generate an inverse depth map from an image

Flux.1-dev Upscaler

🔎

1.68k

Upscale low‑resolution images to higher resolution

reactedto m-ric's post with 👀 over 1 year ago

Post

1085

🧠 Stanford paper might be the key to OpenAI o1’s performance: What’s so effective about Chain of Thought? ⇒ it unlocks radically different sequential tasks!

💭 Reminder: A Chain of Thought (CoT) means that you instruct the model to “think step by step”. Often it’s literally just putting in the prompt “let’s think step by step.”

🤔 This method has been shown to be unreasonably effective to increase perf on benchmarks. However why it works so well remains unclear.

Here's the scoop: Transformers are amazing at parallel processing, but they've always struggled with tasks that require sequential reasoning.

⛔️ For instance if you ask them the result of 3^2^2^2^…, with 20 iterations, they’ll nearly always fail.

💡 Indeed, researchers prove mathematically, by assimilating transformers networks to logical circuits, that effectively they cannot solve sequential tasks that require more than a certain threshold of sequences.

But CoT enables sequential reasoning:

- 🧱 Each step in the CoT corresponds to simulating one operation in a complex circuit.
- 🔄 This allows the transformer to "reset" the depth of intermediate outputs, overcoming previous limitations.
- 🚀 Thus, with CoT, constant-depth transformers can now solve ANY problem computable by polynomial-size circuits! (That's a huge class of problems in computer science.)
- 🔑 Transformers can now handle tricky tasks like iterated squares (computing 3^2^2^2^2) composed permutations and evaluating circuits - stuff that requires serial computation.
- 📊 The improvement is especially dramatic for transformers with a limited depth. Empirical tests on four arithmetic problems showed massive accuracy gains with CoT on inherently serial tasks.

Main takeaway: Chain-of-thought isn't just a neat trick - it fundamentally expands what transformer models can do!

Read the paper 👉 Chain of Thought Empowers Transformers to Solve Inherently Serial Problems (2402.12875)

updated a collection over 1 year ago

Web Dev Specific

Collection

1 item • Updated Sep 29, 2024

reactedto DmitryRyumin's post with 🔥 over 1 year ago

Post

3076

🔥🎭🌟 New Research Alert - HeadGAP (Avatars Collection)! 🌟🎭🔥
📄 Title: HeadGAP: Few-shot 3D Head Avatar via Generalizable Gaussian Priors 🔝

📝 Description: HeadGAP introduces a novel method for generating high-fidelity, animatable 3D head avatars from few-shot data, using Gaussian priors and dynamic part-based modelling for personalized and generalizable results.

👥 Authors: @zxz267 , @walsvid , @zhaohu2 , Weiyi Zhang, @hellozhuo , Xu Chang, Yang Zhao, Zheng Lv, Xiaoyuan Zhang, @yongjie-zhang-mail , Guidong Wang, and Lan Xu

📄 Paper: HeadGAP: Few-shot 3D Head Avatar via Generalizable Gaussian Priors (2408.06019)

🌐 Github Page: https://headgap.github.io

🚀 CVPR-2023-24-Papers: https://github.com/DmitryRyumin/CVPR-2023-24-Papers

🚀 WACV-2024-Papers: https://github.com/DmitryRyumin/WACV-2024-Papers

🚀 ICCV-2023-Papers: https://github.com/DmitryRyumin/ICCV-2023-Papers

📚 More Papers: more cutting-edge research presented at other conferences in the DmitryRyumin/NewEraAI-Papers curated by @DmitryRyumin

🚀 Added to the Avatars Collection: DmitryRyumin/avatars-65df37cdf81fec13d4dbac36

🔍 Keywords: #HeadGAP #3DAvatar #FewShotLearning #GaussianPriors #AvatarCreation #3DModeling #MachineLearning #ComputerVision #ComputerGraphics #GenerativeAI #DeepLearning #AI

reactedto DmitryRyumin's post with ❤️ over 1 year ago

Post

1869

🔥🎭🌟 New Research Alert - ECCV 2024 (Avatars Collection)! 🌟🎭🔥
📄 Title: MeshAvatar: Learning High-quality Triangular Human Avatars from Multi-view Videos 🔝

📝 Description: MeshAvatar is a novel pipeline that generates high-quality triangular human avatars from multi-view videos, enabling realistic editing and rendering through a mesh-based approach with physics-based decomposition.

👥 Authors: Yushuo Chen, Zerong Zheng, Zhe Li, Chao Xu, and Yebin Liu

📅 Conference: ECCV, 29 Sep – 4 Oct, 2024 | Milano, Italy 🇮🇹

📄 Paper: MeshAvatar: Learning High-quality Triangular Human Avatars from Multi-view Videos (2407.08414)

🌐 Github Page: https://shad0wta9.github.io/meshavatar-page
📁 Repository: https://github.com/shad0wta9/meshavatar

📺 Video: https://www.youtube.com/watch?v=Kpbpujkh2iI

🚀 CVPR-2023-24-Papers: https://github.com/DmitryRyumin/CVPR-2023-24-Papers

🚀 WACV-2024-Papers: https://github.com/DmitryRyumin/WACV-2024-Papers

🚀 ICCV-2023-Papers: https://github.com/DmitryRyumin/ICCV-2023-Papers

📚 More Papers: more cutting-edge research presented at other conferences in the DmitryRyumin/NewEraAI-Papers curated by @DmitryRyumin

🚀 Added to the Avatars Collection: DmitryRyumin/avatars-65df37cdf81fec13d4dbac36

🔍 Keywords: #MeshAvatar #3DAvatars #MultiViewVideo #PhysicsBasedRendering #TriangularMesh #AvatarCreation #3DModeling #NeuralRendering #Relighting #AvatarEditing #MachineLearning #ComputerVision #ComputerGraphics #DeepLearning #AI #ECCV2024

reactedto asoria's post with 👍 over 1 year ago

Post

2629

📝 I wrote a tutorial on how to get started with the fine-tuning process using Hugging Face tools, providing an end-to-end workflow.

The tutorial covers creating a new dataset using the new SQL Console 🛢 and fine-tuning a model with SFT, guided by the Notebook Creator App 📙.

👉 You can read the full article here:
https://huggingface.co/blog/asoria/easy-fine-tuning-with-hf
asoria/auto-notebook-creator

upvoted a collection over 1 year ago

Llama 3.2

Collection

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated Dec 6, 2024 • 667

reactedto fdaudens's post with 🔥 over 1 year ago

Post

3341

A big day for multimodal models!
Llama 3.2 is out with a major update: it can now process images.

Key highlights:
• 11B and 90B vision models
• Small 1B and 3B text models for mobile devices

Eval results already on the leaderboard: open-llm-leaderboard/open_llm_leaderboard
Collection: meta-llama/llama-32-66f448ffc8c32f949b04c8cf