Hi there HF Community!๐ค I just created a very simple AI-powered bot that shares fact-checked news about Science, Environment, Energy and Technology on BlueSky :)
The bot takes news from Google News, filters out the sources that are not represented in the Media Bias Fact Check database, and then evaluates the reliability of the source based on the MBFC metrics. After that, it creates a catchy headline for the article and publishes the post on BlueSky๐ฐ
The cool thing? SciNewsBot is open-source and is cheap to maintain, as it is based on mistralai/Mistral-Small-24B-Instruct-2501 (via Mistral API). You can reproduce it locally, spinning it up on your machine, and even launch it on cloud through a comfy Docker setup๐
The Nobel Prize background for Hopfield and Hinton's work on neural networks is pure gold. It's a masterclass in explaining AI basics.
Key takeaways from the conclusion: - ML applications are expanding rapidly. We're still figuring out which will stick. - Ethical discussions are crucial as the tech develops. - Physics ๐ค AI: A two-way street of innovation.
Some mind-blowing AI applications in physics: - Discovering the Higgs particle - Cleaning up gravitational wave data - Hunting exoplanets - Predicting molecular structures - Designing better solar cells
We're just scratching the surface. The interplay between AI and physics is reshaping both fields.
Bonus: The illustrations accompanying the background document are really neat. (Credit: Johan Jarnestad/The Royal Swedish Academy of Sciences)
Meta AI vision has been cooking @facebook They shipped multiple models and demos for their papers at @ECCV๐ค
Here's a compilation of my top picks: - Sapiens is family of foundation models for human-centric depth estimation, segmentation and more, all models have open weights and demos ๐
All models have their demos and even torchscript checkpoints! A collection of models and demos: facebook/sapiens-66d22047daa6402d565cb2fc - VFusion3D is state-of-the-art consistent 3D generation model from images
๐ง Stanford paper might be the key to OpenAI o1โs performance: Whatโs so effective about Chain of Thought? โ it unlocks radically different sequential tasks!
๐ญย Reminder: A Chain of Thought (CoT) means that you instruct the model to โthink step by stepโ. Often itโs literally just putting in the prompt โletโs think step by step.โ
๐คย This method has been shown to be unreasonably effective to increase perf on benchmarks. However why it works so well remains unclear.
Here's the scoop: Transformers are amazing at parallel processing, but they've always struggled with tasks that require sequential reasoning.
โ๏ธ For instance if you ask them the result of 3^2^2^2^โฆ, with 20 iterations, theyโll nearly always fail.
๐กย Indeed, researchers prove mathematically, by assimilating transformers networks to logical circuits, that effectively they cannot solve sequential tasks that require more than a certain threshold of sequences.
But CoT enables sequential reasoning:
- ๐งฑ Each step in the CoT corresponds to simulating one operation in a complex circuit. - ๐ This allows the transformer to "reset" the depth of intermediate outputs, overcoming previous limitations. - ๐ Thus, with CoT, constant-depth transformers can now solve ANY problem computable by polynomial-size circuits! (That's a huge class of problems in computer science.) - ๐ Transformers can now handle tricky tasks like iterated squares (computing 3^2^2^2^2) composed permutations and evaluating circuits - stuff that requires serial computation. - ๐ย The improvement is especially dramatic for transformers with a limited depth. Empirical tests on four arithmetic problems showed massive accuracy gains with CoT on inherently serial tasks.
Main takeaway: Chain-of-thought isn't just a neat trick - it fundamentally expands what transformer models can do!
๐ฅ๐ญ๐ New Research Alert - HeadGAP (Avatars Collection)! ๐๐ญ๐ฅ ๐ Title: HeadGAP: Few-shot 3D Head Avatar via Generalizable Gaussian Priors ๐
๐ Description: HeadGAP introduces a novel method for generating high-fidelity, animatable 3D head avatars from few-shot data, using Gaussian priors and dynamic part-based modelling for personalized and generalizable results.
๐ฅ๐ญ๐ New Research Alert - ECCV 2024 (Avatars Collection)! ๐๐ญ๐ฅ ๐ Title: MeshAvatar: Learning High-quality Triangular Human Avatars from Multi-view Videos ๐
๐ Description: MeshAvatar is a novel pipeline that generates high-quality triangular human avatars from multi-view videos, enabling realistic editing and rendering through a mesh-based approach with physics-based decomposition.
๐ฅ Authors: Yushuo Chen, Zerong Zheng, Zhe Li, Chao Xu, and Yebin Liu