Open to Collab

Soumik Rakshit

geekyrakshit

http://geekyrakshit.dev

AI & ML interests

Computer vision

Recent Activity

updated a dataset about 21 hours ago

geekyrakshit/art-images

published a dataset about 21 hours ago

geekyrakshit/art-images

upvoted a paper 28 days ago

Think-Then-Generate: Reasoning-Aware Text-to-Image Diffusion with LLM Encoders

Paper • 2601.10332 • Published Jan 15 • 32

upvoted an article 2 months ago

Running Native PyTorch on TPUs with Zero Code Changes

Article • rishiraj • Feb 21 • 6

upvoted an article 3 months ago

Mixture of Experts (MoEs) in Transformers

Article • ariG23498, pcuenq, merve, IlyasMoutawwakil, ArthurZ, sergiopaniego, Molbap • Feb 26 • 164

upvoted a paper 8 months ago

Drax: Speech Recognition with Discrete Flow Matching

Paper • 2510.04162 • Published Oct 5, 2025 • 28

upvoted an article 9 months ago

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

Article • ariG23498, sergiopaniego, reach-vb, pcuenq, ArthurZ, SaylorTwift, cyrilvallez • Sep 11, 2025 • 188

upvoted 2 papers 9 months ago

Gated Associative Memory: A Parallel O(N) Architecture for Efficient Sequence Modeling

Paper • 2509.00605 • Published Aug 30, 2025 • 43

Beyond Transcription: Mechanistic Interpretability in ASR

Paper • 2508.15882 • Published Aug 21, 2025 • 89

upvoted 3 articles 10 months ago

LoRA training scripts of the world, unite!

Article • linoyts, multimodalart • Jan 2, 2024 • 79

Advanced Flux Dreambooth LoRA Training with 🧨 diffusers

Article • linoyts • Oct 21, 2024 • 42

SmolLM3: smol, multilingual, long-context reasoner

Article • eliebak, cmpatino, anton-l, edbeeching, m-ric, nouamanetazi, akseljoonas, guipenedo, hynky, clefourrier, SaylorTwift, kashif, qgallouedec, hlarcher, glutamatt, Xenova, reach-vb, ngxson, craffel, lewtun, loubnabnl, lvwerra, thomwolf • Jul 8, 2025 • 777

upvoted an article 11 months ago

cocogold: training Marigold for text-grounded segmentation

Article • pcuenq • Jul 8, 2025 • 31

upvoted 2 articles over 1 year ago

Train 400x faster Static Embedding Models with Sentence Transformers

Article • tomaarsen • Jan 15, 2025 • 230

Document Similarity Search with ColPali

Article • fsommers • Sep 21, 2024 • 52

upvoted 2 collections over 1 year ago

Molmo

Collection

Artifacts for open multimodal language models. • 5 items • Updated Dec 23, 2025 • 309

Llama 3.2

Collection

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated Dec 6, 2024 • 674

upvoted 3 articles almost 2 years ago

Understanding Vector Quantization in VQ-VAE

Article • ariG23498 • Aug 28, 2024 • 63

How to communicate in a Pull Request?

Article • ariG23498 • Aug 22, 2024 • 19

The Workflow of PEFT

Article • ariG23498 • Aug 14, 2024 • 19
