Gibran Iqbal

Jibbscript

AI & ML interests

None yet

Recent Activity

upvoted a paper about 3 hours ago

Small Vision-Language Models are Smart Compressors for Long Video Understanding

upvoted a paper 1 day ago

DMax: Aggressive Parallel Decoding for dLLMs

liked a model 1 day ago

Zigeng/DMax-Coder-16B

View all activity

Organizations

upvoted a paper about 3 hours ago

Small Vision-Language Models are Smart Compressors for Long Video Understanding

Paper • 2604.08120 • Published 4 days ago • 15

upvoted a paper 1 day ago

DMax: Aggressive Parallel Decoding for dLLMs

Paper • 2604.08302 • Published 4 days ago • 43

upvoted an article 3 days ago

Article

Multimodal Embedding & Reranker Models with Sentence Transformers

4 days ago

•

upvoted an article 5 days ago

Article

How we OCR'ed 30,000 papers using Codex, open OCR models and Jobs

6 days ago

•

upvoted a paper 9 days ago

CORAL: Towards Autonomous Multi-Agent Evolution for Open-Ended Discovery

Paper • 2604.01658 • Published 11 days ago • 54

upvoted a collection 10 days ago

Gemma 4

Collection

8 items • Updated 11 days ago • 578

upvoted a paper 11 days ago

Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills

Paper • 2603.25158 • Published 18 days ago • 50

upvoted 5 papers 17 days ago

CUA-Suite: Massive Human-annotated Video Demonstrations for Computer-Use Agents

Paper • 2603.24440 • Published 19 days ago • 96

Voxtral TTS

Paper • 2603.25551 • Published 18 days ago • 58

SlopCodeBench: Benchmarking How Coding Agents Degrade Over Long-Horizon Iterative Tasks

Paper • 2603.24755 • Published 18 days ago • 28

Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale

Paper • 2603.25040 • Published 18 days ago • 128

MSA: Memory Sparse Attention for Efficient End-to-End Memory Model Scaling to 100M Tokens

Paper • 2603.23516 • Published Mar 6 • 48

upvoted 3 papers 18 days ago

From Static Templates to Dynamic Runtime Graphs: A Survey of Workflow Optimization for LLM Agents

Paper • 2603.22386 • Published 20 days ago • 55

Rethinking Token-Level Policy Optimization for Multimodal Chain-of-Thought

Paper • 2603.22847 • Published 20 days ago • 25

The Universal Normal Embedding

Paper • 2603.21786 • Published 21 days ago • 15

upvoted a paper 20 days ago

Speed by Simplicity: A Single-Stream Architecture for Fast Audio-Video Generative Foundation Model

Paper • 2603.21986 • Published 21 days ago • 123

upvoted an article 22 days ago

Article

Build a Domain-Specific Embedding Model in Under a Day

23 days ago

•

upvoted 3 papers 27 days ago

Multimodal OCR: Parse Anything from Documents

Paper • 2603.13032 • Published about 1 month ago • 43

WeEdit: A Dataset, Benchmark and Glyph-Guided Framework for Text-centric Image Editing

Paper • 2603.11593 • Published Mar 12 • 25

One Model, Many Budgets: Elastic Latent Interfaces for Diffusion Transformers

Paper • 2603.12245 • Published Mar 12 • 18

Gibran Iqbal

AI & ML interests

Recent Activity

Organizations

Jibbscript's activity

Multimodal Embedding & Reranker Models with Sentence Transformers

How we OCR'ed 30,000 papers using Codex, open OCR models and Jobs

Build a Domain-Specific Embedding Model in Under a Day