Costa Pissaris's picture

27 30

Costa Pissaris

somtimz

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 months ago

Less is More: Recursive Reasoning with Tiny Networks

liked a Space about 2 months ago

HuggingFaceFW/blogpost-fineweb-v1

liked a Space about 2 months ago

nanotron/ultrascale-playbook

View all activity

Organizations

upvoted a paper about 2 months ago

Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published Oct 6 • 500

liked 2 Spaces about 2 months ago

FineWeb: decanting the web for the finest text data at scale

Generate high-quality text data for LLMs using FineWeb

The Ultra-Scale Playbook

The ultimate guide to training LLM on large GPU Clusters

liked a Space 2 months ago

The Smol Training Playbook

The secrets to building world-class LLMs

updated a collection 3 months ago

Some of the Papers I've Read

A few of the research papers that I've read. • 9 items • Updated Sep 21

upvoted a paper 3 months ago

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

Paper • 2509.02547 • Published Sep 2 • 227

upvoted an article 4 months ago

Article

Fine-tune Llama 3 with ORPO

Apr 22, 2024

•

241

upvoted a collection 6 months ago

Gemma 3n

4 items • Updated Jul 10 • 253

upvoted a collection 7 months ago

Self-improving LLMs

17 items • Updated Mar 27 • 2

upvoted a paper 7 months ago

Sample, Scrutinize and Scale: Effective Inference-Time Search by Scaling Verification

Paper • 2502.01839 • Published Feb 3 • 10

upvoted a paper 8 months ago

Meta-Rewarding Language Models: Self-Improving Alignment with LLM-as-a-Meta-Judge

Paper • 2407.19594 • Published Jul 28, 2024 • 21

upvoted 2 articles 9 months ago

Article

Evalverse: Revolutionizing Large Language Model Evaluation with a Unified, User-Friendly Framework

May 7, 2024

•

3

Article

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

Jul 29, 2024

•

365

liked a dataset 9 months ago

mlabonne/FineTome-100k

Viewer • Updated Jul 29, 2024 • 100k • 14.1k • 256

liked a Space 10 months ago

Gemma 3 12b It

Generate text based on images and videos

upvoted a collection 10 months ago

Gemma 3 Release

28 items • Updated Aug 11 • 572

liked a model about 1 year ago

Qwen/Qwen2.5-Coder-32B-Instruct

Text Generation • 33B • Updated Jan 12 • 145k • • 1.96k

upvoted an article about 1 year ago

Article

Let's talk about LLM evaluation

May 23, 2024

•

204

updated a collection over 1 year ago

Some of the Papers I've Read

A few of the research papers that I've read. • 9 items • Updated Sep 21

upvoted a paper over 1 year ago

RAG Does Not Work for Enterprises

Paper • 2406.04369 • Published May 31, 2024 • 1