sunblaze-ucb

https://github.com/sunblaze-ucb

Recent Activity

Xuandong

authored a paper 2 days ago

SkillsBench: Benchmarking How Well Agent Skills Work Across Diverse Tasks

Paper • 2602.12670 • Published 7 days ago • 46

stneng

published a dataset 3 days ago

sunblaze-ucb/e2e-cyber-bench

Updated 3 days ago • 127

robinrheem

updated a dataset 3 days ago

sunblaze-ucb/e2e-cyber-bench

Updated 3 days ago • 127

stneng

updated a dataset 11 days ago

sunblaze-ucb/cybergym-server-binary

Updated 11 days ago • 16

stneng

published a dataset 11 days ago

sunblaze-ucb/cybergym-server-binary

Updated 11 days ago • 16

stneng

updated a dataset 14 days ago

sunblaze-ucb/cybergym-poc

Updated Jun 18, 2025 • 7

Xuandong

authored a paper 16 days ago

Clipping-Free Policy Optimization for Large Language Models

Paper • 2601.22801 • Published 21 days ago • 2

Xuandong

submitted a paper to Daily Papers 17 days ago

Clipping-Free Policy Optimization for Large Language Models

Paper • 2601.22801 • Published 21 days ago • 2

JianhongTu

in sunblaze-ucb/e2e-cyber-bench 23 days ago

Feat: add 5 tasks for leptonica in Ubuntu 20.04

#5 opened 23 days ago

JianhongTu

in sunblaze-ucb/e2e-cyber-bench 24 days ago

Feat: add 3 instances for botan running in Ubuntu 20.04

#4 opened 24 days ago

Feat: add 7 instances for yara running in Ubuntu 24.04

#3 opened 24 days ago

Feat: add 10 c-blosc2 instances running in Ubuntu 20.04

#2 opened 24 days ago

upload assets for botan/arvo_6581; arvo_6626; arvo_10628

#1 opened 25 days ago

JianhongTu

authored a paper 24 days ago

DeepPlanning: Benchmarking Long-Horizon Agentic Planning with Verifiable Constraints

Paper • 2601.18137 • Published 25 days ago • 26

Xuandong

authored 6 papers 25 days ago

DE-COP: Detecting Copyrighted Content in Language Models Training Data

Paper • 2402.09910 • Published Feb 15, 2024 • 1

An undetectable watermark for generative image models

Paper • 2410.07369 • Published Oct 9, 2024

A Practical Examination of AI-Generated Text Detectors for Large Language Models

Paper • 2412.05139 • Published Dec 6, 2024

The Hidden Risks of Large Reasoning Models: A Safety Assessment of R1

Paper • 2502.12659 • Published Feb 18, 2025 • 7

DIS-CO: Discovering Copyrighted Content in VLMs Training Data

Paper • 2502.17358 • Published Feb 24, 2025 • 1

Evaluating Durability: Benchmark Insights into Multimodal Watermarking

Paper • 2406.03728 • Published Jun 6, 2024