Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Nicholas Crispino's picture
3 7 2

Nicholas Crispino

ncrispino
JianhongTu's profile picture kylemontgomery's profile picture
·

AI & ML interests

None yet

Organizations

WangLab's profile picture VMDT -- Video Trustworthiness Benchmark's profile picture Multi-agent-oversight's profile picture

upvoted 2 papers 4 months ago

Budget-aware Test-time Scaling via Discriminative Verification

Paper • 2510.14913 • Published Oct 16, 2025 • 5

Predicting Task Performance with Context-aware Scaling Laws

Paper • 2510.14919 • Published Oct 16, 2025 • 4
upvoted 2 papers 5 months ago

COSMIC: Generalized Refusal Direction Identification in LLM Activations

Paper • 2506.00085 • Published May 30, 2025 • 2

RepIt: Representing Isolated Targets to Steer Language Models

Paper • 2509.13281 • Published Sep 16, 2025 • 4
upvoted a collection 5 months ago

SteeringSafety

Collection
A benchmark for evaluating effectiveness and entanglement in representation steering across seven safety-relevant perspectives • 2 items • Updated Oct 20, 2025 • 1
upvoted a paper 5 months ago

SteeringControl: Holistic Evaluation of Alignment Steering in LLMs

Paper • 2509.13450 • Published Sep 16, 2025 • 7
upvoted a paper over 1 year ago

JudgeBench: A Benchmark for Evaluating LLM-based Judges

Paper • 2410.12784 • Published Oct 16, 2024 • 47
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs