llada-safety-alignment

AI & ML interests

None defined yet.

Recent Activity

zichenwen authored a paper 28 days ago

OmniLayout: Enabling Coarse-to-Fine Learning with LLMs for Universal Document Layout Generation

zichenwen authored a paper 28 days ago

TRivia: Self-supervised Fine-tuning of Vision-Language Models for Table Recognition

zichenwen authored a paper 28 days ago

DOCR-Inspector: Fine-Grained and Automated Evaluation of Document Parsing with VLM

View all activity

authored 5 papers 28 days ago

OmniLayout: Enabling Coarse-to-Fine Learning with LLMs for Universal Document Layout Generation

Paper • 2510.26213 • Published Oct 30, 2025 • 10

TRivia: Self-supervised Fine-tuning of Vision-Language Models for Table Recognition

Paper • 2512.01248 • Published Dec 1, 2025 • 12

DOCR-Inspector: Fine-Grained and Automated Evaluation of Document Parsing with VLM

Paper • 2512.10619 • Published Dec 11, 2025

Innovator-VL: A Multimodal Large Language Model for Scientific Discovery

Paper • 2601.19325 • Published Jan 27 • 79

Kimi K2.5: Visual Agentic Intelligence

Paper • 2602.02276 • Published 29 days ago • 253

submitted a paper to Daily Papers about 1 month ago

Innovator-VL: A Multimodal Large Language Model for Scientific Discovery

Paper • 2601.19325 • Published Jan 27 • 79

authored a paper about 2 months ago

Geometrically-Constrained Agent for Spatial Reasoning

Paper • 2511.22659 • Published Nov 27, 2025 • 41

authored 6 papers 4 months ago

Data Whisperer: Efficient Data Selection for Task-Specific LLM Fine-Tuning via Few-Shot In-Context Learning

Paper • 2505.12212 • Published May 18, 2025

Winning the Pruning Gamble: A Unified Approach to Joint Sample and Token Pruning for Efficient Supervised Fine-Tuning

Paper • 2509.23873 • Published Sep 28, 2025 • 67

Are We Using the Right Benchmark: An Evaluation Framework for Visual Token Compression Methods

Paper • 2510.07143 • Published Oct 8, 2025 • 13

AudioMarathon: A Comprehensive Benchmark for Long-Context Audio Understanding and Efficiency in Audio LLMs

Paper • 2510.07293 • Published Oct 8, 2025

ViCO: A Training Strategy towards Semantic Aware Dynamic High-Resolution

Paper • 2510.12793 • Published Oct 14, 2025 • 4

AI for Service: Proactive Assistance with AI Glasses

Paper • 2510.14359 • Published Oct 16, 2025 • 77

authored a paper 5 months ago

Efficient Multi-modal Large Language Models via Progressive Consistency Distillation

Paper • 2510.00515 • Published Oct 1, 2025 • 42

authored 3 papers 7 months ago

IS-Bench: Evaluating Interactive Safety of VLM-Driven Embodied Agents in Daily Household Tasks

Paper • 2506.16402 • Published Jun 19, 2025 • 1

Derail Yourself: Multi-turn LLM Jailbreak Attack through Self-discovered Clues

Paper • 2410.10700 • Published Oct 14, 2024 • 3

X-Boundary: Establishing Exact Safety Boundary to Shield LLMs from Multi-Turn Jailbreaks without Compromising Usability

Paper • 2502.09990 • Published Feb 14, 2025 • 1

authored a paper 7 months ago

The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs

Paper • 2507.11097 • Published Jul 15, 2025 • 64

authored 2 papers 8 months ago

SiMilarity-Enhanced Homophily for Multi-View Heterophilous Graph Clustering

Paper • 2410.03596 • Published Oct 4, 2024

TACTIC: Translation Agents with Cognitive-Theoretic Interactive Collaboration

Paper • 2506.08403 • Published Jun 10, 2025