LLM Safety From Within: Detecting Harmful Content with Internal Representations
Paper • 2604.18519 • Published • 15