Chirag Agarwal

AikyamLab · https://chirag-agarwall.github.io/
  • _cagarwal
  • chirag-agarwal-0a6a43a1

AI & ML interests

Explainability and Interpretability; AI Safety; AI Alignment

Recent Activity

upvoted a paper about 7 hours ago
The Fragility of Chain-of-Thought Monitoring Across Typologically Diverse Languages
submitted a paper about 7 hours ago
The Fragility of Chain-of-Thought Monitoring Across Typologically Diverse Languages
upvoted a paper 30 days ago
Towards Understanding the Robustness of Sparse Autoencoders

Organizations

  • LLM-XAI
  • Aikyam Lab

submitted a paper to Daily Papers about 7 hours ago

The Fragility of Chain-of-Thought Monitoring Across Typologically Diverse Languages

Paper • 2605.27901 • Published 2 days ago • 6
submitted a paper to Daily Papers 30 days ago

Towards Understanding the Robustness of Sparse Autoencoders

Paper • 2604.18756 • Published Apr 20 • 10
authored a paper about 2 years ago

Counterfactual Explanation Policies in RL

Paper • 2307.13192 • Published Jul 25, 2023
authored a paper over 2 years ago

SAM: The Sensitivity of Attribution Methods to Hyperparameters

Paper • 2003.08754 • Published Mar 4, 2020