AlignmentResearch/obfuscation-atlas-gemma-3-27b-it-kl0.0001-det1-seed1-diverse_deception_probe Updated 9 days ago • 14
AlignmentResearch/obfuscation-atlas-Meta-Llama-3-8B-Instruct-kl0.001-det1-seed1-mbpp_probe Updated 9 days ago • 10
AlignmentResearch/obfuscation-atlas-Meta-Llama-3-8B-Instruct-kl0.01-det1-seed1-mbpp_probe Updated 9 days ago • 12
AlignmentResearch/obfuscation-atlas-Meta-Llama-3-8B-Instruct-kl1-det1-seed1-mbpp_probe Updated 9 days ago • 15
AlignmentResearch/obfuscation-atlas-gemma-3-12b-it-kl0.0001-det1-seed1-diverse_deception_probe Updated 9 days ago • 14
AlignmentResearch/obfuscation-atlas-gemma-3-12b-it-kl1-det1-seed1-mbpp_probe Updated 9 days ago • 11
AlignmentResearch/obfuscation-atlas-gemma-3-12b-it-kl0.0001-det1-seed1-deception_probe Updated 9 days ago • 12
AlignmentResearch/obfuscation-atlas-Meta-Llama-3-8B-Instruct-kl0.0001-det1-seed1-deception_probe Updated 9 days ago • 12
AlignmentResearch/obfuscation-atlas-Meta-Llama-3-8B-Instruct-kl0.0001-det1-seed1-diverse_deception_probe Updated 9 days ago • 9
AlignmentResearch/obfuscation-atlas-gemma-3-12b-it-kl0.01-det1-seed1-mbpp_probe Updated 9 days ago • 11
AlignmentResearch/obfuscation-atlas-gemma-3-27b-it-kl0.1-det1-seed1-mbpp_probe Updated 9 days ago • 17
AlignmentResearch/obfuscation-atlas-Meta-Llama-3-8B-Instruct-kl0.0001-det10-seed1-diverse_deception_probe Updated 9 days ago • 15
AlignmentResearch/obfuscation-atlas-gemma-3-27b-it-kl1-det1-seed1-mbpp_probe Updated 9 days ago • 13
AlignmentResearch/obfuscation-atlas-gemma-3-27b-it-kl0.0001-det1-seed1-deception_probe Updated 9 days ago • 10
AlignmentResearch/obfuscation-atlas-gemma-3-27b-it-kl0.01-det1-seed1-mbpp_probe Updated 9 days ago • 13
AlignmentResearch/obfuscation-atlas-gemma-3-27b-it-kl0.0001-det1-seed1-mbpp_probe Updated 9 days ago • 11
AlignmentResearch/obfuscation-atlas-Meta-Llama-3-8B-Instruct-kl0.001-det10-seed1-diverse_deception_probe Updated 9 days ago • 7
AlignmentResearch/obfuscation-atlas-gemma-3-27b-it-kl0.001-det1-seed1-mbpp_probe Updated 9 days ago • 12
AlignmentResearch/obfuscation-atlas-Meta-Llama-3-8B-Instruct-kl0.01-det10-seed1-diverse_deception_probe Updated 9 days ago • 9
AlignmentResearch/obfuscation-atlas-gemma-3-12b-it-kl0.001-det10-seed1-diverse_deception_probe Updated 9 days ago • 14
AlignmentResearch/obfuscation-atlas-gemma-3-12b-it-kl0.0001-det10-seed1-diverse_deception_probe Updated 9 days ago • 13
AlignmentResearch/obfuscation-atlas-gemma-3-12b-it-kl0.01-det10-seed1-diverse_deception_probe Updated 9 days ago • 15
AlignmentResearch/obfuscation-atlas-gemma-3-27b-it-kl0.0001-det10-seed1-mbpp_probe Updated 9 days ago • 14
AlignmentResearch/obfuscation-atlas-Meta-Llama-3-8B-Instruct-kl0.1-det1-seed1-mbpp_probe Updated 9 days ago • 14
AlignmentResearch/obfuscation-atlas-gemma-3-12b-it-kl0.1-det1-seed1-mbpp_probe Updated 9 days ago • 13
AlignmentResearch/obfuscation-atlas-Meta-Llama-3-70B-Instruct-kl0.0001-det0-seed3 Updated 9 days ago • 13
AlignmentResearch/obfuscation-atlas-Meta-Llama-3-70B-Instruct-kl0.0001-det0-seed2 Updated 9 days ago • 12
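
The repository names above appear to follow one pattern: a base model name, then `kl`, `det`, and `seed` values, then an optional `*_probe` suffix. The sketch below splits a name into those parts so the list can be filtered or grouped. The pattern is inferred only from the names in this listing, and the function name `parse_repo_name` and the field keys are hypothetical. What `kl` and `det` mean is not stated here, so the code keeps them as plain numbers without interpreting them.

```python
import re
from typing import Optional

# Pattern inferred from the names in this listing (an assumption, not a documented scheme).
NAME_RE = re.compile(
    r"^(?P<org>[^/]+)/obfuscation-atlas-"
    r"(?P<base_model>.+?)"
    r"-kl(?P<kl>\d+(?:\.\d+)?)"
    r"-det(?P<det>\d+(?:\.\d+)?)"
    r"-seed(?P<seed>\d+)"
    r"(?:-(?P<probe>\w+_probe))?$"
)


def parse_repo_name(name: str) -> Optional[dict]:
    """Split a repository name into its parts; return None if it does not fit the pattern."""
    match = NAME_RE.match(name.strip())
    if match is None:
        return None
    fields = match.groupdict()
    # Numeric parts are converted as-is; their meaning is not given in the listing.
    fields["kl"] = float(fields["kl"])
    fields["det"] = float(fields["det"])
    fields["seed"] = int(fields["seed"])
    return fields  # "probe" is None when the name has no *_probe suffix


if __name__ == "__main__":
    # A listing line also carries "Updated ... • N"; the name is the first whitespace-separated token.
    line = ("AlignmentResearch/obfuscation-atlas-gemma-3-27b-it-kl0.0001-det1-seed1-"
            "diverse_deception_probe Updated 9 days ago • 14")
    print(parse_repo_name(line.split()[0]))
```

Names that do not match return `None` rather than raising, so entries with a different layout can be set aside and checked by hand.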