Explanations

accidents and fatalities

np_max-act · gemini-2.0-flash

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.11.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted

Negative Logits

" ست"     -0.06
"我一直"   -0.06
"在我"     -0.06
"Mag"      -0.06
"rob"      -0.06
"istent"   -0.06
"人と"     -0.06
"十月"     -0.06
"HasKey"   -0.06
"nat"      -0.06

Positive Logits

"فرق"          0.09
"ofs"          0.08
" ammunition"  0.07
" ions"        0.07
" schemas"     0.07
" chí"         0.07
" initiate"    0.07
"קים"          0.07
"厂家"         0.07
"FHA"          0.07

Activations Density 0.071%
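
To put the density figure in context, here is a minimal sketch that converts it into an expected number of active tokens per context window. It assumes that "Activations Density" means the fraction of tokens on which the feature has a nonzero activation, which this page does not state. The helper name `expected_active_tokens` is hypothetical and used only for illustration.

```python
# Values copied from this page.
density_percent = 0.071   # "Activations Density 0.071%"
context_size = 1024       # "Context Size 1,024"


def expected_active_tokens(density_percent: float, n_tokens: int) -> float:
    """Expected number of tokens with a nonzero activation.

    Assumption: density is the fraction of tokens on which the
    feature fires, expressed as a percentage.
    """
    return density_percent / 100.0 * n_tokens


per_context = expected_active_tokens(density_percent, context_size)
print(f"{per_context:.3f} expected active tokens per {context_size}-token context")
```

Under that assumption, the feature fires on fewer than one token per full-length context on average.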

No Known Activations
