INDEX

Explanations

aires

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.11.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 순간

-0.07

 Juni

-0.07

Tri

-0.07

Radio

-0.06

 ряд

-0.06

Âu

-0.06

nergie

-0.06

ERCHANTABILITY

-0.06

(Menu

-0.06

 hạnh

-0.06

POSITIVE LOGITS

 amounted

0.07

 decorations

0.07

 aerobic

0.07

 cassette

0.07

lam

0.07

здоров

0.07

 территор

0.06

 patter

0.06

cheme

0.06

akter

0.06

Activations Density 0.478%

aires

No Comments

No Known Activations