Explanations

Comparisons, hypothetical situations

np_max-act · gemini-2.0-flash

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.11.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted

Negative Logits

" يعني"  -0.07
"这里面"  -0.07
" contains"  -0.07
"им"  -0.07
"	Context"  -0.07
" Başkanı"  -0.07
"음"  -0.07
" questionable"  -0.07
"ตลาด"  -0.07
" هي"  -0.06

Positive Logits

"ruz"  0.07
"readOnly"  0.06
"褊"  0.06
"ngrx"  0.06
" المسل"  0.06
"INCREMENT"  0.06
" fiberglass"  0.06
"%;"  0.06
"nesia"  0.06
"×</"  0.06

Activations Density 0.194%
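
The density figure above can be read as the share of sampled token positions on which this feature fires. The sketch below assumes that reading (activation strictly greater than zero counts as firing); the function name `activation_density`, the threshold, and the toy data are hypothetical and are not taken from the dashboard's own code.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means the share of positions where the feature is active.
    """
    if not activations:
        raise ValueError("need at least one activation value")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical toy data: 100,000 token positions, 194 of which activate the feature.
toy = [0.0] * 100_000
for i in range(194):
    toy[i * 500] = 1.5  # arbitrary positive activation value

density = activation_density(toy)
print(f"{density:.3%}")  # prints 0.194%
```

A feature with a density this low is silent on the vast majority of tokens, which is the sparsity a sparse autoencoder is trained to produce.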

No Known Activations