Explanations

structure/obstruction

np_max-act · gemini-2.0-flash

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_3/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.3.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted

Negative Logits

Token        Logit
".trigger"   -0.07
"_daily"     -0.06
"rací"       -0.06
" últimos"   -0.06
".execSQL"   -0.06
"(Value"     -0.06
" nhập"      -0.06
" dipped"    -0.06
"Sw"         -0.06
" nied"      -0.06

Positive Logits

Token            Logit
" obstruction"   0.10
" obstruct"      0.08
"struction"      0.06
".Constraint"    0.06
"Run"            0.06
"प"              0.06
"ональ"          0.06
"+</"            0.06
"_plugin"        0.06
"_tbl"           0.06
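
Logit tables like the ones above are commonly produced by projecting a feature's decoder direction through the model's unembedding matrix and sorting the result. The page does not state how its values were computed, so the following is only a minimal sketch under that assumption. The vectors, the three-dimensional size, and the helper names `logit_effects` and `top_and_bottom` are hypothetical and chosen for illustration; a real pipeline may also apply the final layer norm before unembedding.

```python
# Toy sketch: rank vocabulary tokens by a feature's direct logit effect.
# All vectors below are made-up illustration values, not dashboard data.

def logit_effects(decoder_dir, unembed):
    """Dot the feature's decoder direction with each token's unembedding vector."""
    return {
        tok: sum(d * w for d, w in zip(decoder_dir, vec))
        for tok, vec in unembed.items()
    }

def top_and_bottom(effects, k):
    """Return the k most positive and the k most negative tokens."""
    ranked = sorted(effects.items(), key=lambda kv: kv[1], reverse=True)
    positive = ranked[:k]
    negative = ranked[-k:][::-1]  # most negative first
    return positive, negative

# Hypothetical 3-dimensional decoder direction and unembedding vectors.
decoder_dir = [0.5, -0.2, 0.1]
unembed = {
    " obstruction": [0.2, -0.1, 0.0],
    " obstruct": [0.1, -0.1, 0.1],
    ".trigger": [-0.1, 0.1, 0.0],
    "_daily": [-0.1, 0.0, -0.1],
}

positive, negative = top_and_bottom(logit_effects(decoder_dir, unembed), k=2)
for tok, val in positive + negative:
    print(f"{tok!r:16} {val:+.2f}")
```

Printing tokens with `repr` keeps leading spaces visible, which matters because " obstruct" and "obstruct" are distinct tokens.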

Activations Density: 0.004%
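
A density figure of this kind is usually read as the fraction of tokens on which the feature fires at all. The page does not define it, so the sketch below assumes that reading; the function name `activation_density`, the zero threshold, and the synthetic activations are all hypothetical.

```python
# Toy sketch: activation density as the share of tokens with a nonzero activation.
# The activations are synthetic; the threshold of 0.0 is an assumption.

def activation_density(activations, threshold=0.0):
    """Fraction of positions whose activation exceeds the threshold."""
    if not activations:
        return 0.0
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)

# 100,000 synthetic token positions, with the feature firing on 4 of them.
acts = [0.0] * 100_000
for i in (10, 2_000, 45_000, 99_999):
    acts[i] = 1.5

density = activation_density(acts)
print(f"{density:.3%}")
```

Under this reading, a density of 0.004% corresponds to roughly 4 firing tokens per 100,000.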

No Known Activations