INDEX

Explanations

either

np_max-act-logits · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_19/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.19.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

뵈

-0.08

NSMutable

-0.07

InternalServerError

-0.07

createClass

-0.07

 known

-0.07

.dgv

-0.07

.servers

-0.07

 sitios

-0.07

lh

-0.07

rawler

-0.07

POSITIVE LOGITS

न

0.07

Kra

0.07

 Choice

0.07

 Either

0.07

::*

0.07

 schema

0.07

Pai

0.07

_cli

0.07

できる

0.07

 Annotation

0.07

Activations Density 0.020%

either

No Comments

No Known Activations