Explanations

np_max-act-logits · gemini-2.0-flash

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_19/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.19.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted

Negative Logits

Token          Logit
" expiry"      -0.07
"Https"        -0.07
"_birth"       -0.07
"第二"         -0.07
"mare"         -0.07
" Pres"        -0.07
"films"        -0.07
"partment"     -0.07
"רפואה"        -0.07
"_bc"          -0.07

Positive Logits

Token          Logit
" objective"   0.07
"&("           0.07
" Algorithms"  0.07
" ingl"        0.07
"王"           0.07
"seu"          0.07
" solves"      0.07
"(LogLevel"    0.06
"('&"          0.06
"owe"          0.06
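For readers unfamiliar with logit tables like the ones above, here is a minimal sketch of one common way such per-token values are obtained: projecting a feature's decoder direction through the model's unembedding matrix and ranking the results. Whether this dashboard computes its values exactly this way is an assumption. The function names, the two-dimensional decoder vector, and the unembedding values are all hypothetical toy data, not taken from the actual SAE or model.

```python
# Hypothetical sketch: rank vocabulary tokens by how much one SAE feature
# pushes their logits up or down. All numbers below are toy values.

def feature_logit_effects(decoder_direction, unembed, vocab):
    """Project a feature's decoder direction through an unembedding matrix.

    decoder_direction: list of d_model floats
    unembed: d_model rows, each a list of len(vocab) floats
    vocab: list of token strings, one per unembedding column
    """
    effects = []
    for col, token in enumerate(vocab):
        score = sum(
            decoder_direction[row] * unembed[row][col]
            for row in range(len(decoder_direction))
        )
        effects.append((token, score))
    return effects


def top_logits(effects, k):
    """Return the k most negative and the k most positive (token, score) pairs."""
    ranked = sorted(effects, key=lambda pair: pair[1])  # ascending by score
    negative = ranked[:k]
    positive = ranked[::-1][:k]
    return negative, positive


# Toy example (assumed values, chosen only so the output resembles the tables).
vocab = [" objective", " solves", " expiry", "films"]
decoder_direction = [1.0, -1.0]
unembed = [
    [0.05, 0.03, -0.04, -0.02],
    [-0.02, -0.03, 0.03, 0.04],
]

effects = feature_logit_effects(decoder_direction, unembed, vocab)
negative, positive = top_logits(effects, k=2)

for token, score in negative + positive:
    print(f"{token!r:>14}  {score:+.2f}")
```

In this toy setup, " expiry" and "films" come out as the most negative tokens and " objective" and " solves" as the most positive. Tokens are printed with `repr` so that leading spaces stay visible.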

Activations Density 0.001%
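As a rough illustration of what a density figure of this size means, the sketch below treats activation density as the fraction of tokens on which the feature's activation exceeds a threshold. That definition, the `activation_density` helper, and the sample activations are assumptions made for illustration, not the dashboard's documented method.

```python
# Hypothetical sketch: activation density as the share of tokens on which
# a feature fires. The activation values are made up for illustration.

def activation_density(activations, threshold=0.0):
    """Fraction of activations strictly above the threshold."""
    if not activations:
        return 0.0
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# One firing token out of 100,000 (assumed toy data).
activations = [0.0] * 99_999 + [2.5]
density = activation_density(activations)
print(f"{density:.3%}")  # prints 0.001%
```

Under this reading, a density of 0.001% corresponds to the feature firing on about one token in every 100,000.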

No Known Activations
