INDEX

Explanations

++

np_max-act-logits · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_19/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.19.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

撵

-0.08

 iNdEx

-0.07

案件

-0.07

恓

-0.07

,:);↵

-0.07

=BitConverter

-0.07

剜

-0.07

تنظ

-0.07

 fashioned

-0.06

 абсолют

-0.06

POSITIVE LOGITS

 Critical

0.07

 Military

0.07

ائر

0.07

 communication

0.07

主

0.07

Media

0.07

 associ

0.07

_chat

0.07

 Bulgarian

0.07

 Conditional

0.07

Activations Density 0.011%

++

No Comments

No Known Activations

++

No Comments

No Known Activations