INDEX

Explanations

versions and generalizations

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.11.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 شكرا

-0.07

kf

-0.07

 anda

-0.07

=lambda

-0.07

千万

-0.07

古典

-0.06

役

-0.06

 gibi

-0.06

ArrayOf

-0.06

 لدى

-0.06

POSITIVE LOGITS

웻

0.07

icação

0.07

メディア

0.07

annon

0.07

нести

0.07

верх

0.06

dana

0.06

iện

0.06

Dev

0.06

.Max

0.06

Activations Density 0.089%

versions and generalizations

No Comments

No Known Activations

versions and generalizations

No Comments

No Known Activations