INDEX

Explanations

Hotel booking websites

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.11.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

_combo

-0.08

 convex

-0.08

énergie

-0.07

 Tomb

-0.07

fine

-0.07

嫌弃

-0.07

📆

-0.07

انتشار

-0.07

ulta

-0.07

INCREMENT

-0.07

POSITIVE LOGITS

身边

0.08

`/

0.07

/pre

0.07

 strapon

0.07

 antibody

0.07

 Palin

0.07

odb

0.07

aravel

0.07

 sadly

0.06

 والا

0.06

Activations Density 0.003%

Hotel booking websites

No Comments

No Known Activations

Hotel booking websites

No Comments

No Known Activations