INDEX

Explanations

Battle

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_7/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.7.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 Thomson

-0.08

qr

-0.08

 ฟร

-0.07

 Rooney

-0.07

 fraud

-0.07

 Wein

-0.06

 초기

-0.06

 JsonSerializer

-0.06

 kommer

-0.06

POSITIVE LOGITS

 battle

0.20

 Battle

0.19

Battle

0.19

 battles

0.17

battle

0.14

 Battles

0.14

 Batt

0.11

 battled

0.11

 battling

0.09

Bat

0.09

Activations Density 0.010%

Battle

No Comments

No Known Activations

Battle

No Comments

No Known Activations