INDEX

Explanations

IT and cloud services

np_max-act-logits · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_27/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.27.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

しよう

-0.07

uez

-0.06

 Updating

-0.06

Misc

-0.06

Upgrade

-0.06

('~

-0.06

.AutoField

-0.06

"title

-0.06

tokenizer

-0.06

scriber

-0.06

POSITIVE LOGITS

 Brass

0.07

iological

0.06

 Operating

0.06

_PERIOD

0.06

 Estimated

0.06

 Scratch

0.06

sj

0.06

字

0.06

_GAME

0.06

 dimin

0.06

Activations Density 0.019%

IT and cloud services

No Comments

No Known Activations

IT and cloud services

No Comments

No Known Activations