INDEX

Explanations

scientific / technical texts

np_max-act-logits · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_27/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.27.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 xung

-0.07

删除

-0.07

чет

-0.06

.types

-0.06

fg

-0.06

urple

-0.06

 tuto

-0.06

_share

-0.06

 splits

-0.06

 Dest

-0.06

POSITIVE LOGITS

 từng

0.07

 Afghan

0.06

ATK

0.06

 recharge

0.06

 объек

0.06

Limit

0.06

 visite

0.06

_MetadataUsageId

0.06

Senator

0.06

 accompanied

0.06

Activations Density 0.417%

scientific / technical texts

No Comments

No Known Activations

scientific / technical texts

No Comments

No Known Activations