INDEX

Explanations

our/about

np_max-act-logits · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_23/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.23.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

"↵

-0.07

 ดร

-0.07

’,

-0.07

енность

-0.07

 entreprises

-0.06

 สาข

-0.06

?↵

-0.06

 BETWEEN

-0.06

DatePicker

-0.06

овані

-0.06

POSITIVE LOGITS

dal

0.07

 marsh

0.06

 Mahm

0.06

 municipalities

0.06

.Infof

0.06

/xhtml

0.06

etrofit

0.06

blade

0.06

 sting

0.06

 UIScreen

0.06

Activations Density 0.037%

our/about

No Comments

No Known Activations