INDEX

Explanations

Descriptions/Additional Information

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_19/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.19.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

larına

-0.06

 intoxicated

-0.06

析

-0.06

↵

-0.06

↵

-0.06

-(

-0.06

 resurrection

-0.06

щество

-0.06

典

-0.06

-D

-0.06

POSITIVE LOGITS

YE

0.08

-trash

0.07

/root

0.07

.tensor

0.07

|x

0.07

-local

0.06

해요

0.06

-model

0.06

dzi

0.06

φη

0.06

Activations Density 0.000%

Descriptions/Additional Information

No Comments

No Known Activations

Descriptions/Additional Information

No Comments

No Known Activations