INDEX

Explanations

spending time with loved ones

np_max-act-logits · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_27/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.27.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

thr

-0.07

 TextInput

-0.06

 quý

-0.06

skins

-0.06

پ

-0.06

ตา

-0.06

Imm

-0.06

 Ginger

-0.06

 bird

-0.06

(stock

-0.06

POSITIVE LOGITS

 Cout

0.06

 окрем

0.06

 mutil

0.06

':'

0.06

 сот

0.06

Listeners

0.06

 ان

0.06

 PHYS

0.06

azy

0.06

 ακ

0.06

Activations Density 0.089%

spending time with loved ones

No Comments

No Known Activations