INDEX

Explanations

ICT

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_3/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.3.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 audiences

-0.07

 gentleman

-0.07

і

-0.07

ddd

-0.06

 mane

-0.06

quo

-0.06

bet

-0.06

|,↵

-0.06

 distributor

-0.06

 episode

-0.06

POSITIVE LOGITS

ICT

0.26

ICT

0.18

ict

0.07

 corruption

0.06

ICS

0.06

ABCDEFGHIJKLMNOPQRSTUVWXYZ

0.06

PERT

0.06

getService

0.06

/accounts

0.06

医疗

0.06

Activations Density 0.001%

ICT

No Comments

No Known Activations

ICT

No Comments

No Known Activations