INDEX

Explanations

judgements and analysis

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_15/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.15.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 samt

-0.06

 Lord

-0.06

ी↵

-0.06

 minions

-0.06

HU

-0.06

 submissive

-0.06

ेस

-0.06

ResourceId

-0.06

性

-0.06

.references

-0.06

POSITIVE LOGITS

ví

0.07

vestment

0.06

aggi

0.06

-grow

0.06

local

0.06

xl

0.06

.tile

0.06

 airflow

0.06

cov

0.06

Ill

0.06

Activations Density 0.009%

judgements and analysis

No Comments

No Known Activations

judgements and analysis

No Comments

No Known Activations