INDEX

Explanations

amounts or proportions

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_15/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.15.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

	parser

-0.07

сть

-0.06

surname

-0.06

 memberships

-0.06

placeholder

-0.06

Urls

-0.06

 decals

-0.06

ść

-0.06

 debts

-0.06

adapter

-0.06

POSITIVE LOGITS

 noreferrer

0.07

 Bloomberg

0.07

 doldur

0.07

 різні

0.07

 investigating

0.06

 Brake

0.06

 Coral

0.06

 tweeted

0.06

 erre

0.06

�s

0.06

Activations Density 0.049%

amounts or proportions

No Comments

No Known Activations