Explanations

acts of kindness

np_max-act · gemini-2.0-flash

Configuration: andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_15/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.15.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted

Negative Logits

Token            Logit
"↵"              -0.06
" Alzheimer"     -0.06
" correlates"    -0.06
" newNode"       -0.06
"施"             -0.06
"/validation"    -0.06
"�"              -0.06
" تاث"           -0.06
" пош"           -0.06
"KB"             -0.06

Positive Logits

Token            Logit
"-direct"        0.07
"_property"      0.06
" soul"          0.06
" sunglasses"    0.06
" liberated"     0.06
"\tStringBuffer" 0.06
"Feb"            0.06
" Collider"      0.06
" forced"        0.06
"-term"          0.06
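
This page does not say how the negative and positive logit lists were computed. A common approach for SAE features is to project the feature's decoder direction through the model's unembedding matrix and rank tokens by the resulting score. The sketch below illustrates that idea on toy data; the function name, the 2-dimensional vectors, and the three-token vocabulary are all hypothetical and are not taken from this dashboard.

```python
def rank_tokens_by_logit(decoder_direction, unembedding_rows, vocab, k=1):
    """Score each vocab token by the dot product of the feature's decoder
    direction with that token's unembedding row, then return the k most
    negative and k most positive (token, score) pairs.

    Assumption: this mirrors a common "logit lens" style analysis; it is
    not confirmed to be the method used by this dashboard.
    """
    scores = [
        sum(d * u for d, u in zip(decoder_direction, row))
        for row in unembedding_rows
    ]
    ranked = sorted(zip(vocab, scores), key=lambda pair: pair[1])
    most_negative = ranked[:k]
    most_positive = ranked[::-1][:k]
    return most_negative, most_positive


# Toy, made-up values: d_model = 2, vocabulary of 3 tokens.
toy_direction = [1.0, 0.0]
toy_unembedding = [[0.5, 1.0], [-0.5, 2.0], [0.0, 3.0]]
toy_vocab = ["a", "b", "c"]

negative, positive = rank_tokens_by_logit(toy_direction, toy_unembedding, toy_vocab)
print(negative, positive)
```

On this page every listed logit has a magnitude of about 0.06 to 0.07, so under this reading the feature would have only a weak direct effect on any single output token.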

Activations Density 0.103%
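
Activation density is usually read as the fraction of sampled token positions on which the feature fires. The sketch below shows that calculation under the assumption that "fires" means an activation strictly above zero; the helper name and the sample data are hypothetical, and the dashboard's exact definition is not given here.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds
    `threshold`. Assumption: a feature counts as active when its
    activation is strictly greater than the threshold (0.0 by default).
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Made-up example: the feature fires on 1 of 1000 token positions.
sample = [0.0] * 999 + [2.5]
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this definition, the reported 0.103% would correspond to roughly one active token position in every thousand sampled.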

acts of kindness

No Known Activations
