INDEX

Explanations

Scientific/mathematical data with symbols

np_max-act-logits · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_27/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.27.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 JObject

-0.06

袖

-0.06

gv

-0.06

treeview

-0.06

ेवल

-0.06

DOCUMENT

-0.06

_formatter

-0.06

.reverse

-0.06

os

-0.06

 Revised

-0.06

POSITIVE LOGITS

-induced

0.07

 exemp

0.07

 fingerprints

0.07

виж

0.07

 пом

0.07

 پر

0.06

_cl

0.06

Subtitle

0.06

 optimistic

0.06

čů

0.06

Activations Density 0.002%

Scientific/mathematical data with symbols

No Comments

No Known Activations