Explanations

HTML form fields

np_max-act · gemini-2.0-flash

The neuron responds to form field attribute names and values (especially name= and id= in HTML form inputs and selects).

oai_token-act-pair · o4-mini Triggered by @xinyanhu8
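
To make the explanation above concrete, here is a minimal sketch of the kind of text it describes: name= and id= attributes on HTML form inputs and selects. The sample markup, the `FormFieldAttrCollector` class, and the `collect_form_field_attrs` helper are hypothetical and for illustration only. They are not taken from the dashboard or from the feature's activation data, and the sketch does not run the model or the SAE. It only uses Python's standard `html.parser` to pick out the attribute names and values the explanation refers to.

```python
from html.parser import HTMLParser

# Hypothetical sample markup (an assumption, not data from the dashboard).
SAMPLE_HTML = """
<form action="/signup" method="post">
  <input type="text" name="user_email" id="email-field">
  <select name="country" id="country-select">
    <option value="us">United States</option>
  </select>
  <textarea name="bio"></textarea>
</form>
"""

# The explanation singles out inputs and selects, and the name= / id= attributes.
FIELD_TAGS = {"input", "select"}
FIELD_ATTRS = {"name", "id"}


class FormFieldAttrCollector(HTMLParser):
    """Collect (tag, attribute, value) triples for name= and id= on form fields."""

    def __init__(self):
        super().__init__()
        self.found = []

    def handle_starttag(self, tag, attrs):
        # Skip tags the explanation does not mention (form, option, textarea, ...).
        if tag not in FIELD_TAGS:
            return
        for attr, value in attrs:
            if attr in FIELD_ATTRS:
                self.found.append((tag, attr, value))


def collect_form_field_attrs(html):
    """Return the name=/id= attributes found on input and select tags, in document order."""
    parser = FormFieldAttrCollector()
    parser.feed(html)
    parser.close()
    return parser.found


if __name__ == "__main__":
    for tag, attr, value in collect_form_field_attrs(SAMPLE_HTML):
        print(f"<{tag}> {attr}={value!r}")
```

Spans like `name="user_email"` or `id="country-select"` are what the explanation suggests the feature would respond to. Whether it actually fires on this sample is not something the sketch can confirm.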

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.11.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted

Negative Logits

Bed | -0.07
ponsible | -0.07
_review | -0.06
 الزر | -0.06
such | -0.06
 river | -0.06
("\" | -0.06
かに | -0.06
 convinc | -0.06
 hydr | -0.06

Positive Logits

 threadIdx | 0.07
populate | 0.06
agrant | 0.06
*L | 0.06
 pItem | 0.06
>[ | 0.06
 Calendar | 0.06
uctive | 0.06
 ISSUE | 0.06
.apiUrl | 0.06

Activations Density: 0.013%
