INDEX

Explanations

musical style

np_max-act · gemini-2.0-flash

The neuron activates on language describing a shift or change in musical style or direction.

oai_token-act-pair · o4-mini Triggered by @xinyanhu8

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.11.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

_router

-0.08

-read

-0.07

 contradictory

-0.07

(login

-0.07

def

-0.07

 Josh

-0.06

 imgs

-0.06

_height

-0.06

чит

-0.06

izers

-0.06

POSITIVE LOGITS

 bíl

0.07

üml

0.07

 đỏ

0.06

ऑ

0.06

">',↵

0.06

uyên

0.06

<>

0.06

.General

0.06

ヨ

0.06

reon

0.06

Activations Density 0.064%

musical style

The neuron activates on language describing a shift or change in musical style or direction.

No Comments

No Known Activations

musical style

The neuron activates on language describing a shift or change in musical style or direction.

No Comments

No Known Activations