Explanations

references to political structures and alliances

oai_token-act-pair · gpt-4o-mini · Triggered by @bot

Configuration

Juliushanhanhan/llama-3-8b-it-res/blocks.25.hook_resid_post

Features: 65,536
Data Type: float32
Hook Name: blocks.25.hook_resid_post
Hook Layer: 25
Architecture: gated
Context Size: 1,024
Dataset: Juliushanhanhan/openwebtext-1b-llama3-tokenized-cxt-1024
Activation Function: relu

Negative Logits

 Malik    -0.16
metic     -0.15
rad       -0.15
ifier     -0.14
ORITY     -0.14
nodoc     -0.14
ATER      -0.14
ilate     -0.14
ồ         -0.14
ified     -0.14

Positive Logits

ition        0.44
itions       0.44
tion         0.42
ção          0.41
ITION        0.39
ations       0.36
ções         0.35
otion        0.34
ution        0.33
\u00adtion   0.33
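
Several of the tokens in the logit lists come from a byte-level BPE vocabulary, where each raw byte is displayed as a stand-in printable character. That is why a Portuguese suffix can show up as `Ã§Ã£o` rather than `ção`. The sketch below shows one way to map such display strings back to text. It assumes the tokenizer uses the GPT-2 style byte-to-unicode table, which is an assumption about this model's tokenizer rather than something stated on the page; `decode_token` is a hypothetical helper name.

```python
def bytes_to_unicode() -> dict[int, str]:
    """Rebuild the GPT-2 style byte -> printable-character table."""
    # Bytes that are already printable map to themselves.
    bs = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(ord("¡"), ord("¬") + 1))
        + list(range(ord("®"), ord("ÿ") + 1))
    )
    cs = bs[:]
    n = 0
    # Every remaining byte is shifted to an unused code point at 256 and above.
    for b in range(256):
        if b not in bs:
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return dict(zip(bs, (chr(c) for c in cs)))


# Invert the table: display character -> raw byte value.
BYTE_DECODER = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(display: str) -> str:
    """Turn a byte-level display string back into UTF-8 text (hypothetical helper)."""
    raw = bytes(BYTE_DECODER[ch] for ch in display)
    # Partial multi-byte sequences are replaced rather than raising.
    return raw.decode("utf-8", errors="replace")


for shown in ["Ã§Ã£o", "Ã§Ãµes", "ÂŃtion", "á»ĵ"]:
    print(repr(shown), "->", repr(decode_token(shown)))
```

Under that assumption, `ÂŃtion` decodes to a soft hyphen (U+00AD) followed by `tion`, which fits the pattern of the other top positive tokens: they are all "-tion" style suffixes.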

Activations Density 0.037%
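
To give the density figure some scale, the short calculation below converts it into an expected number of activating tokens per context window. It reads the density as the fraction of sampled tokens on which the feature has a nonzero activation, which is an assumption about how the dashboard defines it, and it uses the context size of 1,024 from the configuration.

```python
# Assumption: density = fraction of sampled tokens where the feature fires.
density = 0.037 / 100   # 0.037% expressed as a fraction
context_size = 1024     # tokens per context, from the configuration

# Expected number of activating tokens in one full context window.
expected_per_context = density * context_size

# Average spacing between activations, in tokens.
tokens_per_activation = 1 / density

print(f"expected activations per context: {expected_per_context:.3f}")
print(f"about one activation every {tokens_per_activation:.0f} tokens")
```

On this reading the feature fires on well under one token per context on average (about 0.38), so most contexts would contain no activation at all.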

No Known Activations