INDEX

Explanations

phrases that emphasize personal agency and responsibility

oai_token-act-pair · gpt-4o-mini Triggered by @bot

New Auto-Interp

Configuration

Juliushanhanhan/llama-3-8b-it-res/blocks.25.hook_resid_post

Features

65,536

Data Type

float32

Hook Name

blocks.25.hook_resid_post

Hook Layer

Architecture

gated

Context Size

1,024

Dataset

Juliushanhanhan/openwebtext-1b-llama3-tokenized-cxt-1024

Activation Function

relu

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

stal

-0.16

entai

-0.15

UNUSED

-0.14

ayne

-0.14

_axes

-0.14

ULK

-0.13

]={↵

-0.13

Ø±Ø®

-0.13

sti

-0.13

aug

-0.13

POSITIVE LOGITS

can

0.34

åı¯ä»¥

0.26

 should

0.23

 ought

0.23

 could

0.22

 dapat

0.19

can

0.19

 à¤¸à¤ķà¤¤

0.19

should

0.19

Can

0.19

Activations Density 0.055%

phrases that emphasize personal agency and responsibility

No Comments

No Known Activations

phrases that emphasize personal agency and responsibility

No Comments

No Known Activations