INDEX

Explanations

negative sentiments and emotional expressions towards situations

oai_token-act-pair · gpt-4o-mini Triggered by @bot

Configuration

Juliushanhanhan/llama-3-8b-it-res/blocks.25.hook_resid_post

Features

65,536

Data Type

float32

Hook Name

blocks.25.hook_resid_post

Hook Layer

Architecture

gated

Context Size

1,024

Dataset

Juliushanhanhan/openwebtext-1b-llama3-tokenized-cxt-1024

Activation Function

relu

Negative Logits

yeah

-0.15

icone

-0.14

 Yeah

-0.14

lore

-0.14

icot

-0.14

oví

-0.14

자인

-0.14

ewan

-0.13

enko

-0.13

inya

-0.13

Positive Logits

no

0.26

not

0.22

 absolutely

0.20

sir

0.20

 wait

0.19

 seriously

0.19

-No

0.19

_no

0.19

-no

0.18

No

0.18

Activations Density 0.038%
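
To put the density figure in concrete terms, the following is a minimal sketch that converts 0.038% into an expected firing rate. It assumes that "activation density" means the fraction of token positions on which the feature is nonzero, which this page does not state; the variable names are illustrative only.

```python
# Assumption: activation density = fraction of token positions where the
# feature is nonzero. The page itself does not define the term.
density = 0.038 / 100   # 0.038% expressed as a fraction
context_size = 1024     # tokens per context, from the configuration above

# Average spacing between activations, in tokens.
tokens_per_activation = 1 / density

# Expected number of activations within one full context window.
expected_per_context = density * context_size

print(f"about one activation every {tokens_per_activation:.0f} tokens")
print(f"about {expected_per_context:.2f} activations per {context_size}-token context")
```

Under that assumption the feature is sparse: on average it would fire less than once per 1,024-token context.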

No Known Activations
