INDEX

Explanations

mentions of specific dates or time-related concepts

oai_token-act-pair · gpt-4o-mini Triggered by @bot

New Auto-Interp

Configuration

Juliushanhanhan/llama-3-8b-it-res/blocks.25.hook_resid_post

Features

65,536

Data Type

float32

Hook Name

blocks.25.hook_resid_post

Hook Layer

Architecture

gated

Context Size

1,024

Dataset

Juliushanhanhan/openwebtext-1b-llama3-tokenized-cxt-1024

Activation Function

relu

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

st

-0.15

ohon

-0.14

ophe

-0.14

aversal

-0.14

uyu

-0.14

ocha

-0.14

Traits

-0.14

ButtonType

-0.14

 Warm

-0.14

POSITIVE LOGITS

February

0.27

Feb

0.24

Feb

0.23

 Valentine

0.23

 February

0.23

 Valent

0.22

feb

0.20

0.18

uary

0.17

bruary

0.17

Activations Density 0.011%

mentions of specific dates or time-related concepts

No Comments

No Known Activations