INDEX

Explanations

affirmations or confirmations of statements

oai_token-act-pair · gpt-4o-mini Triggered by @bot

New Auto-Interp

Configuration

Juliushanhanhan/llama-3-8b-it-res/blocks.25.hook_resid_post

Features

65,536

Data Type

float32

Hook Name

blocks.25.hook_resid_post

Hook Layer

Architecture

gated

Context Size

1,024

Dataset

Juliushanhanhan/openwebtext-1b-llama3-tokenized-cxt-1024

Activation Function

relu

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 Forg

-0.15

elay

-0.15

mind

-0.14

bor

-0.14

aka

-0.14

Ã¶h

-0.14

-sort

-0.14

âĨ

-0.13

nid

-0.13

âĨĳ

-0.13

POSITIVE LOGITS

 answer

0.17

option

0.17

çŃĶæ¡Ī

0.17

 Options

0.17

 correct

0.16

 solution

0.16

 hint

0.16

Which

0.16

answer

0.16

 Which

0.16

Activations Density 0.106%

affirmations or confirmations of statements

No Comments

No Known Activations