INDEX

Explanations

concepts related to altruism and selflessness in human behavior

oai_token-act-pair · gpt-4o-mini Triggered by @bot

Top Features by Cosine Similarity

Comparing With LLAMA3.1-8B @ 27-llamascope-res-32k
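
A ranking like this is usually built by comparing one feature's vector against the vectors of other features and sorting by cosine similarity. The sketch below illustrates that idea with toy 3-dimensional vectors; the names `feature_a`, `feature_b`, `feature_c` and all numbers are hypothetical, and which vectors the dashboard actually compares is an assumption not stated here.

```python
import math

def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def top_similar(query, candidates, k=3):
    """Return the k (name, similarity) pairs most similar to the query vector."""
    scored = [(name, cosine_similarity(query, vec)) for name, vec in candidates.items()]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)[:k]

# Hypothetical toy vectors, for illustration only.
query = [1.0, 0.0, 1.0]
candidates = {
    "feature_a": [2.0, 0.0, 2.0],  # same direction as the query
    "feature_b": [0.0, 1.0, 0.0],  # orthogonal to the query
    "feature_c": [1.0, 1.0, 0.0],  # partially aligned
}
ranked = top_similar(query, candidates, k=2)
print(ranked)
```

Cosine similarity ignores vector length, so `feature_a` ranks first even though it is twice as long as the query.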

Configuration

fnlp/Llama3_1-8B-Base-LXR-8x/Llama3_1-8B-Base-L27R-8x

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Features

32,768

Data Type

bfloat16

Hook Name

blocks.27.hook_resid_post

Hook Layer

27

Architecture

jumprelu

Context Size

1,024

Dataset

cerebras/SlimPajama-627B

Activation Function

relu
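
For readers unfamiliar with the `jumprelu` architecture named above: a JumpReLU unit passes a pre-activation through unchanged when it exceeds a per-feature threshold and outputs zero otherwise. The following is a minimal sketch of that rule only, not this SAE's implementation; the threshold values are hypothetical.

```python
def jumprelu(pre_acts, thresholds):
    """Keep each pre-activation only if it is strictly above its threshold."""
    return [x if x > t else 0.0 for x, t in zip(pre_acts, thresholds)]

def relu(pre_acts):
    """Plain ReLU, i.e. JumpReLU with every threshold set to zero."""
    return jumprelu(pre_acts, [0.0] * len(pre_acts))

pre_acts = [-0.5, 0.2, 0.9, 1.5]
thresholds = [0.3, 0.1, 1.0, 1.0]  # hypothetical per-feature thresholds

print(jumprelu(pre_acts, thresholds))  # small positive values below threshold are zeroed
print(relu(pre_acts))                  # only negative values are zeroed
```

With all thresholds at zero the function reduces to ordinary ReLU, which is why the two are often described together.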

Negative Logits

 chez

-0.18

 внутри

-0.15

inside

-0.15

لط

-0.14

Within

-0.13

within

-0.13

ufe

-0.13

Inside

-0.13

 Inch

-0.13

нить

-0.13

Positive Logits

in

0.68

 σε

0.32

在

0.29

 في

0.29

 în

0.27

ใน

0.26

 在

0.25

 ใน

0.25

 در

0.25

 in

0.24
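
Logit lists of this kind are commonly obtained by projecting a feature's decoder direction through the model's unembedding matrix and reading off the most negative and most positive per-token scores. Whether this dashboard computes them exactly that way is an assumption. The sketch below uses a hypothetical 2-dimensional direction and a four-token vocabulary with made-up weights.

```python
def logit_effects(decoder_direction, unembedding_rows, vocab):
    """Dot the feature direction with each token's unembedding row."""
    return {
        token: sum(d * w for d, w in zip(decoder_direction, row))
        for token, row in zip(vocab, unembedding_rows)
    }

def most_negative_and_positive(effects, k):
    """Return the k lowest-scoring and k highest-scoring (token, score) pairs."""
    ascending = sorted(effects.items(), key=lambda pair: pair[1])
    return ascending[:k], ascending[::-1][:k]

# Hypothetical toy values, for illustration only.
decoder_direction = [1.0, -0.5]
vocab = ["tok_a", "tok_b", "tok_c", "tok_d"]
unembedding_rows = [
    [0.6, -0.2],   # tok_a
    [-0.1, 0.1],   # tok_b
    [-0.1, 0.2],   # tok_c
    [0.1, 0.0],    # tok_d
]

effects = logit_effects(decoder_direction, unembedding_rows, vocab)
negative, positive = most_negative_and_positive(effects, k=2)
print("negative:", negative)
print("positive:", positive)
```

Tokens whose unembedding rows point along the feature direction land in the positive list, and tokens pointing against it land in the negative list.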

Activations Density 0.815%
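
Activation density is usually read as the fraction of token positions on which the feature fires. As a rough sketch, assuming the figure is measured over the dashboard prompts listed in the configuration (24,576 prompts of 128 tokens each), the arithmetic looks like this; that assumption and the toy activation list are not confirmed by the page.

```python
def activation_density(activations):
    """Fraction of token positions where the feature activation is above zero."""
    active = sum(1 for a in activations if a > 0)
    return active / len(activations)

# Toy example: the feature fires on 2 of 4 positions.
print(activation_density([0.0, 0.4, 0.0, 1.2]))

# Scale implied by the stated density, ASSUMING it is computed over the dashboard prompts.
total_tokens = 24_576 * 128              # prompts x tokens per prompt
approx_active = round(total_tokens * 0.00815)
print(total_tokens, approx_active)
```

Under that assumption, a density of 0.815% corresponds to roughly 25.6 thousand active token positions out of about 3.1 million.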

No Known Activations