INDEX

Explanations

mentions of names and their significance in various contexts

oai_token-act-pair · gpt-4o-mini Triggered by @bot

New Auto-Interp

Top Features by Cosine Similarity

Comparing With GEMMA-2-2B @ 4-gemmascope-res-16k

Configuration

google/gemma-scope-2b-pt-res/layer_4/width_16k/average_l0_124

Prompts (Dashboard)

36,864 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Features

16,384

Data Type

float32

Hook Name

blocks.4.hook_resid_post

Hook Layer

Architecture

jumprelu

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Activation Function

relu

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

riever

-0.53

iwa

-0.50

期刊论文

-0.48

 semej

-0.46

 environ

-0.45

 contenedor

-0.44

 guère

-0.43

 ottim

-0.42

 demuestra

-0.42

 oxígeno

-0.42

POSITIVE LOGITS

 name

2.46

 names

2.39

 Name

2.06

 Names

2.04

 NAME

1.93

 NAMES

1.73

 Namen

1.71

 naam

1.68

names

1.67

Name

1.63

Activations Density 0.174%

mentions of names and their significance in various contexts

No Comments

No Known Activations

mentions of names and their significance in various contexts

No Comments

No Known Activations