INDEX

Explanations

crucial/critical for

The neuron detects emphasis/importance language — words and phrases that mark something as important, crucial, key, or otherwise strongly emphasized in explanatory or instructional text.

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

Negative Logits

util

0.22

style

0.21

zen

0.21

code

0.19

tree

0.19

time

0.19

std

0.18

or

0.18

ute

0.18

day

0.18

POSITIVE LOGITS

 для

0.40

 untuk

0.36

 pentru

0.34

 براي

0.33

 برای

0.33

 bagi

0.32

 für

0.30

dla

0.30

 עבור

0.29

สำหรับ

0.29

Activations Density 0.840%