INDEX

Explanations

problems with foreign words

The neuron fires on sentences that describe a change in behavior or user‐impact (e.g. “This was working… but… it no longer does” or “Will the users find it annoying…?”).

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

Negative Logits

robat

-1.33

 AUSSI

-1.27

 նաև

-1.18

時も

-1.17

何より

-1.17

を高

-1.14

澌

-1.14

リラックス

-1.14

ceiver

-1.13

igerung

-1.13

POSITIVE LOGITS

ต้อง

1.45

 problems

1.32

 would

1.29

 unable

1.27

จะ

1.23

問題

1.20

InjectAttribute

1.20

 Nước

1.18

问题

1.18

 समस्या

1.16

Activations Density 0.082%