INDEX

Explanations

hates

New Auto-Interp

Configuration

Prompts (Dashboard)

16,384 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 propOrder

-1.27

cean

-1.23

 متعلقه

-0.96

ArrowToggle

-0.92

Personendaten

-0.92

contentLoaded

-0.91

="@+

-0.85

OCCURRED

-0.84

 Мексичка

-0.83

FunctionFlags

-0.82

POSITIVE LOGITS

0.52

rm

0.50

udi

0.44

all

0.43

ev

0.43

NN

0.43

ania

0.43

othermic

0.43

ner

0.42

no

0.42

Activations Density 0.022%