Explanations

Criminals

Configuration

Prompts (Dashboard): 16,384 prompts, 128 tokens each

Dataset (Dashboard): monology/pile-uncopyrighted

Negative Logits

Token                Logit
" robbers"           -1.03
" thieves"           -1.01
" terrorists"        -0.97
" attackers"         -0.95
" CreateTagHelper"   -0.95
" scammer"           -0.94
" gunmen"            -0.92
" thief"             -0.91
" scammers"          -0.91
" insurgents"        -0.87

Positive Logits

Token                Logit
"who"                0.49
"hores"              0.48
"sons"               0.45
" clear"             0.44
" engaged"           0.44
" living"            0.44
"removeClass"        0.43
" العام"             0.43
"aults"              0.42

Activations Density 0.085%
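
As a rough sanity check on the density figure, the sketch below estimates how many tokens in the dashboard prompt set the feature fires on. It assumes, which the page does not state, that the density is the fraction of all dashboard tokens with a nonzero activation. The variable names are illustrative only.

```python
# Assumption (not stated on the page): activation density is the share of
# dashboard tokens on which the feature has a nonzero activation.
NUM_PROMPTS = 16_384        # from "Prompts (Dashboard)"
TOKENS_PER_PROMPT = 128     # from "Prompts (Dashboard)"
DENSITY_PERCENT = 0.085     # from "Activations Density"

# Total number of tokens across all dashboard prompts.
total_tokens = NUM_PROMPTS * TOKENS_PER_PROMPT

# Estimated number of tokens where the feature is active, under the assumption above.
estimated_active_tokens = round(total_tokens * DENSITY_PERCENT / 100)

print(f"total tokens: {total_tokens:,}")
print(f"estimated active tokens: {estimated_active_tokens:,}")
```

Under that assumption, the feature would be active on roughly 1,783 of the 2,097,152 dashboard tokens.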