INDEX

Explanations

the word "unusual."

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

Mille

-0.72

elsa

-0.63

ela

-0.61

 Kirke

-0.61

MemoryWarning

-0.61

?>">

-0.60

 amazed

-0.59

Free

-0.58

PLES

-0.58

uride

-0.58

POSITIVE LOGITS



1.02

TagMode

0.99

unusual

0.93

 Unusual

0.89

 inusual

0.84

 unusual

0.82

 ungewöhn

0.82

Unusual

0.74

endpush

0.74

 uncommon

0.72

Activations Density 0.014%