INDEX
Explanations
complex descriptive phrases related to perception and existential concepts
describing qualities or importance
German, Danish, and Dutch words
New Auto-Interp
Negative Logits
agerie
-0.73
harem
-0.68
sidekick
-0.66
ambigu
-0.66
smog
-0.65
acrob
-0.65
superpowers
-0.64
dandy
-0.64
backstory
-0.63
profan
-0.63
POSITIVE LOGITS
faßt
0.40
groote
0.34
ausges
0.34
skall
0.33
zijne
0.32
formule
0.32
UserScript
0.30
politie
0.29
setEmail
0.29
imprese
0.29
Activations Density 0.731%