INDEX
Explanations
phrases indicating desire or preference
expressions of desire or personal wants
New Auto-Interp
Negative Logits
Archer
-0.60
âĹı
-0.54
++++++++++++++++
-0.54
FANTASY
-0.53
Indust
-0.53
âĨ
-0.53
Transparency
-0.53
Geh
-0.51
Orn
-0.48
Hard
-0.48
POSITIVE LOGITS
illet
0.59
ocument
0.59
strap
0.58
destined
0.58
liest
0.56
accompl
0.52
deem
0.52
muster
0.51
wrought
0.51
lawfully
0.50
Activations Density 1.431%