Explanations
words related to emotional or psychological concepts and dynamics
Negative Logits
vig       -0.18
phinx     -0.17
liga      -0.16
rig       -0.15
unsch     -0.15
Vig       -0.15
Doc       -0.14
vig       -0.14
ignet     -0.14
/stats    -0.14
Positive Logits
Bail      0.17
æĩ        0.15
isser     0.14
pParent   0.14
ύ         0.14
λÏĮγ      0.14
/cop      0.13
nut       0.13
onder     0.13
ÑĢаÑħ      0.13
Activations Density 0.004%