INDEX
Explanations
verbs and actions that indicate interpersonal interactions and relationships
Negative Logits
fait          -0.17
eca           -0.17
ITA           -0.16
compression   -0.16
Compression   -0.15
agan          -0.15
olik          -0.14
phin          -0.14
Compression   -0.14
reich         -0.14
Positive Logits
strup         0.17
Registers     0.16
ourcem        0.16
tal           0.15
Schmidt       0.14
anium         0.14
mtree         0.14
measure       0.14
Supply        0.13
mission       0.13
Activations Density: 0.354%