INDEX
Explanations
actions and states related to organization and mental well-being
Negative Logits
cheid    -0.16
isko     -0.16
prot     -0.15
ually    -0.15
POCH     -0.14
adt      -0.14
itself   -0.13
éd       -0.13
sj       -0.13
ndl      -0.13
Positive Logits
enough     0.18
WHILE      0.17
while      0.16
Enough     0.16
/get       0.16
whilst     0.15
physical   0.15
(er        0.15
doing      0.14
ness       0.14
Activations Density 0.254%