INDEX
Explanations
actions related to improvement or change
references to progress and improvement
Negative Logits
don          -0.36
mun          -0.34
ãĥ³ãĤ¸       -0.34
Cunning      -0.33
isl          -0.33
ãĥ¼ãĥĨãĤ£    -0.32
©¶æ          -0.32
equ          -0.31
sole         -0.31
elve         -0.30
Positive Logits
iatus                0.49
natureconservancy    0.48
NetMessage           0.46
APD                  0.44
atform               0.42
terness              0.41
ĵĺ                   0.39
Enlarge              0.39
earable              0.39
arcer                0.39
Activations Density: 3.816%