INDEX
Explanations
actions involving movement from one place to another
Negative Logits
esm               -0.74
ivia              -0.69
iation            -0.68
nesota            -0.67
understatement    -0.65
ãĥ¼ãĥĨãĤ£         -0.64
illary            -0.61
partName          -0.61
iosity            -0.60
osphere           -0.60
Positive Logits
loading           1.07
hang              1.05
drive             1.00
tones             0.97
rule              0.96
priced            0.90
lord              0.89
sold              0.89
grown             0.86
hung              0.86
Activations Density 0.083%