INDEX
Explanations
verbs indicating taking action or making changes
the word "make" in various contexts
New Auto-Interp
Negative Logits
76561
-0.73
scrimmage
-0.63
---------
-0.62
Vest
-0.61
Leban
-0.61
Ging
-0.60
nurs
-0.59
Mub
-0.58
bour
-0.58
Wick
-0.58
POSITIVE LOGITS
sure
1.04
hift
0.94
ailable
0.84
emort
0.82
awaru
0.82
urate
0.78
iversal
0.77
itives
0.76
amera
0.74
itions
0.74
Activations Density 0.130%