Explanations
phrases related to creating or causing something
Negative Logits
Token     Logit
thia      -0.67
phrine    -0.64
CLOSE     -0.63
assis     -0.60
Mania     -0.60
Niet      -0.59
Fram      -0.58
heter     -0.56
ban       -0.55
pac       -0.55
Positive Logits
Token          Logit
sure           1.20
hift           1.04
sense          0.89
landfall       0.89
strides        0.86
excuses        0.84
URE            0.82
adjustments    0.82
Sense          0.81
headlines      0.79
Activations Density: 2.135%