INDEX
Explanations
phrases related to acquiring benefits or advantages
Negative Logits
pory      -0.61
Roberts   -0.57
chede     -0.57
sted      -0.56
Morton    -0.55
upsi      -0.55
precau    -0.55
ohs       -0.55
dı        -0.53
motic     -0.53
Positive Logits
gain      3.27
Gain      3.15
gain      3.01
GAIN      2.94
gains     2.93
Gain      2.93
gained    2.72
Gains     2.67
gains     2.51
gaining   2.48
Activations Density 0.043%