INDEX
Explanations
phrases emphasizing the importance of effort and opportunity
New Auto-Interp
Negative Logits
obe
-0.15
pez
-0.14
lify
-0.14
DCF
-0.14
axed
-0.14
lahoma
-0.14
alore
-0.14
-cn
-0.14
ance
-0.14
污
-0.14
POSITIVE LOGITS
compens
0.20
compensated
0.18
compensate
0.17
orte
0.16
oran
0.16
ecc
0.15
owski
0.15
ût
0.15
çĶŁãģį
0.14
orta
0.14
Activations Density 0.308%