INDEX
Explanations
phrases expressing effort or striving to achieve the best outcome
New Auto-Interp
Negative Logits
lassen
-0.15
achts
-0.15
.tt
-0.14
hiba
-0.14
átek
-0.14
obra
-0.14
leneck
-0.14
-<?
-0.14
/of
-0.14
celik
-0.13
POSITIVE LOGITS
effort
0.20
efforts
0.18
Eff
0.16
اÙĨÙĩ
0.16
to
0.15
962
0.14
enty
0.14
Double
0.13
PIO
0.13
-eff
0.13
Activations Density 0.098%