Explanations
phrases related to organized endeavors or attempts to achieve specific outcomes
Negative Logits
aison      -0.17
enus       -0.16
irket      -0.16
antha      -0.16
pire       -0.16
arend      -0.15
heit       -0.14
akt        -0.14
kad        -0.14
URA        -0.14
Positive Logits
lessly     0.25
lessness   0.23
effort     0.23
efforts    0.20
Eff        0.19
orts       0.18
ful        0.17
uated      0.16
towards    0.16
orst       0.15
Activations Density: 0.020%