INDEX
Explanations
words related to failure or unsuccessful outcomes
occurrences of the word "fail" in various contexts
New Auto-Interp
Negative Logits
ourt
-0.73
iliary
-0.69
dar
-0.69
utra
-0.68
Austral
-0.65
arya
-0.64
Lug
-0.63
rete
-0.62
onen
-0.62
ript
-0.61
POSITIVE LOGITS
miser
1.24
ingly
0.93
catast
0.88
horribly
0.86
lect
0.86
dism
0.84
afe
0.81
fail
0.79
hard
0.78
DEV
0.72
Activations Density 0.019%