INDEX
Explanations
instances of the word "fail" and its derivatives in various contexts
Negative Logits
andise        -0.70
iliary        -0.69
orthy         -0.67
ript          -0.65
rete          -0.62
iewicz        -0.62
sovere        -0.61
Ec            -0.60
population    -0.60
crates        -0.59
Positive Logits
miser          1.47
catast         1.05
horribly       1.02
afe            0.98
dism           0.88
lect           0.84
spectacular    0.82
DEV            0.79
muster         0.78
ingly          0.75
Activations Density: 0.021%