INDEX
Explanations
instances of failure or disappointment
instances of the word "failed."
Negative Logits
erville   -0.65
iewicz    -0.64
optional  -0.63
squared   -0.61
breeze    -0.61
crates    -0.59
dar       -0.59
seed      -0.58
iser      -0.57
andise    -0.57
Positive Logits
miser        1.70
catast       1.04
afe          1.03
dism         0.99
horribly     0.97
lect         0.95
fully        0.95
spectacular  0.90
ingly        0.89
muster       0.82
Activations Density: 0.038%