Explanations
instances of the word "decided."
Negative Logits
Token            Logit
licorne          -0.46
readlines        -0.46
Facility         -0.45
Stunning         -0.45
Facility         -0.43
"../../../../    -0.43
McIntosh         -0.43
facility         -0.42
hoa              -0.41
Mariners         -0.41
Positive Logits
Token            Logit
decided          0.93
decided          0.84
decide           0.73
decides          0.68
decide           0.66
décidé           0.66
Decide           0.65
Decided          0.63
решили           0.62
Decided          0.62
Activations Density: 0.009%