INDEX
Explanations
the word "substitute"
occurrences of the word "substitute" and its variations
Negative Logits
Token        Logit
stra         -0.76
worthiness   -0.74
erer         -0.74
erers        -0.74
bra          -0.72
trust        -0.72
Bird         -0.70
raq          -0.70
Decay        -0.65
bear         -0.65
Positive Logits
Token        Logit
itute        1.09
utions       1.02
substit      0.88
substitutes  0.86
substitute   0.85
itutes       0.79
uting        0.79
Subst        0.78
substituted  0.76
aneous       0.75
Activations Density: 0.020%