INDEX
Explanations
the word "Boot"
repeated occurrences of the word "foot."
New Auto-Interp
Negative Logits
lapse
-0.73
exha
-0.70
MIT
-0.64
ancest
-0.63
agon
-0.62
misunder
-0.62
riber
-0.61
agara
-0.61
ĻĤ
-0.60
galvan
-0.59
POSITIVE LOGITS
strap
1.14
hing
1.14
hed
1.09
oot
1.04
sie
1.03
ishly
0.99
iful
0.95
eers
0.94
stra
0.93
erness
0.91
Activations Density 0.023%