INDEX
Explanations
sentence endings and punctuation marks
New Auto-Interp
Head Attr Weights
0:0.27
1:0.10
2:0.06
3:0.04
4:0.03
5:0.02
6:0.10
7:0.08
8:0.03
9:0.04
10:0.12
11:0.05
Negative Logits
Bell
-3.54
Sailor
-3.45
romeda
-3.41
Telecommunications
-3.32
Leviathan
-3.25
Oz
-3.11
Nept
-3.05
iodine
-3.05
ot
-3.04
Eps
-2.99
POSITIVE LOGITS
Frazier
10.28
Fraz
4.42
Maul
3.98
Frie
3.77
Fritz
3.72
Kaufman
3.68
Fur
3.67
Zucker
3.59
Fischer
3.50
Fowler
3.49
Activations Density 0.001%