INDEX
Explanations
punctuation and formatting elements in the text
Head Attr Weights
0:0.20
1:0.18
2:0.06
3:0.04
4:0.05
5:0.09
6:0.04
7:0.04
8:0.06
9:0.06
10:0.05
11:0.07
Negative Logits
seq       -1.97
Colo      -1.76
doubles   -1.74
Bravo     -1.69
ace       -1.60
Chal      -1.58
Burr      -1.54
cake      -1.52
chio      -1.52
Rica      -1.46
Positive Logits
none             1.89
minist           1.88
otent            1.86
Library          1.81
Unable           1.80
University       1.77
DragonMagazine   1.75
Redditor         1.73
Government       1.72
aton             1.72
Activations Density 0.000%