INDEX
Explanations
punctuation marks, specifically colons
New Auto-Interp
Head Attr Weights
0:0.07
1:0.06
2:0.10
3:0.08
4:0.09
5:0.06
6:0.08
7:0.09
8:0.08
9:0.06
10:0.09
11:0.08
Negative Logits
seekers
-1.90
thumbnails
-1.70
ENSE
-1.68
"]=>
-1.64
tarians
-1.64
eers
-1.60
dream
-1.58
verse
-1.58
chio
-1.53
giveaways
-1.53
POSITIVE LOGITS
mart
1.74
abase
1.72
tops
1.65
Ples
1.64
downed
1.64
leck
1.61
hower
1.61
Brist
1.59
untled
1.59
pear
1.56
Activations Density 0.000%