Explanations
punctuation marks, specifically commas
Head Attr Weights
0:0.08
1:0.08
2:0.06
3:0.09
4:0.07
5:0.09
6:0.08
7:0.07
8:0.09
9:0.07
10:0.08
11:0.08
Negative Logits
CBS: -2.83
Sov: -2.75
�: -2.69
762: -2.59
dilig: -2.58
paran: -2.58
ouver: -2.53
kefeller: -2.51
coon: -2.49
visor: -2.48
Positive Logits
Thomson: 2.93
Gender: 2.87
*,: 2.79
Uganda: 2.57
transsexual: 2.54
Shiny: 2.44
Unicode: 2.42
gender: 2.41
Alexa: 2.41
MLA: 2.40
Activations Density: 0.000%