INDEX
Explanations
punctuation marks, specifically commas
Head Attr Weights
0:0.10
1:0.07
2:0.09
3:0.10
4:0.08
5:0.08
6:0.07
7:0.07
8:0.06
9:0.08
10:0.07
11:0.09
Negative Logits
erest: -2.71
�: -2.67
�: -2.55
裏�: -2.54
�: -2.52
Score: -2.50
stall: -2.44
osa: -2.42
emis: -2.41
selfie: -2.38
Positive Logits
IEEE: 3.00
Adin: 2.88
Diesel: 2.68
Gadget: 2.61
Morse: 2.60
FCC: 2.56
NEC: 2.55
Pagan: 2.55
Kob: 2.54
Hamm: 2.50
Activations Density 0.000%