INDEX
Explanations
punctuation marks and symbols
New Auto-Interp
Head Attr Weights
0:0.06
1:0.16
2:0.05
3:0.08
4:0.07
5:0.14
6:0.05
7:0.13
8:0.06
9:0.05
10:0.05
11:0.06
Negative Logits
messenger
-2.61
Lynn
-2.60
�
-2.47
Sean
-2.36
Mrs
-2.33
Natalie
-2.32
Nurse
-2.27
silenced
-2.27
polar
-2.23
Ms
-2.20
POSITIVE LOGITS
icc
2.88
ACA
2.73
avascript
2.71
artz
2.69
isode
2.64
acc
2.63
eca
2.57
ternity
2.56
acteria
2.54
odic
2.54
Activations Density 0.000%