INDEX
Explanations
punctuation marks and formatting indicators
New Auto-Interp
Head Attr Weights
0:0.06
1:0.01
2:0.05
3:0.03
4:0.05
5:0.03
6:0.20
7:0.04
8:0.04
9:0.38
10:0.01
11:0.03
Negative Logits
Messenger
-3.66
pumpkin
-3.54
gh
-3.46
Pumpkin
-3.32
zbollah
-3.27
�
-3.21
Burlington
-3.16
575
-3.14
Shannon
-3.12
OnePlus
-3.11
POSITIVE LOGITS
Ed
8.57
Ed
8.25
ED
6.79
ed
6.77
Edwards
6.38
Edmund
5.46
ED
5.28
Edward
5.21
Edwin
5.20
Edward
5.07
Activations Density 0.017%