INDEX
Explanations
punctuation marks and specific function words
Head Attr Weights
0:0.06
1:0.04
2:0.06
3:0.04
4:0.06
5:0.05
6:0.19
7:0.05
8:0.08
9:0.25
10:0.03
11:0.04
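One way to read the head attribution weights listed above is to rank them and see which heads dominate. The snippet below is only a minimal sketch: it assumes each entry is a `head_index:weight` pair, and the helper name `top_heads` is hypothetical, not part of any dashboard tooling.

```python
# Head attribution weights copied from the listing above (head index -> weight).
head_weights = {
    0: 0.06, 1: 0.04, 2: 0.06, 3: 0.04, 4: 0.06, 5: 0.05,
    6: 0.19, 7: 0.05, 8: 0.08, 9: 0.25, 10: 0.03, 11: 0.04,
}


def top_heads(weights, k=2):
    """Return the k (head, weight) pairs with the largest weight (hypothetical helper)."""
    return sorted(weights.items(), key=lambda kv: kv[1], reverse=True)[:k]


# The listed values are rounded, so their total need not be exactly 1.
total = sum(head_weights.values())

for head, weight in top_heads(head_weights):
    print(f"head {head}: weight {weight:.2f}, {weight / total:.0%} of the listed total")
```

Under these assumptions, heads 9 and 6 carry the largest weights, while the remaining heads each stay at or below 0.08.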
Negative Logits
Hawai      -4.26
packing    -3.95
aho        -3.75
�          -3.58
Hawaiian   -3.56
pack       -3.48
Pilgrim    -3.38
hun        -3.36
uo         -3.33
packs      -3.32
Positive Logits
Der        9.63
Der        9.24
Derby      8.40
der        6.40
der        5.85
DER        5.08
Dahl       5.01
Dixon      4.78
Die        4.73
Die        4.65
Activations Density 0.003%