INDEX
Explanations
specific particle and suffix combinations in words
New Auto-Interp
Head Attr Weights
0:0.02
1:0.03
2:0.05
3:0.05
4:0.04
5:0.04
6:0.44
7:0.03
8:0.05
9:0.06
10:0.08
11:0.05
Negative Logits
��
-1.41
Advertisement
-1.25
submission
-1.24
exclusive
-1.22
/-
-1.22
ollah
-1.19
••
-1.15
PRES
-1.14
lifting
-1.14
Tara
-1.13
POSITIVE LOGITS
emort
1.62
*/(
1.56
tarians
1.54
etooth
1.49
encia
1.49
hett
1.46
iani
1.41
hid
1.41
ciples
1.38
agall
1.34
Activations Density 0.008%