INDEX
Explanations
attends to numeric tokens from themselves and adjacent tokens
New Auto-Interp
Head Attr Weights
0:0.07
1:0.21
2:0.09
3:0.04
4:0.15
5:0.25
6:0.07
7:0.08
Negative Logits
للمعارف
-0.40
للاسماء
-0.38
дописавши
-0.37
linkovi
-0.33
ValueGeneration
-0.32
FontOfSize
-0.31
nakalista
-0.31
ConstraintMaker
-0.31
joaat
-0.28
Italijani
-0.28
POSITIVE LOGITS
unately
0.30
XmlAccessType
0.29
outheast
0.27
skrä
0.26
edly
0.26
eterangan
0.26
cioc
0.26
martre
0.26
anzu
0.25
OMET
0.25
Activations Density 0.104%