INDEX
Explanations
attends to significant nouns or concepts from corresponding identifiers or descriptors later in the sequence
New Auto-Interp
Head Attr Weights
0:0.13
1:0.15
2:0.12
3:0.11
4:0.15
5:0.04
6:0.12
7:0.14
Negative Logits
становника
-0.44
AssemblyTitle
-0.43
InjectAttribute
-0.38
헌
-0.35
istoitu
-0.35
böz
-0.35
кономі
-0.35
vailability
-0.34
loroethene
-0.34
hound
-0.34
POSITIVE LOGITS
AndEndTag
0.44
'\\;'
0.41
normais
0.35
elettrica
0.34
ագրություններ
0.34
giuri
0.33
viață
0.32
universitarios
0.32
actuelles
0.32
cantit
0.32
Activations Density 23.329%