INDEX
Explanations
attends to numeric tokens from the dialogue's context or narrative elements
New Auto-Interp
Head Attr Weights
0:0.10
1:0.08
2:0.06
3:0.06
4:0.08
5:0.41
6:0.06
7:0.11
Negative Logits
UnusedPrivate
-0.36
elemField
-0.36
<=",
-0.36
CreateTagHelper
-0.35
contentLoaded
-0.32
برانيه
-0.31
Roskov
-0.31
Попис
-0.30
مشين
-0.29
EconPapers
-0.29
POSITIVE LOGITS
tornillo
0.22
act
0.22
loose
0.21
lo
0.20
nahm
0.20
mybatis
0.20
]))
0.19
LO
0.19
"",
0.18
виться
0.18
Activations Density 0.066%