INDEX
Explanations
Attends to tokens that are structurally significant or that denote sections in a technical or mathematical context, from tokens with no clear reference.
Head Attr Weights
0:0.53
1:0.17
2:0.06
3:0.04
4:0.03
5:0.04
6:0.03
7:0.06
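To read the weights above at a glance, the short sketch below ranks the heads and reports how much of the listed total the top heads carry. It only rearranges the eight values shown; the helper names `rank_heads` and `cumulative_share` are hypothetical and are not part of any dashboard API.

```python
# Head attribution weights copied from the list above (head index -> weight).
head_weights = {0: 0.53, 1: 0.17, 2: 0.06, 3: 0.04, 4: 0.03, 5: 0.04, 6: 0.03, 7: 0.06}


def rank_heads(weights):
    """Return (head, weight) pairs sorted from largest to smallest weight."""
    return sorted(weights.items(), key=lambda kv: kv[1], reverse=True)


def cumulative_share(weights, top_k):
    """Fraction of the listed total carried by the top_k heads (hypothetical helper)."""
    total = sum(weights.values())
    top = sum(w for _, w in rank_heads(weights)[:top_k])
    return top / total


ranked = rank_heads(head_weights)
top_head, top_weight = ranked[0]

if __name__ == "__main__":
    for head, weight in ranked:
        print(f"head {head}: {weight:.2f}")
    print(f"share of listed total in top 2 heads: {cumulative_share(head_weights, 2):.3f}")
```

The listed weights are rounded to two decimals, so the shares computed from them are approximate.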
Negative Logits
ſp          -0.56
itſelf      -0.53
متعلقه      -0.53
myſelf      -0.53
raiſ        -0.52
deſt        -0.51
ſte         -0.50
ſche        -0.49
ſta         -0.48
poffible    -0.47
Positive Logits
asteroide          0.32
@"/                0.31
-                  0.30
AndEndTag          0.30
–                  0.28
DispatchToProps    0.28
ContentAsync       0.28
,                  0.28
typeparam          0.28
serve              0.27
Activations Density: 2.104%