Explanations
attends to numeric tokens from tokens that are relevant to mathematical operations and comparisons
Head Attr Weights
0:0.04
1:0.08
2:0.04
3:0.04
4:0.21
5:0.48
6:0.03
7:0.04
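As a rough illustration of how the head attribution weights above might be read, the following sketch ranks the heads and reports what share of the total weight the top two carry. The dictionary is copied from the listing; the helper name `top_heads` and the choice of `k=2` are illustrative assumptions, not part of the source. Because the listed values are rounded and need not sum to 1, the share is computed against their actual total.

```python
# Head attribution weights copied from the listing above (head index -> weight).
head_weights = {0: 0.04, 1: 0.08, 2: 0.04, 3: 0.04,
                4: 0.21, 5: 0.48, 6: 0.03, 7: 0.04}


def top_heads(weights, k=2):
    """Return the k (head, weight) pairs with the largest weight, largest first.

    Hypothetical helper for illustration only.
    """
    return sorted(weights.items(), key=lambda item: item[1], reverse=True)[:k]


# The rounded weights may not sum to exactly 1, so normalize by the real total.
total = sum(head_weights.values())
top = top_heads(head_weights)
top_share = sum(weight for _, weight in top) / total

print("top heads:", top)
print("total weight:", round(total, 2))
print("share carried by top heads:", round(top_share, 2))
```

On these numbers, heads 5 and 4 dominate, which suggests the feature's attention pattern is driven mostly by those two heads.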
Negative Logits
AddTagHelper      -0.32
ProtoMessage      -0.31
bootstrapcdn      -0.27
encodeWith        -0.26
AspNetCore        -0.25
NavController     -0.24
EndGlobalSection  -0.23
afficheront       -0.23
felder            -0.22
richTextPanel     -0.22
Positive Logits
pleaſure  0.36
myſelf    0.33
Diſ       0.32
reaſon    0.31
Reſ       0.31
houſe     0.31
ſche      0.31
Efq       0.30
poffible  0.30
raiſ      0.30
Activations Density 0.799%