INDEX
Explanations
expressions related to uncertainty or inconclusiveness
Head Attr Weights
0:0.02
1:0.03
2:0.02
3:0.05
4:0.02
5:0.05
6:0.34
7:0.03
8:0.02
9:0.03
10:0.10
11:0.23
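
The head attribution weights above can be ranked to see which heads dominate. The following is a minimal sketch, assuming the pairs are `head_index:weight` for twelve attention heads. The variable names are illustrative and not part of the dashboard.

```python
# Head attribution weights copied from the listing above (head index -> weight).
head_attr_weights = {
    0: 0.02, 1: 0.03, 2: 0.02, 3: 0.05, 4: 0.02, 5: 0.05,
    6: 0.34, 7: 0.03, 8: 0.02, 9: 0.03, 10: 0.10, 11: 0.23,
}

# Rank heads from largest to smallest weight.
ranked = sorted(head_attr_weights.items(), key=lambda kv: kv[1], reverse=True)

# Share of the total listed weight carried by the three largest heads.
total = sum(head_attr_weights.values())
top3 = ranked[:3]
top3_share = sum(weight for _, weight in top3) / total

for head, weight in top3:
    print(f"head {head}: {weight:.2f}")
print(f"top-3 share of listed weight: {top3_share:.2%}")
```

Under that reading, heads 6, 11, and 10 carry the largest weights. The listed values sum to 0.94 as displayed, so the share is computed against that total rather than against 1.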
Negative Logits
––
-4.31
●
-4.30
\\
-4.02
-4.01
\\\\
-3.81
||
-3.65
�
-3.60
-3.50
—-
-3.48
�
-3.47
Positive Logits
Templ
2.88
ulkan
2.88
Anim
2.87
Kyoto
2.81
mares
2.65
McH
2.61
Yad
2.60
mum
2.59
Metatron
2.58
MLB
2.58
Activations Density 0.359%