INDEX
Explanations
certainty and assurance in statements
New Auto-Interp
Negative Logits
æŁĦ
-0.16
igest
-0.15
654
-0.15
åĬ¹
-0.15
ullen
-0.14
ichte
-0.14
ÏģÏį
-0.14
pis
-0.14
اÙĦا
-0.13
als
-0.13
POSITIVE LOGITS
sure
0.56
certainty
0.54
sure
0.46
Sure
0.46
Sure
0.39
certain
0.39
confidence
0.38
cert
0.36
confident
0.35
SURE
0.35
Activations Density 0.154%