INDEX
Explanations
instances of uncertainty or disbelief
New Auto-Interp
Negative Logits
IMG
-0.16
ìĪ
-0.15
neau
-0.14
bol
-0.14
ubl
-0.14
rights
-0.14
ê
-0.14
ibre
-0.14
bond
-0.14
Oakland
-0.14
POSITIVE LOGITS
Filed
0.34
Categories
0.26
Categories
0.20
Filed
0.20
Source
0.19
lash
0.18
LOOK
0.17
Category
0.16
άλ
0.16
943
0.16
Activations Density 0.003%