INDEX
Explanations
statements and claims made by organizations and authorities
New Auto-Interp
Negative Logits
ContentType
-0.17
uis
-0.16
hora
-0.16
ubb
-0.15
Carb
-0.15
986
-0.15
McA
-0.14
ave
-0.14
и
-0.14
eral
-0.14
POSITIVE LOGITS
ycastle
0.17
ycop
0.15
ç¿Ķ
0.15
laden
0.15
adam
0.15
GOODS
0.14
veau
0.14
tô
0.14
acles
0.14
056
0.14
Activations Density 0.048%