INDEX
Explanations
topics related to journalism and press freedom
New Auto-Interp
Negative Logits
homicides
-0.15
uhn
-0.14
homic
-0.14
Murder
-0.14
911
-0.14
akistan
-0.14
Booth
-0.14
_fatal
-0.13
homicide
-0.13
bjerg
-0.13
POSITIVE LOGITS
Arbitrary
0.25
arbitrary
0.23
arbitrarily
0.19
langu
0.19
çĭ
0.16
çį
0.15
Fir
0.15
disappeared
0.15
XHR
0.15
Gest
0.14
Activations Density 0.063%