INDEX
Explanations
terms related to physical places or locations
words and concepts related to crime and danger
New Auto-Interp
Negative Logits
emale
-0.65
Niet
-0.61
Frie
-0.60
farious
-0.58
helicop
-0.55
Azerb
-0.55
Nare
-0.55
ADRA
-0.55
lyak
-0.54
pheus
-0.54
POSITIVE LOGITS
TEXT
0.59
lement
0.59
]
0.58
actionDate
0.57
âĢº
0.54
hoax
0.53
][
0.52
BUS
0.51
::
0.50
Vulkan
0.50
Activations Density 0.329%