INDEX
Explanations
phrases indicating fear for someone's safety or life
concerns related to fear for personal safety or life
Negative Logits
Token        Logit
AMA          -0.57
Americans    -0.55
humans       -0.54
tourists     -0.52
gays         -0.52
americ       -0.50
African      -0.49
American     -0.49
consumers    -0.49
locals       -0.48
Positive Logits
Token        Logit
accomp       0.68
Citiz        0.68
conclud      0.64
idable       0.60
occas        0.58
atus         0.57
Move         0.57
ende         0.57
ä¹ĭ          0.56
æĪ¦          0.55
Activations Density: 0.668%
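The last two positive-logit tokens, ä¹ĭ and æĪ¦, look like byte-level BPE vocabulary strings rather than readable text. The sketch below assumes the underlying model uses a GPT-2 style byte-level tokenizer, which the page itself does not state, and reverses that tokenizer's byte-to-character table to recover the original characters. The helper names `bytes_to_unicode` and `decode_token` are illustrative, not part of any dashboard API.

```python
def bytes_to_unicode():
    """Byte -> printable-character table in the style of GPT-2 byte-level BPE."""
    # Bytes that are already printable map to themselves.
    bs = (list(range(0x21, 0x7F))      # '!' .. '~'
          + list(range(0xA1, 0xAD))    # U+00A1 .. U+00AC
          + list(range(0xAE, 0x100)))  # U+00AE .. U+00FF
    cs = bs[:]
    n = 0
    # Every remaining byte is assigned the next code point from U+0100 upward.
    for b in range(256):
        if b not in bs:
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return {b: chr(c) for b, c in zip(bs, cs)}


# Inverse table: vocabulary character -> raw byte.
CHAR_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(token: str) -> str:
    """Map a vocabulary string back to raw bytes, then decode them as UTF-8."""
    raw = bytes(CHAR_TO_BYTE[ch] for ch in token)
    return raw.decode("utf-8", errors="replace")


# The two tokens from the list, written as escapes so the source stays ASCII.
TOKENS = ["\u00e4\u00b9\u012d", "\u00e6\u012a\u00a6"]

for tok in TOKENS:
    text = decode_token(tok)
    # Print code points only, to avoid depending on terminal encoding.
    print([f"U+{ord(c):04X}" for c in text])
```

If the assumption holds, each token decodes to a single CJK character (U+4E4B and U+6226 respectively). Under a different tokenizer the strings should be left as they are shown.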