INDEX
Explanations
phrases related to political controversy and accusations
New Auto-Interp
Negative Logits
Diseases
-0.16
diseases
-0.15
Impl
-0.15
sez
-0.14
wars
-0.14
Fires
-0.14
ammable
-0.13
Bom
-0.13
launcher
-0.13
วà¸ģ
-0.13
POSITIVE LOGITS
incident
0.60
event
0.50
episode
0.47
incident
0.44
Incident
0.39
encounter
0.38
äºĭä»¶
0.37
events
0.36
experience
0.35
episode
0.35
Activations Density 0.280%