INDEX
Explanations
phrases related to investigations or inquiries into various incidents or events
repeated mentions of the term "investigating" in various contexts
Negative Logits
ãĤª     -0.72
âĹ¼     -0.70
aez     -0.67
hold    -0.66
ña      -0.66
bows    -0.66
eries   -0.64
ners    -0.63
icio    -0.62
-|      -0.62
Positive Logits
whether          0.93
allegations      0.93
alleged          0.77
probing          0.75
irregularities   0.75
incidents        0.74
UFOs             0.74
wrongdoing       0.73
independently    0.72
misconduct       0.71
Activations Density 0.048%
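
The density figure can be read as the share of token positions on which the feature fires. The sketch below illustrates that reading only; it assumes "active" means an activation strictly above a threshold (0 by default), and the function name `activation_density` and the toy data are hypothetical, not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the percentage of positions whose activation exceeds `threshold`.

    Assumption: a position counts as active when its value is strictly
    greater than the threshold. The dashboard may define this differently.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)


# Hypothetical toy data: 12 active positions out of 25,000 tokens.
toy = [0.0] * 24_988 + [1.5] * 12
density = activation_density(toy)
print(f"{density:.3f}%")  # prints "0.048%"
```

Under this reading, a density of 0.048% corresponds to roughly one active token in every 2,000, which fits a feature tied to a narrow topic such as investigations.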