INDEX
Explanations
phrases related to controversial or newsworthy events and topics
phrases related to controversies and serious incidents
New Auto-Interp
Negative Logits
++++++++++++++++
-0.66
++++++++
-0.60
âĿ
-0.59
:=
-0.57
-)
-0.55
Fold
-0.55
Compact
-0.55
compact
-0.54
')
-0.54
entary
-0.54
POSITIVE LOGITS
culminated
0.75
culminating
0.69
SPONSORED
0.69
triggering
0.67
prompted
0.66
purportedly
0.65
purported
0.63
resulted
0.63
ostensibly
0.63
itled
0.63
Activations Density 0.849%