INDEX
Explanations
mentions of political figures and their statements or actions
New Auto-Interp
Negative Logits
SequentialGroup
-0.56
UnsafeEnabled
-0.55
$_['
-0.51
ModelRenderer
-0.51
enumii
-0.51
UrlResolution
-0.50
ThroughAttribute
-0.50
ViewImports
-0.49
nisso
-0.46
enumi
-0.44
POSITIVE LOGITS
hinted
0.86
hinting
0.75
reiterated
0.75
insinu
0.71
implicitly
0.70
implied
0.69
alluded
0.68
hints
0.67
vowed
0.66
reiter
0.65
Activations Density 0.514%