Explanations
mentions of specific entities or individuals
references to political figures and entities involved in controversies or scandals
Negative Logits
asha          -0.82
taboola       -0.74
gra           -0.71
aten          -0.69
uffle         -0.68
lander        -0.68
ilde          -0.65
Cosponsors    -0.64
iba           -0.63
atre          -0.62
Positive Logits
permission         0.93
pause              0.78
ample              0.73
thumbs             0.71
plenty             0.70
access             0.69
ample              0.68
valuable           0.67
nightmares         0.64
congratulations    0.64
Activations Density: 0.326%