INDEX
Explanations
phrases related to drawing attention, particularly in the context of politics and activism
New Auto-Interp
Negative Logits
nown
-0.77
ican
-0.75
keep
-0.71
bom
-0.65
unker
-0.63
curing
-0.61
terday
-0.59
Replay
-0.59
onte
-0.59
icum
-0.58
POSITIVE LOGITS
parallels
1.01
strings
0.90
drawn
0.89
attention
0.88
inspiration
0.87
conclusions
0.86
cards
0.85
Cosponsors
0.85
card
0.83
resemb
0.78
Activations Density 0.457%