INDEX
Explanations
references to the term "alt-right" and its variations
Negative Logits
commencement    -0.76
forward         -0.68
balcon          -0.66
dominate        -0.65
shattering      -0.63
overriding      -0.63
bulldo          -0.62
ascertain       -0.61
ambul           -0.61
altering        -0.61
Positive Logits
adena           0.87
Offline         0.75
�               0.71
ˈ               0.70
ogether         0.69
igl             0.69
uild            0.64
ヴ              0.63
Sent            0.62
Poké            0.61
Activations Density 0.013%