INDEX
Explanations
names of political figures
prominent political figures and their statements
New Auto-Interp
Negative Logits
ILCS
-0.75
VID
-0.73
Editors
-0.69
LTD
-0.67
geries
-0.67
iries
-0.66
Torrent
-0.66
itaire
-0.66
Architects
-0.66
ources
-0.65
POSITIVE LOGITS
reiterated
1.26
dodged
1.24
joked
1.16
contrasted
1.13
piv
1.13
elaborated
1.13
interrupted
1.11
asserted
1.10
alluded
1.10
mocked
1.10
Activations Density 0.229%