INDEX
Explanations
terms related to political contexts and discussions
Negative Logits
Token      Logit
illet      -0.19
ayo        -0.15
ing        -0.15
uld        -0.14
iez        -0.14
lap        -0.13
aves       -0.13
res        -0.13
mand       -0.13
igua       -0.13
Positive Logits
Token      Logit
amarin     0.17
Tome       0.17
idders     0.15
Pascal     0.15
Č          0.14
urrency    0.14
loub       0.14
ety        0.14
sterdam    0.14
inker      0.14
Activations Density: 0.247%