INDEX
Explanations
terms related to opposition and resistance
New Auto-Interp
Negative Logits
VIC
-0.16
tron
-0.15
atoria
-0.15
itra
-0.14
illo
-0.14
ATOR
-0.14
appa
-0.14
hurst
-0.13
sco
-0.13
itchens
-0.13
POSITIVE LOGITS
against
0.18
à¸Ĺาà¸Ļ
0.18
Against
0.17
ìĤ¬íķŃ
0.15
ors
0.15
enta
0.15
sing
0.15
ably
0.15
alent
0.15
ναν
0.14
Activations Density 0.116%