INDEX
Explanations
mentions of the word "ide"
words related to "ideology" or "ideological concepts."
New Auto-Interp
Negative Logits
cffff
-0.71
thening
-0.69
iona
-0.68
%]
-0.68
enegger
-0.68
=~=~
-0.67
ichick
-0.65
ourced
-0.62
sembly
-0.62
#$#$
-0.61
POSITIVE LOGITS
ide
0.91
ously
0.83
lli
0.80
verty
0.77
aux
0.76
aten
0.74
llo
0.71
vice
0.71
IDE
0.70
Sov
0.70
Activations Density 0.012%