INDEX
Explanations
names of people or entities
words related to the concept of "democracy" and its derivatives
Negative Logits
multif       -0.71
ModLoader    -0.63
horizont     -0.63
ledged       -0.61
iots         -0.61
pear         -0.59
hint         -0.58
congr        -0.58
amen         -0.58
spoilers     -0.58
Positive Logits
ufact        1.06
stration     0.70
agement      0.69
urate        0.68
itions       0.67
ques         0.67
oppers       0.67
rius         0.65
oun          0.65
icable       0.64
Activations Density: 0.099%