INDEX
Explanations
words related to political systems and policies
New Auto-Interp
Negative Logits
queue
-0.85
consulted
-0.72
ichen
-0.71
jan
-0.68
BILL
-0.67
ade
-0.67
quer
-0.67
UPDATE
-0.67
amazon
-0.66
Provided
-0.65
POSITIVE LOGITS
displeasure
1.09
individuality
1.08
seriousness
1.05
discontent
1.04
masculinity
1.04
greatness
1.03
humanity
1.03
sentiments
0.98
dissatisfaction
0.98
sincerity
0.97
Activations Density 2.774%