INDEX
Explanations
words related to political statements and opinions
New Auto-Interp
Negative Logits
millenn
-0.74
respective
-0.69
illions
-0.67
operation
-0.67
OTAL
-0.67
ilater
-0.66
frequency
-0.65
Versions
-0.63
luster
-0.63
articles
-0.62
POSITIVE LOGITS
Galile
0.75
LLP
0.74
pandemonium
0.72
Torment
0.72
pload
0.70
Spac
0.68
Phant
0.68
yi
0.67
Neighborhood
0.67
Rebellion
0.66
Activations Density 4.399%