INDEX
Explanations
discussions and mentions of various social and political issues
New Auto-Interp
Negative Logits
itſelf
-0.71
Rüyada
-0.70
fubject
-0.67
sclero
-0.67
Anſ
-0.66
glasses
-0.66
doubtnut
-0.65
Mores
-0.65
Gerente
-0.64
occafion
-0.63
POSITIVE LOGITS
issues
1.08
topics
1.04
Topics
0.93
Issues
0.91
themes
0.82
issues
0.81
Topics
0.80
questions
0.79
Issues
0.79
topics
0.71
Activations Density 0.114%