INDEX
Explanations
mentions of social media followers
references to social media followers
New Auto-Interp
Negative Logits
Genocide
-0.73
Rim
-0.72
ces
-0.72
ced
-0.70
Prosecutor
-0.67
RAG
-0.66
circumstance
-0.66
Kob
-0.65
Scotia
-0.63
Stars
-0.63
POSITIVE LOGITS
hip
1.28
followers
1.04
hips
0.97
follower
0.85
wagon
0.83
lia
0.81
adelphia
0.76
wagon
0.76
ievers
0.76
azel
0.75
Activations Density 0.012%