INDEX
Explanations
occurrences of social media posts
instances of the word "posted."
New Auto-Interp
Negative Logits
inho
-0.73
idge
-0.71
isma
-0.71
icably
-0.68
ppo
-0.66
hart
-0.65
Galile
-0.64
phant
-0.64
pter
-0.64
Gand
-0.63
POSITIVE LOGITS
posting
0.91
posted
0.91
uploads
0.90
postings
0.86
gres
0.83
posts
0.82
ulate
0.81
doctoral
0.81
hum
0.78
posted
0.77
Activations Density 0.025%