INDEX
Explanations
references to social media posts and their interactions
Negative Logits
zelf    -0.17
rat     -0.16
usa     -0.15
ví      -0.15
apa     -0.15
illi    -0.15
bs      -0.14
rats    -0.14
olds    -0.14
rá      -0.14
Positive Logits
ulate       0.33
erior       0.31
graduate    0.30
natal       0.30
greSQL      0.28
ulates      0.28
ulated      0.28
uring       0.26
gresql      0.25
facto       0.24
Activations Density 0.060%
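
The density figure above can be read as the share of token positions on which the feature fires at all. The sketch below is one way to compute such a number, assuming density means "activation greater than zero"; the function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and are not taken from this page.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds `threshold`.

    Assumption: a position counts as "active" when its activation is strictly
    greater than the threshold (0.0 by default).
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 3 active positions out of 5,000 tokens.
sample = [0.0] * 4997 + [1.2, 0.4, 2.7]
density = activation_density(sample)
print(f"{density:.3%}")  # prints "0.060%"
```

With a density this low, the feature would be active on roughly 6 of every 10,000 tokens.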