INDEX
Explanations
social media account usernames
references to social media and communication platforms
New Auto-Interp
Negative Logits
Levant
-0.68
carbohyd
-0.65
beginners
-0.64
lication
-0.63
testers
-0.62
livest
-0.62
etheless
-0.61
lengths
-0.60
testament
-0.60
distances
-0.59
POSITIVE LOGITS
bh
0.89
0.85
Uk
0.79
HQ
0.77
zx
0.77
uez
0.77
jj
0.76
xus
0.75
qv
0.74
OY
0.73
Activations Density 0.062%