INDEX
Explanations
references to social media engagement or interactions
New Auto-Interp
Negative Logits
uit
-0.63
uin
-0.62
Sug
-0.61
Fowler
-0.61
Citation
-0.61
ilit
-0.60
olit
-0.60
Saud
-0.60
Winged
-0.59
pur
-0.59
POSITIVE LOGITS
detectable
0.71
hiba
0.70
huge
0.69
emerge
0.69
THER
0.69
breeze
0.61
hattan
0.61
TAIN
0.60
ecd
0.60
hang
0.59
Activations Density 1.474%