INDEX
Explanations
social media profiles to follow
commands to follow individuals on social media
New Auto-Interp
Negative Logits
minist
-0.75
wcs
-0.68
,,,,
-0.68
fortunately
-0.65
pite
-0.63
paces
-0.61
riott
-0.60
coron
-0.59
notor
-0.58
pmwiki
-0.58
POSITIVE LOGITS
@
0.95
Stories
0.92
Jerome
0.87
Hass
0.84
Jonah
0.83
HuffPost
0.83
Dat
0.82
Tess
0.81
Chuck
0.80
ers
0.78
Activations Density 0.019%