INDEX
Explanations
social media posts or updates
references to photos and posts on social media
New Auto-Interp
Negative Logits
anmar
-0.84
Rounds
-0.77
senal
-0.73
osponsors
-0.71
metic
-0.67
Centauri
-0.66
============
-0.65
fect
-0.63
enthal
-0.63
itone
-0.63
POSITIVE LOGITS
Ø
0.66
username
0.65
DragonMagazine
0.63
(@
0.62
76561
0.61
atl
0.61
cov
0.61
_
0.59
indu
0.59
ja
0.58
Activations Density 0.051%