INDEX
Explanations
mentions of user profiles and online interactions
New Auto-Interp
Negative Logits
estranged
-0.84
photograp
-0.79
staggered
-0.71
fleeing
-0.71
pitching
-0.70
purs
-0.69
orchestr
-0.69
libraries
-0.69
predec
-0.69
escaping
-0.69
POSITIVE LOGITS
Posted
1.36
Reply
1.35
________________
1.34
User
1.28
Guest
1.23
Anonymous
1.20
anon
1.16
Edited
1.16
Anyway
1.14
Comment
1.10
Activations Density 0.189%