INDEX
Explanations
numerical information related to user profiles or joined dates
numbers and statistics related to user activity or posts
New Auto-Interp
Negative Logits
ÂŃ
-1.13
âĢij
-1.10
SPONSORED
-0.93
âĢIJ
-0.89
—-
-0.83
Contents
-0.81
Enlarge
-0.79
"—
-0.79
â̲
-0.78
advertisement
-0.77
POSITIVE LOGITS
Quote
1.14
didnt
1.11
doesnt
1.09
dont
1.05
reply
1.01
OP
0.96
Seems
0.96
Quote
0.94
Reply
0.94
alot
0.93
Activations Density 0.286%