Explanations
information related to posts made in online forums or communities
instances of the word "Posts" and track post counts
Negative Logits
Token          Logit
Bhar           -0.71
mere           -0.64
neglig         -0.62
rette          -0.62
flies          -0.61
energies       -0.61
orman          -0.60
rium           -0.60
consultants    -0.59
Aid            -0.59
Positive Logits
Token          Logit
Joined         1.05
Posts          0.95
reply          0.86
Last           0.82
Reply          0.79
Edited         0.74
ategory        0.74
Why            0.72
Offense        0.71
Thread         0.70
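
Lists like the ones above are commonly produced by scoring every vocabulary token against a feature's output direction and keeping the extremes. The sketch below illustrates that idea only; it assumes (this page does not confirm it) that each score is a dot product between the direction and a per-token unembedding row. The function names `logit_effects` and `top_and_bottom`, and all numbers in the toy data, are hypothetical and are not the values listed above.

```python
def logit_effects(direction, unembed, vocab):
    """Score each token as the dot product of its unembedding row with the direction."""
    scores = {}
    for token, row in zip(vocab, unembed):
        scores[token] = sum(d * w for d, w in zip(direction, row))
    return scores


def top_and_bottom(scores, k):
    """Return the k most positive and k most negative tokens, most extreme first."""
    ranked = sorted(scores.items(), key=lambda kv: kv[1], reverse=True)
    positive = ranked[:k]
    negative = ranked[-k:][::-1]
    return positive, negative


# Toy data, invented for illustration: a 2-dimensional direction and 4 tokens.
direction = [1.0, 0.5]
vocab = ["Joined", "Posts", "mere", "Bhar"]
unembed = [
    [1.0, 0.0],    # Joined
    [0.5, 0.5],    # Posts
    [-0.5, 0.0],   # mere
    [-0.5, -0.5],  # Bhar
]

scores = logit_effects(direction, unembed, vocab)
positive, negative = top_and_bottom(scores, k=2)
print("positive:", positive)
print("negative:", negative)
```

Sorting once and slicing both ends keeps the positive and negative lists consistent with each other, which matters when ties occur near the cutoff.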
Activations Density: 0.017%
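
An activation density figure is usually read as the fraction of sampled tokens on which the feature fires at all. The sketch below shows that calculation under the assumption (not stated on this page) that "fires" means an activation strictly above a threshold of zero. The function name `activation_density` and the sample data are hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of tokens whose activation is strictly above the threshold."""
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Invented sample: 10,000 tokens, of which 2 activate the feature.
sample = [0.0] * 10000
sample[5] = 1.3
sample[42] = 0.7

density = activation_density(sample)
print(f"{density:.3%}")
```

At a density of 0.017%, a feature would fire on roughly 17 of every 100,000 tokens, so a large sample is needed before the estimate is stable.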