INDEX
Explanations
instances of users posting or commenting in a forum context
New Auto-Interp
Negative Logits
uchs
-0.16
okus
-0.15
issen
-0.15
azzi
-0.15
pell
-0.15
pta
-0.14
rear
-0.14
anan
-0.14
εί
-0.14
edian
-0.14
POSITIVE LOGITS
Joined
0.19
wrote
0.19
Contact
0.19
Re
0.19
contact
0.18
Contact
0.17
å¸ĸ
0.16
wards
0.16
Users
0.16
Return
0.15
Activations Density 0.006%