INDEX
Explanations
mentions and interactions between users in a discussion or comment thread
New Auto-Interp
Negative Logits
stället
-0.60
hemsida
-0.57
récomp
-0.56
ValueGenerated
-0.56
portátiles
-0.56
“.
-0.54
tovább
-0.54
niyang
-0.54
materna
-0.53
Điều
-0.53
POSITIVE LOGITS
Anonymous
0.89
anonymous
0.80
Anonymous
0.75
Anon
0.72
j
0.71
anonymous
0.67
mr
0.65
user
0.65
Anon
0.65
Mr
0.64
Activations Density 0.313%