INDEX
Explanations
references to online identities or specific usernames
Head Attribution Weights
0:0.17
1:0.14
2:0.07
3:0.09
4:0.03
5:0.11
6:0.03
7:0.03
8:0.11
9:0.08
10:0.05
11:0.05
Negative Logits
bay      -1.90
payers   -1.76
iverse   -1.62
kiss     -1.61
bike     -1.61
lif      -1.58
file     -1.58
pour     -1.58
775      -1.57
clone    -1.52
Positive Logits
rocal          1.90
ENE            1.85
Contra         1.73
Miscellaneous  1.73
otine          1.70
Sap            1.70
Span           1.66
Wasserman      1.59
Introduction   1.58
ggle           1.58
Activations Density 0.002%