INDEX
Explanations
mentions of legal issues or violations
Head Attr Weights
0:0.27
1:0.02
2:0.02
3:0.06
4:0.08
5:0.06
6:0.04
7:0.02
8:0.30
9:0.04
10:0.02
11:0.03
Negative Logits
plank      -1.77
pept       -1.75
awa        -1.65
pills      -1.64
chenko     -1.64
reins      -1.59
adjust     -1.54
compliant  -1.50
capsules   -1.49
wra        -1.48
Positive Logits
rien        1.97
arth        1.86
uel         1.72
aturday     1.72
downtime    1.69
Tours       1.68
Vend        1.63
vironments  1.61
BLIC        1.60
Contact     1.58
Activations Density 0.000%