INDEX
Explanations
No Explanations Found
Head Attr Weights
0:0.08
1:0.09
2:0.08
3:0.09
4:0.07
5:0.07
6:0.08
7:0.08
8:0.07
9:0.07
10:0.09
11:0.08
Negative Logits
Pastebin     -2.10
nels         -1.78
cake         -1.69
Daddy        -1.65
enstein      -1.63
OVA          -1.60
cleaners     -1.60
lux          -1.60
Corona       -1.59
Wonderland   -1.56
Positive Logits
osponsors    1.95
ospons       1.92
ensional     1.72
pse          1.67
NF           1.63
coales       1.58
lamb         1.57
startled     1.54
trou         1.48
kins         1.47
Activations Density 0.000%
This feature has no known activations.