INDEX
Explanations
references to media consumption and entertainment contexts
New Auto-Interp
Head Attr Weights
0:0.02
1:0.01
2:0.14
3:0.09
4:0.12
5:0.03
6:0.04
7:0.09
8:0.08
9:0.03
10:0.16
11:0.14
Negative Logits
覚醒
-1.28
atile
-1.25
versatile
-1.25
ailable
-1.24
spac
-1.19
Flavor
-1.19
ORY
-1.19
trait
-1.19
fits
-1.19
resilient
-1.17
POSITIVE LOGITS
gnu
1.43
umbledore
1.31
scan
1.26
millenn
1.25
paused
1.22
dos
1.21
ovies
1.21
aunder
1.21
scrape
1.19
okemon
1.19
Activations Density 0.061%