INDEX
Explanations
terms and phrases related to seasonal themes and community events
Head Attr Weights
0:0.02
1:0.01
2:0.15
3:0.04
4:0.27
5:0.02
6:0.17
7:0.14
8:0.02
9:0.03
10:0.04
11:0.04
Negative Logits
田             -1.36
SPONSORED     -1.20
starter       -1.19
ら            -1.18
Applications  -1.16
pled          -1.14
abouts        -1.14
prototype     -1.14
Trivia        -1.13
ファ          -1.12
Positive Logits
iem           1.32
icro          1.32
neighb        1.25
aila          1.18
ijk           1.15
Aless         1.13
Beaut         1.12
AUT           1.10
sts           1.10
unbeliev      1.10
Activations Density: 0.009%