INDEX
Explanations
attends to beautiful and gorgeous from comments celebrating beauty
New Auto-Interp
Head Attr Weights
0:0.09
1:0.11
2:0.08
3:0.13
4:0.12
5:0.05
6:0.23
7:0.14
Negative Logits
بوابة
-0.29
mostly
-0.29
antaranya
-0.29
GGLE
-0.28
onStop
-0.28
MonoBehaviour
-0.28
,
-0.28
obicei
-0.27
mainly
-0.27
anywhere
-0.26
POSITIVE LOGITS
bootstrapcdn
0.30
EconPapers
0.30
thums
0.29
TSCA
0.28
+#+#
0.27
prefixer
0.26
Catawiki
0.26
Giving
0.26
joaat
0.26
okuyayım
0.26
Activations Density 0.070%