INDEX
Explanations
phrases that indicate a listing or enumeration of items or examples
New Auto-Interp
Head Attr Weights
0:0.01
1:0.01
2:0.13
3:0.18
4:0.10
5:0.03
6:0.08
7:0.09
8:0.06
9:0.03
10:0.07
11:0.16
Negative Logits
��極
-1.76
��
-1.65
EStream
-1.60
��
-1.56
龍喚士
-1.43
士
-1.40
ghazi
-1.38
iano
-1.28
bah
-1.26
神
-1.24
POSITIVE LOGITS
etsy
1.27
unanswered
1.17
flourishing
1.15
coming
1.13
swirling
1.13
buzzing
1.12
popping
1.12
happening
1.11
unheard
1.10
unwanted
1.07
Activations Density 0.039%