INDEX
Explanations
emotional expressions of pride and self-reflection
New Auto-Interp
Negative Logits
-0.50
autorytatywna
-0.43
\#:
-0.43
+#+#
-0.42
mtrl
-0.42
這位
-0.41
تضيفلها
-0.41
EndInit
-0.40
NamedQuery
-0.40
这位
-0.39
POSITIVE LOGITS
myself
0.51
<<<<<<<<<<<<<<
0.48
my
0.48
thing
0.46
mío
0.45
myself
0.44
realising
0.43
culously
0.42
それで
0.42
I
0.41
Activations Density 0.976%