INDEX
Explanations
the term "certain" or related context of specificity
Negative Logits
اÙĨÙĩ    -0.17
ust     -0.16
pack    -0.15
ase     -0.15
enda    -0.14
antar   -0.14
endas   -0.14
ota     -0.14
µ¬      -0.14
yar     -0.14
Positive Logits
kinds   0.21
akin    0.19
kind    0.18
dozen   0.15
umb     0.15
tul     0.15
ulis    0.14
types   0.14
ially   0.14
.hw     0.14
Activations Density: 0.018%