INDEX
Explanations
terms related to identity and cultural reference points
New Auto-Interp
Negative Logits
oka
-0.15
cow
-0.15
245
-0.15
Brom
-0.14
elpers
-0.14
esin
-0.14
Cow
-0.14
Ivy
-0.14
ivel
-0.14
.iv
-0.14
POSITIVE LOGITS
onian
0.20
anian
0.20
ean
0.19
adian
0.18
esian
0.17
edian
0.16
arian
0.16
มà¸Ń
0.16
avian
0.16
sonian
0.16
Activations Density 0.210%