Explanations
words related to uniqueness or individuality in various contexts
Negative Logits
blown      -0.18
bidden     -0.17
otation    -0.16
regation   -0.15
ertia      -0.15
tract      -0.15
lasting    -0.15
achable    -0.15
usal       -0.15
clusion    -0.15
Positive Logits
izes       0.40
ifies      0.40
了一       0.39
了         0.33
ulates     0.29
了         0.29
iates      0.29
uje        0.29
ises       0.28
IZES       0.26
Activations Density 0.287%