INDEX
Explanations
references to singular or individual items and actions
New Auto-Interp
Negative Logits
LETE
-0.17
orro
-0.15
ammad
-0.15
меж
-0.15
UIBar
-0.15
èĪĮ
-0.14
Levine
-0.14
hift
-0.14
öm
-0.14
inium
-0.14
POSITIVE LOGITS
/single
0.30
(single
0.27
-single
0.24
olo
0.22
single
0.21
solitary
0.20
singleton
0.20
solo
0.20
-alone
0.20
isolated
0.19
Activations Density 0.344%