Explanations
instances of distinction or separation between entities or concepts
Negative Logits
onium          -0.15
ibraltar       -0.15
onomies        -0.15
Fallon         -0.14
ÏīÏĥη          -0.14
StatusLabel    -0.14
Mend           -0.14
lai            -0.14
uhe            -0.14
ка             -0.13
Positive Logits
separate       0.24
Separate       0.22
seperate       0.21
separately     0.20
çĭ¬ç«ĭ         0.18
distinct       0.18
independent    0.18
independently  0.17
separ          0.16
inton          0.15
Activations Density 0.142%
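
Some entries in the logit lists (for example çĭ¬ç«ĭ) look like byte-level BPE display strings rather than readable text. The sketch below shows one way such a string could be mapped back to text. It assumes the model uses a GPT-2 style byte-to-character table, which this page does not state; the names `byte_level_table`, `CHAR_TO_BYTE`, and `decode_token` are hypothetical helpers, not part of any dashboard API.

```python
def byte_level_table():
    """Build a GPT-2 style byte -> printable-character table (assumed scheme)."""
    # Bytes that are already printable are displayed as themselves.
    keep = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(0xA1, 0xAC + 1))
        + list(range(0xAE, 0xFF + 1))
    )
    table = {b: chr(b) for b in keep}
    # Every remaining byte is shifted to a code point starting at 256.
    shift = 0
    for b in range(256):
        if b not in table:
            table[b] = chr(256 + shift)
            shift += 1
    return table


# Inverse lookup: displayed character -> original byte value.
CHAR_TO_BYTE = {c: b for b, c in byte_level_table().items()}


def decode_token(display):
    """Return the decoded text, or None if the string does not fit the scheme."""
    try:
        raw = bytes(CHAR_TO_BYTE[ch] for ch in display)
    except KeyError:
        # A character outside the table: not a pure byte-level string.
        return None
    try:
        return raw.decode("utf-8")
    except UnicodeDecodeError:
        # Valid table characters, but not a complete UTF-8 sequence.
        return None


if __name__ == "__main__":
    print(decode_token("çĭ¬ç«ĭ"))
```

Under that assumption, çĭ¬ç«ĭ decodes to 独立 ("independent"), which is consistent with the other positive-logit tokens. Plain ASCII tokens such as separate pass through unchanged, and the helper returns None for strings that contain characters outside the table, so a None result only means the string does not fit this particular scheme.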