Explanations
phrases that express statements or quotes
Negative Logits
Token       Logit
олом        -0.17
ãĥ«ãĥķ      -0.16
èle         -0.15
dit         -0.15
ypo         -0.14
adoption    -0.14
SCO         -0.14
ÐĴС         -0.14
Äįen        -0.14
Dün         -0.14
Positive Logits
Token       Logit
brook       0.16
agues       0.15
conc        0.15
ÙIJÙħ        0.15
conc        0.15
antan       0.15
res         0.15
egg         0.14
па          0.14
RAL         0.14
Activations Density 0.013%