INDEX
Explanations
nouns and adjectives related to descriptions and attributes
Negative Logits
площ
-0.20
\Collections
-0.16
práci
-0.16
приступ
-0.16
работы
-0.16
другой
-0.15
inda
-0.15
Compilation
-0.15
sposób
-0.15
ひと
-0.15
Positive Logits
время
0.19
значение
0.17
течение
0.17
лицо
0.17
полот
0.17
количе
0.17
покол
0.17
количество
0.17
средство
0.16
растение
0.15
Activations Density 0.023%