INDEX
Explanations
instances of structural or physical components and their characteristics
New Auto-Interp
Negative Logits
perman
-0.15
ãĥ³ãĥĩ
-0.14
çĮ
-0.14
wat
-0.14
::<
-0.14
Verb
-0.14
::
-0.13
career
-0.13
failed
-0.13
ạt
-0.13
POSITIVE LOGITS
Evet
0.17
aroo
0.15
emek
0.15
bol
0.15
icana
0.15
ONGL
0.15
IDD
0.15
çµ
0.14
ÑĨ
0.14
olec
0.14
Activations Density 0.006%