INDEX
Explanations
concepts related to the relationship between reason and sensation
New Auto-Interp
Negative Logits
cestor
-0.16
ebi
-0.15
'".
-0.15
¸ı
-0.15
818
-0.15
ilde
-0.14
ÃŃny
-0.14
енка
-0.14
ethyst
-0.14
ayıp
-0.14
POSITIVE LOGITS
corrupt
0.20
passions
0.19
corruption
0.19
motion
0.19
agent
0.18
appet
0.18
agency
0.17
agent
0.16
Appet
0.16
animal
0.16
Activations Density 0.063%