Explanations
thoughts and expressions regarding change and adaptation
Negative Logits
مشار  -0.16
ltk  -0.16
Fasc  -0.16
aki  -0.14
icer  -0.14
FW  -0.14
Fetcher  -0.14
иж  -0.14
motivation  -0.14
уль  -0.14
Positive Logits
familiar  0.65
familiarity  0.54
amiliar  0.52
Fam  0.47
熟  0.42
comfort  0.34
acquainted  0.33
привы  0.33
익  0.29
comfortable  0.29
Activations Density 0.208%