Explanations
references to spiritual or personal growth
Negative Logits
ClassName    -0.14
ise          -0.14
ÃŁen         -0.14
wig          -0.13
agar         -0.13
wick         -0.13
ophone       -0.13
بÛĮر         -0.13
ara          -0.13
Dynam        -0.13
Positive Logits
Ñģки         0.14
strup        0.14
ÄĻk          0.14
quets        0.14
anzi         0.14
ettel        0.13
proced       0.13
Reich        0.13
pun          0.13
readcr       0.13
Activations Density 0.097%