INDEX
Explanations
references to personal and collective growth
New Auto-Interp
Negative Logits
çħ§
-0.15
oir
-0.15
acades
-0.15
صÙĩ
-0.14
als
-0.14
aye
-0.14
ookie
-0.14
aurus
-0.14
uring
-0.14
åºľ
-0.14
POSITIVE LOGITS
asser
0.17
(er
0.16
æĤī
0.16
nda
0.15
íij¸
0.14
رز
0.14
ÙĨدگاÙĨ
0.14
enor
0.14
cÃŃm
0.14
vine
0.14
Activations Density 0.056%