INDEX
Explanations
expressions of excitement or anticipation related to personal experiences and social interactions
New Auto-Interp
Negative Logits
709
-0.15
pok
-0.15
owie
-0.14
POCH
-0.14
æŁĦ
-0.14
POS
-0.13
è´µ
-0.13
ipe
-0.13
pty
-0.13
pick
-0.13
POSITIVE LOGITS
hos
0.16
dech
0.15
umi
0.14
============================================================================↵
0.14
ortal
0.14
elsea
0.14
profund
0.14
ancestral
0.14
iversary
0.14
اÙĪØª
0.13
Activations Density 0.043%