INDEX
Explanations
phrases related to nostalgia or past events
New Auto-Interp
Negative Logits
بعدÛĮ
-0.19
_SCL
-0.16
><?
-0.16
ãģ°ãģĭãĤĬ
-0.16
æĬŀ
-0.15
uzey
-0.15
óÅĤ
-0.15
enÃŃ
-0.14
zens
-0.14
zatÃŃm
-0.14
POSITIVE LOGITS
back
0.88
back
0.56
Back
0.55
Back
0.53
_back
0.53
.back
0.50
BACK
0.50
-back
0.48
years
0.45
back
0.43
Activations Density 0.137%