Explanations
words associated with experiences and sensations
Negative Logits
sight     -0.17
.trans    -0.15
oldt      -0.14
statt     -0.14
emer      -0.14
metis     -0.14
sights    -0.14
หาร       -0.14
tera      -0.14
.sky      -0.13
Positive Logits
amos      0.15
oppable   0.15
騎        0.14
shoot     0.14
adiens    0.14
рус       0.14
Kas       0.14
amas      0.14
らい      0.13
こそ      0.13
Activations Density: 0.023%