INDEX
Explanations
positive and surprising experiences related to nature
Negative Logits
oce    -0.17
vod    -0.16
oÅĻ    -0.15
ouro    -0.15
enek    -0.15
rets    -0.14
ulty    -0.14
odÄĽ    -0.13
Ľå»º    -0.13
â̦↵↵↵    -0.13
Positive Logits
åī£    0.16
паÑĤ    0.15
leap    0.14
Tort    0.14
.realm    0.14
Obs    0.14
aliment    0.14
eh    0.14
ogram    0.14
pat    0.14
Activations Density 0.012%