INDEX
Explanations
themes related to dreams and aspirations for a better future
New Auto-Interp
Negative Logits
æĦŁæĥħ
-0.16
eward
-0.16
ames
-0.14
aviest
-0.14
rocessing
-0.14
ево
-0.13
adem
-0.13
682
-0.13
asta
-0.13
ernen
-0.13
POSITIVE LOGITS
ighbor
0.16
toler
0.16
Patch
0.15
_patch
0.14
patch
0.14
asant
0.14
opia
0.14
MethodName
0.14
patch
0.14
Patch
0.14
Activations Density 0.096%