INDEX
Explanations
references to wishes and charitable actions
New Auto-Interp
Negative Logits
resher
-0.14
/feed
-0.14
Welfare
-0.14
æ½®
-0.14
leanup
-0.14
éϰ
-0.14
รส
-0.14
flea
-0.13
Contrib
-0.13
FAULT
-0.13
POSITIVE LOGITS
wish
0.39
Wish
0.37
wishes
0.36
wish
0.35
wished
0.26
wishing
0.26
Dreams
0.23
granted
0.23
granting
0.23
wishlist
0.22
Activations Density 0.016%