INDEX
Explanations
commands and suggestions related to consuming media
Negative Logits
sunshine      -0.60
duck          -0.59
invitation    -0.57
water         -0.55
azing         -0.54
wishes        -0.54
iami          -0.53
wish          -0.53
moot          -0.53
blues         -0.52
Positive Logits
it            0.88
uve           0.68
anu           0.65
().           0.65
itia          0.64
itted         0.62
orie          0.62
illac         0.62
!:            0.61
Patron        0.60
Activations Density 0.181%