INDEX
Explanations
YouTube and Google Drive links in text
web addresses, specifically links to YouTube and Google
Negative Logits
Token      Logit
Frie       -0.80
terday     -0.71
unused     -0.67
Letters    -0.64
Awakens    -0.64
-+-+       -0.63
Hebdo      -0.60
Diplom     -0.60
letters    -0.59
Bung       -0.59
Positive Logits
Token      Logit
watch      1.30
embed      0.98
spread     0.97
channel    0.95
user       0.89
search     0.88
sites      0.88
share      0.87
gallery    0.87
youtu      0.86
Activations Density: 0.012%