INDEX
Explanations
content related to news articles or headlines
commands related to enlarging or toggling images
Negative Logits
Token      Logit
warm       -0.65
naires     -0.65
angering   -0.62
angers     -0.62
nuts       -0.62
anger      -0.62
onew       -0.61
upt        -0.60
comes      -0.60
sucker     -0.60
Positive Logits
Token      Logit
Enlarge    0.94
ONSORED    0.92
UTERS      0.85
Image      0.85
WATCHED    0.85
caption    0.79
toggle     0.73
PHOTO      0.73
Photo      0.71
Reached    0.69
Activations Density: 0.011%