INDEX
Explanations
the word "See" at varying levels of activation
phrases related to viewing content or information
Negative Logits
naires    -0.64
naire     -0.61
angan     -0.58
ngth      -0.58
othal     -0.56
alky      -0.54
hurd      -0.54
elf       -0.54
iod       -0.54
posal     -0.54
Positive Logits
Sample        0.59
Cancel        0.54
Sketch        0.54
Property      0.53
Widget        0.53
whats         0.53
Entreprene    0.53
Vu            0.52
Shooting      0.52
previews      0.52
Activations Density: 0.013%