INDEX
Explanations
phrases related to discovering or uncovering information
phrases associated with discovering information or outcomes
Negative Logits
eries            -0.67
Crystal          -0.66
berus            -0.63
bite             -0.63
oun              -0.62
istent           -0.60
sensibilities    -0.60
Pers             -0.58
Textures         -0.58
athi             -0.58
Positive Logits
about            0.96
how              0.96
why              0.85
exactly          0.82
aloud            0.79
beforehand       0.79
what             0.79
how              0.78
whats            0.78
ledge            0.76
Activations Density: 0.047%