INDEX
Explanations
phrases related to discovering or uncovering information
phrases indicating the act of discovering or learning information
Negative Logits
oun         -0.75
Crystal     -0.69
ovich       -0.69
avorite     -0.66
ĸļ          -0.64
luster      -0.64
berus       -0.62
eries       -0.61
uga         -0.59
erate       -0.58
Positive Logits
about       1.02
why         0.98
how         0.97
WHY         0.87
whats       0.86
beforehand  0.85
exactly     0.85
what        0.83
whether     0.82
afterwards  0.82
Activations Density 0.022%