INDEX
Explanations
words related to gaining understanding or knowledge
references to gaining or providing insight on various topics
New Auto-Interp
Negative Logits
ategory
-0.69
shr
-0.64
fuss
-0.63
disbanded
-0.60
notor
-0.60
ony
-0.60
slapping
-0.59
senal
-0.59
FACE
-0.56
stiff
-0.56
POSITIVE LOGITS
ibility
1.00
ively
0.89
glean
0.89
insight
0.87
fully
0.83
insights
0.82
ibly
0.82
spection
0.82
ible
0.81
ives
0.79
Activations Density 0.027%