INDEX
Explanations
phrases related to acquiring knowledge or information
phrases indicating the act of learning or receiving news
New Auto-Interp
Negative Logits
exting
-0.73
teasp
-0.68
enture
-0.66
ascript
-0.65
privile
-0.64
pmwiki
-0.63
paio
-0.62
oreAnd
-0.62
ĸļ
-0.61
respective
-0.61
POSITIVE LOGITS
that
0.76
about
0.70
how
0.70
goodbye
0.70
DeVos
0.68
he
0.68
news
0.67
she
0.67
what
0.66
noon
0.64
Activations Density 0.126%