INDEX
Explanations
instances where it is deemed important to take note of specific information
phrases that emphasize the importance of noting something
New Auto-Interp
Negative Logits
atan
-0.73
ravel
-0.71
quer
-0.70
atom
-0.67
namese
-0.67
iffe
-0.66
soever
-0.65
oing
-0.64
adesh
-0.64
estern
-0.63
POSITIVE LOGITS
ably
0.83
books
0.79
lessly
0.75
book
0.73
how
0.72
ATURE
0.70
noting
0.69
Keeper
0.69
BOOK
0.69
ATURES
0.68
Activations Density 0.013%