INDEX
Explanations
article followed by descriptive word
New Auto-Interp
Negative Logits
SUCCESSFULLY
0.17
#!/
0.16
)
0.16
an
0.16
having
0.16
the
0.15
ately
0.15
preferentially
0.15
ably
0.15
the
0.15
POSITIVE LOGITS
few
0.30
slight
0.28
bit
0.27
flurry
0.27
很好的
0.26
lot
0.25
fairly
0.25
nice
0.25
tremendous
0.25
little
0.25
Activations Density 0.251%