INDEX
Explanations
phrases comparing two entities or situations
instances of the word "the."
Negative Logits
besides      -0.84
furthermore  -0.76
namely       -0.73
†            -0.72
followed     -0.70
elaide       -0.69
gpu          -0.69
istries      -0.68
alongside    -0.68
―            -0.68
Positive Logits
slightest       1.24
usual           1.20
aforementioned  1.13
smallest        1.12
entirety        1.11
rest            1.06
proverbial      1.01
latter          1.00
prevailing      1.00
same            0.99
Activations Density 0.400%
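
As a rough illustration of the density figure above, here is a minimal sketch. It assumes that "activations density" means the fraction of sampled tokens on which the feature has a non-zero activation; the page itself does not define the term. The function name `activation_density` and the sample values are hypothetical and are not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of activation values strictly above `threshold`.

    Assumption: density = (tokens where the feature fires) / (tokens sampled).
    """
    if not activations:
        raise ValueError("need at least one activation value")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: the feature fires on 4 of 1,000 tokens.
sample = [0.0] * 996 + [0.7, 1.3, 0.2, 2.1]
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.400%
```

Under this reading, a density of 0.400% would mean the feature is active on about 4 of every 1,000 tokens sampled.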