INDEX
Explanations
mentions of law enforcement or legal situations
the article "the" in various contexts
New Auto-Interp
Negative Logits
eno
-0.72
emin
-0.66
leeve
-0.66
thood
-0.64
iffe
-0.62
besides
-0.62
earch
-0.61
ateurs
-0.61
verage
-0.59
iev
-0.59
POSITIVE LOGITS
latter
1.13
slightest
1.08
heaviest
1.04
biggest
1.02
resulting
1.01
smallest
1.01
largest
1.01
fastest
1.01
longest
0.99
aforementioned
0.94
Activations Density 0.270%