Explanations
instances where the word "the" is followed by another particular word
Negative Logits
£ı        -0.72
arry      -0.70
.--       -0.67
omever    -0.67
bear      -0.67
gat       -0.66
exceeds   -0.66
.''       -0.65
acy       -0.65
strap     -0.65
Positive Logits
same            1.18
latter          1.13
idea            1.10
latest          1.09
largest         1.07
infamous        1.04
brunt           1.03
aforementioned  1.02
entirety        1.00
toughest        1.00
Activations Density 0.454%
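
The density figure above is not defined on this page. A common reading, assumed here, is the percentage of sampled tokens on which the feature has a nonzero activation. The sketch below illustrates that reading only; the function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and are not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the percentage of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the share of tokens on which the feature
    fires at all. The dashboard itself does not state its definition.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)


# Hypothetical sample: 1000 tokens, 5 of which activate the feature.
sample = [0.0] * 995 + [0.3, 1.2, 0.8, 2.1, 0.05]
print(f"{activation_density(sample):.3f}%")  # prints "0.500%"
```

Under this reading, a density of 0.454% would mean the feature fires on roughly one token in 220.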