INDEX
Explanations
repeated instances of the article "the"
New Auto-Interp
Negative Logits
osponsors
-0.78
handedly
-0.69
govtrack
-0.63
Turkey
-0.63
accompl
-0.63
insk
-0.62
pers
-0.61
@@
-0.60
ato
-0.58
ofi
-0.58
POSITIVE LOGITS
same
1.45
ses
1.20
dreaded
1.09
latter
1.04
aforementioned
1.04
slightest
1.01
same
1.00
latest
1.00
brunt
0.99
highest
0.94
Activations Density 0.253%