Explanations
phrases related to spatial locations or directions
instances of the word "the" at various points in the text
Negative Logits
eers         -0.72
aris         -0.71
paio         -0.69
intosh       -0.67
firsthand    -0.65
Wik          -0.65
LY           -0.65
uba          -0.64
arians       -0.63
maid         -0.62
Positive Logits
country         0.88
proverbial      0.85
equation        0.84
millennium      0.83
smallest        0.81
latter          0.80
world           0.80
aforementioned  0.80
spectrum        0.77
circle          0.77
Activations Density: 0.257%
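
As a rough illustration of what a density figure like this could mean, the sketch below assumes that activation density is the fraction of sampled token positions on which the feature's activation is above zero. That definition, the function name `activation_density`, and the toy data are all assumptions made for illustration; the page itself does not say how the number was computed.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means the share of sampled tokens on which the
    feature fires at all. The source page does not define the term.
    """
    if not activations:
        return 0.0
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical toy sample: the feature fires on 257 of 100,000 tokens.
toy_activations = [1.0] * 257 + [0.0] * (100_000 - 257)

print(f"{activation_density(toy_activations):.3%}")  # prints 0.257%
```

Under this reading, a density of 0.257% would mean the feature is active on roughly 1 in every 390 tokens, which fits a feature tied to specific uses of a very common word rather than to every occurrence of it.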