INDEX
Explanations
speculative phrases about the future
occurrences of the word "the" in various contexts, indicating a focus on significant or notable subjects
New Auto-Interp
Negative Logits
lement
-0.72
aco
-0.66
alone
-0.63
ousy
-0.62
folk
-0.62
arson
-0.60
tel
-0.59
thood
-0.57
bots
-0.57
bourg
-0.56
POSITIVE LOGITS
verge
1.62
brink
1.48
lookout
1.30
forefront
1.16
chopping
1.04
periphery
0.99
sidelines
0.98
edge
0.98
heels
0.97
precip
0.96
Activations Density 0.066%