INDEX
Explanations
words related to time or temporal proximity
references to impending future events or conditions
Negative Logits
Token       Logit
ensional    -0.53
Canaver     -0.50
ctions      -0.48
Matter      -0.46
sect        -0.45
Bosh        -0.45
Invention   -0.44
Pens        -0.44
casters     -0.44
[+          -0.43
Positive Logits
Token       Logit
an          0.67
aned        0.62
\\          0.59
ana         0.58
ans         0.58
ane         0.55
inges       0.54
lihood      0.54
ja          0.53
aning       0.52
Activations Density: 0.599%