INDEX
Explanations
instances of specific types of concepts or entities within a context
various types of opportunities and options in different contexts
Negative Logits
cffffcc     -0.70
ãĤ´ãĥ³      -0.68
staking     -0.65
comings     -0.61
conclud     -0.60
exting      -0.60
WAYS        -0.60
ãĤĴ         -0.58
ãĥ¡         -0.58
hig         -0.58
POSITIVE LOGITS
they        1.14
we          1.01
THEY        0.94
he          0.93
you         0.93
soever      0.89
she         0.86
they        0.86
thou        0.85
it          0.83
Activations Density 0.155%