INDEX
Explanations
words related to movement or direction
New Auto-Interp
Negative Logits
Shakspeare
-0.88
Theſe
-0.80
Shaksp
-0.78
blest
-0.78
lidl
-0.73
creeds
-0.70
operands
-0.69
Aleppo
-0.69
Mahomet
-0.69
Pallas
-0.69
POSITIVE LOGITS
--)
0.73
()]
0.71
=>
0.70
+");
0.69
={
0.69
+')
0.67
(
0.64
());
0.63
>({0.63
)),
0.63
Activations Density 0.313%