INDEX
Explanations
verbs related to actions or commands
phrases related to actions and their consequences
New Auto-Interp
Negative Logits
yssey
-0.68
oult
-0.63
respectively
-0.57
aukee
-0.52
anwhile
-0.52
actionGroup
-0.51
worth
-0.50
ishers
-0.50
These
-0.50
cells
-0.50
POSITIVE LOGITS
it
1.67
thereof
1.07
It
1.00
thereto
0.93
therein
0.92
It
0.91
it
0.91
hers
0.90
theirs
0.88
Its
0.79
Activations Density 2.605%