INDEX
Explanations
conjunctions and phrases denoting connections or relationships between concepts
New Auto-Interp
Negative Logits
its
-0.15
atch
-0.14
_OLD
-0.13
_suspend
-0.13
lements
-0.13
pozn
-0.13
.Factory
-0.13
ause
-0.13
sut
-0.13
X
-0.12
POSITIVE LOGITS
extent
0.21
/or
0.20
entirety
0.19
nature
0.19
spirit
0.16
ivery
0.16
ĵn
0.16
duration
0.15
confines
0.15
tone
0.15
Activations Density 0.111%