INDEX
Explanations
nouns and verbs indicating action or development over time
Negative Logits
Token        Logit
ura          -0.17
orsi         -0.17
BOTH         -0.15
feedback     -0.14
both         -0.14
isia         -0.14
breaker      -0.14
URA          -0.14
ransition    -0.13
idir         -0.13
Positive Logits
Token        Logit
Conway       0.15
asca         0.14
qe           0.14
plib         0.14
alu          0.14
immel        0.14
MethodImpl   0.14
Вік          0.14
554          0.14
emax         0.14
Activations Density 0.034%