INDEX
Explanations
words related to hierarchy or organization, especially when structured in a sequential or ranking order
phrases that include the word "order" followed by numerical values or related descriptors
New Auto-Interp
Negative Logits
stump
-0.68
iov
-0.67
></
-0.65
iland
-0.65
>[
-0.64
iday
-0.64
gow
-0.62
>]
-0.62
zinski
-0.61
)].
-0.60
POSITIVE LOGITS
magnitude
1.22
precedence
1.00
rontal
0.86
thumb
0.75
lege
0.72
ety
0.72
nuns
0.70
particulars
0.70
affairs
0.70
Templ
0.69
Activations Density 0.050%