INDEX
Explanations
phrases related to positions or actions of authority
words related to physical actions or states of being
New Auto-Interp
Negative Logits
roma
-0.69
PORT
-0.64
Means
-0.64
ONES
-0.63
ARB
-0.60
Rog
-0.56
Sciences
-0.56
ACTED
-0.54
Emerson
-0.53
offer
-0.53
POSITIVE LOGITS
ked
1.36
ky
1.32
king
1.26
kers
1.22
ks
1.11
kered
1.01
ksh
1.00
kish
0.94
ker
0.92
ki
0.90
Activations Density 0.127%