INDEX
Explanations
locations or objects situated in a high position
phrases indicating a position or location in relation to other objects
New Auto-Interp
Negative Logits
len
-0.81
gm
-0.79
nel
-0.73
ties
-0.70
д
-0.70
nery
-0.69
nc
-0.68
ners
-0.68
arians
-0.66
zel
-0.66
POSITIVE LOGITS
ĺħ
0.92
atop
0.89
owered
0.76
ĪĴ
0.74
ļé
0.74
hap
0.73
eous
0.73
İĭ
0.72
avorite
0.71
htaking
0.71
Activations Density 0.030%