INDEX
Explanations
phrases related to taking a stand or asserting oneself
New Auto-Interp
Negative Logits
zo
-0.16
ury
-0.15
jec
-0.14
aste
-0.14
vals
-0.14
endoza
-0.13
pedia
-0.13
oge
-0.13
330
-0.13
urb
-0.13
POSITIVE LOGITS
tall
0.16
anni
0.15
taller
0.14
леж
0.14
alled
0.14
اÙĦØ´
0.14
-alone
0.14
Tall
0.14
alth
0.14
Monetary
0.14
Activations Density 0.033%