Explanations
significant mentions of groups or communities that involve individuals or customers
Negative Logits
собой    -0.17
它       -0.15
bạn      -0.15
oke      -0.14
собою    -0.14
нему     -0.14
你       -0.13
ويك      -0.13
оно      -0.13
ema      -0.13
Positive Logits
an            0.35
another       0.34
a             0.34
the           0.33
access        0.30
something     0.30
some          0.30
plenty        0.29
ample         0.28
permission    0.25
Activations Density 0.170%