INDEX
Explanations
phrases that indicate a majority or commonality among a group of entities
New Auto-Interp
Negative Logits
zx
-0.60
pursuant
-0.58
liger
-0.58
افته
-0.57
auge
-0.57
不断的
-0.56
Nazi
-0.55
er
-0.55
dedo
-0.55
Spicer
-0.54
POSITIVE LOGITS
fleste
1.30
flesta
1.28
meisten
1.25
meeste
1.20
plupart
1.18
most
1.16
большинство
1.12
most
1.10
Most
1.04
maioria
0.99
Activations Density 0.139%