INDEX
Explanations
references to large groups of people or populations
New Auto-Interp
Negative Logits
ï¸ı
-0.23
zelf
-0.18
ity
-0.16
crowded
-0.16
utom
-0.15
dorf
-0.15
á»ĭch
-0.15
itarian
-0.15
/do
-0.15
oyo
-0.14
POSITIVE LOGITS
ourced
0.28
ourcing
0.24
gather
0.18
source
0.17
ings
0.17
gathered
0.17
favorites
0.17
urf
0.17
Spell
0.16
-control
0.16
Activations Density 0.019%