INDEX
Explanations
references to patriarchal structures and concepts
New Auto-Interp
Negative Logits
uales
-0.17
illis
-0.15
uations
-0.15
Ñĩини
-0.15
enaire
-0.15
enders
-0.15
ittings
-0.15
βο
-0.15
IGO
-0.15
gel
-0.15
POSITIVE LOGITS
archy
0.42
arch
0.41
archs
0.38
ARCH
0.30
archical
0.30
arch
0.28
аÑĢÑħ
0.25
otic
0.25
аÑĢÑħ
0.24
Arch
0.24
Activations Density 0.010%