INDEX
Explanations
phrases indicating legal or formal directives and actions, potentially related to government or law enforcement
symbols or special characters used in a political context
New Auto-Interp
Negative Logits
mares
-0.69
ousel
-0.67
fascination
-0.67
Elys
-0.67
Niet
-0.65
adore
-0.65
Leica
-0.64
rom
-0.63
fertility
-0.62
Romance
-0.62
POSITIVE LOGITS
ª
1.39
Ĵ
1.30
ij
1.28
¤
1.24
®
1.22
«
1.22
ı
1.16
IJ
1.16
°
1.15
ĸ
1.13
Activations Density 0.190%