Explanations
references to individuals or entities regarded as members of a group or organization
Negative Logits
styleType    -0.76
neux         -0.55
bium         -0.50
ifrance      -0.50
äu           -0.50
✨:           -0.50
міністра     -0.49
Према        -0.48
himo         -0.47
mators       -0.47
Positive Logits
member       2.56
Member       2.46
Member       2.32
member       2.26
MEMBER       2.19
MEMBER       1.88
Mitglied     1.53
miembro      1.32
membro       1.28
membre       1.21
Activation Density: 0.106%