INDEX
Explanations
pronouns referring to groups of people
Negative Logits
onal            -0.78
heny            -0.71
rought          -0.65
emis            -0.64
Federation      -0.63
ion             -0.62
Mub             -0.61
olid            -0.61
wire            -0.61
microsoft       -0.58
Positive Logits
é¾įåĸļ士          0.99
selves          0.78
ternally        0.75
ortium          0.73
DragonMagazine  0.70
çīĪ             0.69
gypt            0.68
ãģ¯             0.68
imei            0.67
æ³              0.67
Activations Density 0.368%