INDEX
Explanations
names of individuals, especially athletes
proper names, particularly those of individuals involved in sports and politics
New Auto-Interp
Negative Logits
skirts
-0.79
UCK
-0.75
UME
-0.73
hips
-0.72
à©
-0.71
adder
-0.70
OWS
-0.70
Reviewed
-0.69
Continued
-0.68
ITNESS
-0.68
POSITIVE LOGITS
Gian
1.22
nis
0.92
fort
0.80
lapt
0.77
amas
0.76
ster
0.75
iani
0.74
emy
0.73
Sic
0.73
Lorenzo
0.71
Activations Density 0.006%