INDEX
Explanations
sports-related statistics and player achievements
before pronouns
sports achievements and countries
Negative Logits
propOrder       -0.60
Snapdragon      -0.56
nawr            -0.56
bezeichneter    -0.55
esgue           -0.54
partimento      -0.50
gemeester       -0.49
celia           -0.49
Agra            -0.49
Bastille        -0.47
Positive Logits
Britain         0.71
India           0.69
Nigeria         0.65
فريبيس          0.64
England         0.60
our             0.60
Australia       0.59
我国            0.59
America         0.59
咱们            0.59
Activations Density 0.577%