INDEX
Explanations
locations and weather-related terms
information related to weather, cold climates, and geographical locations
New Auto-Interp
Negative Logits
omission
-0.69
oyal
-0.62
agent
-0.60
informant
-0.58
agent
-0.55
Offer
-0.54
accuracy
-0.54
linem
-0.53
slurs
-0.53
è£ħ
-0.52
POSITIVE LOGITS
opolis
0.79
Population
0.78
population
0.75
democracy
0.71
stagnant
0.69
economically
0.65
industrialized
0.65
democracies
0.63
populous
0.63
birthplace
0.61
Activations Density 1.873%