INDEX
Explanations
number values or parameters associated with different categories or contexts
phrases indicating variability or comparisons across different categories or demographics
New Auto-Interp
Negative Logits
usions
-0.69
redits
-0.65
Buff
-0.65
Effects
-0.64
clips
-0.64
Trivia
-0.62
Adv
-0.62
ensibly
-0.61
phies
-0.61
Imp
-0.61
POSITIVE LOGITS
region
1.70
locality
1.69
province
1.68
country
1.56
municipality
1.54
jurisdiction
1.51
locale
1.44
continent
1.42
city
1.39
county
1.37
Activations Density 0.504%