INDEX
Explanations
locations/countries
references to countries and regions
New Auto-Interp
Negative Logits
lik
-0.57
behavi
-0.54
distingu
-0.52
streng
-0.49
reconc
-0.49
itness
-0.48
attribute
-0.47
tee
-0.47
answ
-0.47
curve
-0.45
POSITIVE LOGITS
,
0.84
and
0.76
etc
0.74
thia
0.73
ibia
0.73
Philippines
0.70
Territory
0.69
Arabia
0.68
Netherlands
0.67
ania
0.67
Activations Density 0.103%