INDEX
Explanations
locations within cities
geographic names of cities and locations
New Auto-Interp
Negative Logits
ecause
-0.64
cause
-0.59
sbm
-0.58
namely
-0.57
sparing
-0.54
viz
-0.53
nodd
-0.53
Article
-0.53
sugg
-0.52
reapp
-0.52
POSITIVE LOGITS
,
1.02
,-
0.94
,,,,,,,,
0.74
,,
0.67
CITY
0.65
!,
0.63
)=(
0.63
,.
0.61
.,
0.60
?,
0.58
Activations Density 0.141%