INDEX
Explanations
references to societal structures and community activities
New Auto-Interp
Negative Logits
’s
-0.27
's
-0.27
(s
-0.23
´s
-0.22
to
-0.22
sWith
-0.22
the
-0.21
‘s
-0.21
`s
-0.21
type
-0.21
POSITIVE LOGITS
'
0.45
’
0.44
cape
0.43
heets
0.41
ides
0.33
cales
0.33
pace
0.32
pecific
0.31
uits
0.30
ystems
0.30
Activations Density 4.419%