INDEX
Explanations
references to nature or the environment
nouns related to physical structures and natural elements
Negative Logits
tains       -0.77
soever      -0.71
Recomm      -0.65
PLA         -0.61
doms        -0.60
Recommend   -0.60
gery        -0.59
subjects    -0.56
nant        -0.56
Allows      -0.56
Positive Logits
were        1.14
are         1.12
weren       1.11
aren        1.06
remain      0.99
have        0.96
expire      0.90
revert      0.90
cape        0.90
collided    0.89
Activations Density 0.247%