Explanations
phrases related to legal actions or consequences
phrases related to risk or probability in decision-making contexts
Negative Logits
natureconservancy  -0.68
Nin                -0.63
kefeller           -0.60
artney             -0.59
urches             -0.55
Sutherland         -0.54
pires              -0.53
anchester          -0.53
atcher             -0.51
odcast             -0.51
Positive Logits
)).                0.76
)."                0.74
]."                0.71
%).                0.69
sic                0.68
%.                 0.62
elim               0.61
.'"                0.59
}.                 0.59
etc                0.59
Activations Density 1.495%