Explanations
phrases related to agreements or some form of official directives
terms related to agreements and calls to action
Negative Logits
Token         Logit
cale          -0.77
ecause        -0.74
vae           -0.69
ndum          -0.67
MORE          -0.66
agine         -0.65
poon          -0.64
ourke         -0.62
Interested    -0.62
stasy         -0.62
Positive Logits
Token         Logit
itself        0.82
iest          0.78
ariat         0.75
consists      0.71
revolves      0.70
consisted     0.70
factor        0.67
ultimate      0.66
interval      0.66
seemed        0.66
Activations Density: 0.437%
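
A density figure like the one above is commonly read as the fraction of sampled token positions on which the feature activates at all. That reading is an assumption here, since this page does not define the metric. Under that assumption, a minimal sketch of the calculation could look like the following; the function name `activation_density`, the `threshold` parameter, and the toy data are hypothetical and chosen only for illustration.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of token positions with a
    nonzero (above-threshold) activation. The page does not define it.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical toy data: 1000 token positions, 4 of which activate.
toy = [0.0] * 996 + [0.3, 1.2, 0.05, 2.4]
density = activation_density(toy)
print(f"{density:.3%}")  # prints 0.400%
```

On this reading, a density of 0.437% would mean the feature is active on roughly 4 to 5 of every 1000 sampled tokens.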