INDEX
Explanations
phrases indicating different levels or stages of something
phrases indicating thresholds or levels of measurement
New Auto-Interp
Negative Logits
indications
-0.69
packages
-0.66
evidence
-0.63
enegger
-0.61
Orders
-0.60
invitations
-0.59
instructions
-0.59
constructs
-0.59
examples
-0.58
NP
-0.57
POSITIVE LOGITS
venge
1.51
rouse
1.16
halt
0.96
predetermined
0.96
manageable
0.86
point
0.82
coma
0.79
lesser
0.79
certain
0.78
whopping
0.77
Activations Density 0.122%