INDEX
Explanations
phrases related to physical actions or instructions
expressions related to taking action or suggesting involvement
New Auto-Interp
Negative Logits
independ
-0.83
`.
-0.81
Advertisements
-0.80
Whilst
-0.76
´
-0.75
doesnt
-0.74
________________________________________________________________
-0.73
********************************
-0.72
Firstly
-0.72
whilst
-0.71
POSITIVE LOGITS
Enlarge
0.83
—"
0.82
Rodham
0.70
toggle
0.68
—
0.65
persuaded
0.65
Giuliani
0.64
—
0.63
Suppose
0.63
Cohn
0.61
Activations Density 1.250%