INDEX
Explanations
imperative verbs followed by direct objects
Negative Logits
Languages            -0.65
Zen                  -0.65
availability         -0.63
Interstitial         -0.63
inger                -0.62
bage                 -0.62
liest                -0.60
marine               -0.59
DAQ                  -0.58
natureconservancy    -0.58
Positive Logits
tered     1.14
tering    0.95
icia      0.95
us        0.92
me        0.80
loose     0.78
itia      0.78
him       0.76
ting      0.75
slip      0.73
Activations Density 2.857%