INDEX
Explanations
verbs related to actions or processes
phrases emphasizing ongoing actions or states of being
Negative Logits
..............  -0.65
ichita          -0.63
option          -0.63
so              -0.60
Brigham         -0.60
envy            -0.60
utch            -0.57
PayPal          -0.55
request         -0.55
gif             -0.55
Positive Logits
oneself         0.77
olphin          0.71
Blaster         0.66
ebus            0.64
utical          0.63
Scientist       0.60
orously         0.60
redients        0.60
umenthal        0.60
ilda            0.60
Activations Density 0.070%