INDEX
Explanations
verbs related to actions or commands
verbs related to creation and influence
Negative Logits
cone          -0.73
requent       -0.73
Seeking       -0.73
described     -0.72
dozen         -0.69
WATCHED       -0.68
cum           -0.67
hess          -0.65
ゴン          -0.65
Stronghold    -0.65
Positive Logits
innocent       0.90
unwitting      0.89
us             0.85
insulting      0.83
scapego        0.82
undue          0.81
oneself        0.81
unnecessary    0.81
people         0.80
everybody      0.79
Activations Density: 0.259%