INDEX
Explanations
verbs related to instruction or guidance
New Auto-Interp
Negative Logits
raltar
-1.03
},"
-0.79
plom
-0.78
iya
-0.76
shall
-0.75
udence
-0.75
kr
-0.74
iry
-0.73
ãĤ´ãĥ³
-0.72
hire
-0.72
POSITIVE LOGITS
yourselves
1.04
matically
1.04
yourself
1.02
orously
1.02
ourselves
0.99
oneself
0.97
them
0.96
uate
0.95
imize
0.89
ively
0.88
Activations Density 2.627%