INDEX
Explanations
questions that express capability or possibility
New Auto-Interp
Negative Logits
мага
-0.16
ullan
-0.15
ode
-0.15
owie
-0.15
enders
-0.15
Callable
-0.15
ninger
-0.14
asio
-0.14
ikan
-0.14
erty
-0.14
POSITIVE LOGITS
inho
0.16
ipeg
0.15
inci
0.15
IgnoreCase
0.14
yaw
0.14
mmc
0.13
strument
0.13
Misc
0.13
Karlov
0.13
Bias
0.13
Activations Density 0.026%