INDEX
Explanations
verbs that express command or caution
Negative Logits
OGND            -0.69
pylint          -0.66
requestData     -0.62
subsubsection   -0.61
pisah           -0.61
herself         -0.58
pira            -0.57
hingga          -0.57
gheny           -0.56
řel             -0.56
Positive Logits
Donny           0.93
Dont            0.83
Doy             0.80
TagHelper       0.76
Dont            0.76
dont            0.75
Jangan          0.74
beware          0.73
勿              0.72
Don             0.72
Activations Density: 0.069%