Explanations
terms expressing a negative evaluation or judgment towards actions or ideas
Negative Logits
Tembelea         -0.78
LayoutStyle      -0.70
Accurate         -0.67
Accurate         -0.66
""],             -0.62
Приятного        -0.62
AutoScale        -0.61
lenker           -0.61
Euer             -0.59
XmlAccessorType  -0.59
Positive Logits
silly       1.04
stupid      0.89
silly       0.89
dumb        0.88
ridiculous  0.83
dumb        0.79
stupid      0.75
absurd      0.75
stupidly    0.73
embarrass   0.72
Activations Density 0.086%