INDEX
Explanations
positive attributes or qualities
expressions that highlight notable or significant aspects of various subjects
New Auto-Interp
Negative Logits
scl
-0.83
actionGroup
-0.73
bow
-0.70
COL
-0.69
tenance
-0.67
Cause
-0.67
kr
-0.66
rift
-0.66
cue
-0.64
DERR
-0.63
POSITIVE LOGITS
these
0.82
this
0.80
Kinnikuman
0.75
owning
0.70
modern
0.68
deploying
0.63
adding
0.63
designing
0.62
constructing
0.62
Banner
0.62
Activations Density 0.160%