INDEX
Explanations
phrases indicating opinions or evaluations
negations or phrases expressing the opposite of a statement
New Auto-Interp
Negative Logits
"(
-0.56
thinkable
-0.54
IRE
-0.54
Is
-0.54
Assembly
-0.53
osi
-0.52
ahs
-0.51
bathing
-0.51
'd
-0.50
idian
-0.50
POSITIVE LOGITS
apego
0.67
itself
0.66
acronym
0.65
retty
0.61
decentralized
0.60
decentral
0.58
disbanded
0.58
phased
0.57
igmatic
0.57
Balk
0.55
Activations Density 1.062%