INDEX
Explanations
words related to positive impact or approval
New Auto-Interp
Negative Logits
anos
-0.65
bley
-0.62
atten
-0.59
mare
-0.58
kas
-0.58
zan
-0.55
mur
-0.55
mber
-0.55
rys
-0.55
inity
-0.55
POSITIVE LOGITS
as
0.64
},"
0.63
onse
0.61
]);
0.59
Parameters
0.59
};
0.58
FTWARE
0.58
isSpecialOrderable
0.57
atically
0.56
.:
0.56
Activations Density 0.933%