INDEX
Explanations
phrases expressing certainty or knowledge
expressions of knowing or awareness related to various situations
Negative Logits
Token                    Logit
SPONSORED                -0.72
udo                      -0.70
cour                     -0.68
enture                   -0.63
territ                   -0.62
certific                 -0.62
unia                     -0.62
BuyableInstoreAndOnline  -0.62
udeau                    -0.62
seiz                     -0.61
Positive Logits
Token                    Logit
)</                      0.71
sarc                     0.67
PLA                      0.61
Mean                     0.61
lied                     0.58
Slip                     0.58
)=                       0.58
!'"                      0.56
true                     0.55
ujah                     0.55
Activations Density 0.424%