Explanations
phrases related to uncertainty and confirmation
phrases that indicate the state or condition of something
Negative Logits
pmwiki      -0.99
Parameters  -0.80
SPONSORED   -0.78
ieties      -0.74
Marginal    -0.72
lengths     -0.71
*/(         -0.69
ité         -0.69
è¦ļéĨĴ      -0.68
estyles     -0.68
Positive Logits
genuine     1.16
legit       1.14
authentic   1.13
real        1.10
indeed      1.08
true        1.05
theirs      1.01
actually    1.00
fake        0.97
hers        0.96
Activations Density 0.276%
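
The density figure above is most naturally read as the share of token positions on which the feature fires at all. The page does not say how it was computed, so the following is only a minimal sketch under that assumption. The function name `activation_density`, the zero threshold, and the sample data are all hypothetical and are not taken from the dashboard.

```python
from typing import Sequence


def activation_density(activations: Sequence[float], threshold: float = 0.0) -> float:
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of token positions where the
    feature is active (activation strictly above the threshold).
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 100,000 token positions, 276 of them active.
sample = [0.0] * 99_724 + [1.5] * 276
density = activation_density(sample)
print(f"{density:.3%}")
```

With a definition like this, a density well under 1% would describe a sparse feature that fires on only a small share of tokens.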