INDEX
Explanations
phrases expressing intentions or desires
New Auto-Interp
Negative Logits
OGND
-0.97
最快更新
-0.94
expandindo
-0.90
<bos>
-0.81
المعيارى
-0.79
SourceChecksum
-0.79
theless
-0.79
nahilalakip
-0.78
itſelf
-0.77
✨:
-0.77
POSITIVE LOGITS
wanna
1.02
Wanna
0.85
Wanna
0.84
wanna
0.82
veulent
0.79
willen
0.79
want
0.78
querem
0.78
voulu
0.71
souhaitent
0.70
Activations Density 0.108%