INDEX
Explanations
negative sentiments or outcomes
Follows a period or colon
end of phrase or definition
New Auto-Interp
Negative Logits
;
-0.57
.
-0.52
;
-0.51
&
-0.49
-0.48
ino
-0.47
лка
-0.47
rines
-0.43
—
-0.43
view
-0.42
POSITIVE LOGITS
Datuak
1.28
:✨
1.05
uxxxx
1.04
XNUMX
0.93
💼
0.92
NUMX
0.91
الحياه
0.90
محفوظة
0.89
.*")]
0.87
بوابة
0.86
Activations Density 0.011%