INDEX
Explanations
punctuation marks, particularly periods and commas
Negative Logits
attempt        -0.07
gings          -0.07
βολ            -0.07
VERTISEMENT    -0.07
Attempt        -0.06
lendirme       -0.06
ÏĨεÏģ          -0.06
AGR            -0.06
uai            -0.06
à¹īาà¸ĩ        -0.06
Positive Logits
themselves     0.07
sayesinde      0.07
otherwise      0.07
-sama          0.07
even           0.07
resulting      0.06
avid           0.06
rof            0.06
pit            0.06
cul            0.06
Activations Density 0.048%