INDEX
Explanations
specific nouns and the start of phrases
New Auto-Interp
Negative Logits
กลุ่ม
0.52
cesz
0.51
d
0.50
inside
0.49
rufo
0.48
insisting
0.46
প্রতিষ্ঠাতা
0.46
mall
0.45
کھیلنا
0.45
gosta
0.45
POSITIVE LOGITS
repayment
0.45
elect
0.44
repaid
0.44
SD
0.43
a
0.42
I
0.41
nonthermal
0.41
i
0.41
са
0.41
funded
0.40
Activations Density 0.001%