INDEX
Explanations
instances of the word "almost" or phrases that emphasize near-completion
Negative Logits
iyat     -0.16
oren     -0.16
pes      -0.15
xor      -0.14
iyatı    -0.14
792      -0.14
hores    -0.14
ะ        -0.14
idd      -0.14
imens    -0.14
Positive Logits
itious        0.15
mente         0.15
ness          0.15
ny            0.15
s             0.14
importantly   0.14
-utils        0.14
然            0.14
ný            0.14
ある          0.14
Activations Density 0.041%