INDEX
Explanations
expressions of uncertainty or loss regarding existence and sustainability
New Auto-Interp
Negative Logits
somehow
-0.19
benh
-0.17
/*č↵
-0.15
igo
-0.15
udas
-0.15
quate
-0.15
IGO
-0.15
ãĥªãĥ¼ãĤº
-0.15
alia
-0.14
اعب
-0.14
POSITIVE LOGITS
anymore
1.09
nữa
0.58
lagi
0.43
again
0.34
longer
0.33
åĨį
0.31
artık
0.31
again
0.27
further
0.26
دÛĮگر
0.25
Activations Density 0.228%