INDEX
Explanations
punctuation marks at the end of sentences and delimiters
New Auto-Interp
Negative Logits
ptr
-0.18
rost
-0.16
oton
-0.16
ussen
-0.15
ocker
-0.15
uth
-0.14
urge
-0.14
opak
-0.14
ê¸Ī
-0.14
Dale
-0.14
POSITIVE LOGITS
Gow
0.18
konkrét
0.16
šet
0.15
ayet
0.15
à¸Ńà¸ĩà¸Īาà¸ģ
0.15
ɵ
0.14
ez
0.14
елÑİ
0.14
ighton
0.14
oui
0.14
Activations Density 0.012%