INDEX
Explanations
numerical values and percentages
New Auto-Interp
Negative Logits
ank
-0.17
vez
-0.16
erm
-0.15
074
-0.15
ote
-0.15
les
-0.15
yster
-0.15
ekl
-0.15
exter
-0.15
Grat
-0.15
POSITIVE LOGITS
altogether
0.17
ì´Ŀ
0.16
çľ¾
0.16
-tip
0.15
KANJI
0.15
UMENT
0.15
sik
0.14
ãģªãģĮ
0.14
sled
0.14
iek
0.14
Activations Density 0.128%