INDEX
Explanations
instances of the word "and" indicating connections or additions in the text
New Auto-Interp
Negative Logits
iol
-0.16
alles
-0.16
vant
-0.15
ëĿ½
-0.15
anko
-0.14
éĽ¢
-0.14
(*(
-0.14
ÐĴÐŀ
-0.14
chemy
-0.13
eteor
-0.13
POSITIVE LOGITS
orm
0.14
shift
0.14
disc
0.14
ãģ«ãģ¨
0.13
558
0.13
;
0.13
vis
0.13
aklı
0.13
466
0.13
¸
0.13
Activations Density 0.304%