INDEX
Explanations
conjunctions, particularly the word "and"
New Auto-Interp
Negative Logits
®,
-0.15
landers
-0.15
ollider
-0.14
edBy
-0.14
orks
-0.13
ities
-0.13
/of
-0.13
amp
-0.13
agger
-0.13
egt
-0.13
POSITIVE LOGITS
istrovstvÃŃ
0.18
ìĿ´ëĬĶ
0.18
though
0.18
zwar
0.17
бо
0.15
since
0.15
although
0.15
albeit
0.15
while
0.15
yet
0.14
Activations Density 0.353%