INDEX
Explanations
the word "even" in various contexts
New Auto-Interp
Negative Logits
еком
-0.16
oves
-0.14
antasy
-0.14
ersistent
-0.14
strr
-0.14
ÑĢÑĥб
-0.14
anel
-0.14
addir
-0.13
zbo
-0.13
emouth
-0.13
POSITIVE LOGITS
though
0.46
Though
0.38
Though
0.38
though
0.38
aunque
0.23
tho
0.22
since
0.19
èϽçĦ¶
0.18
èϽ
0.18
Tho
0.17
Activations Density 0.029%