Explanations
instances of the word "even" indicating emphasis or contrast
Negative Logits
Token        Logit
ersistent    -0.15
à¹ĥà¸Ķ       -0.15
/Gate        -0.15
ByExample    -0.14
umd          -0.14
osloven      -0.14
обÑĢеÑĤ      -0.13
senal        -0.13
å¨ĺ          -0.13
optic        -0.13
Positive Logits
Token        Logit
though       0.50
though       0.39
Though       0.38
Though       0.37
aunque       0.27
ings         0.25
iment        0.22
tho          0.22
with         0.22
ness         0.21
Activations Density: 0.045%