INDEX
Explanations
instances of the word "despite" indicating a contrast or contradiction
New Auto-Interp
Negative Logits
lampe
-0.42
ldc
-0.39
Русский
-0.37
tomado
-0.36
Bound
-0.35
ön
-0.34
下一个
-0.33
pén
-0.33
selaer
-0.33
tagext
-0.33
POSITIVE LOGITS
despite
1.34
despite
1.27
nonostante
1.20
Despite
1.19
Despite
1.15
ostante
1.13
malgré
1.12
Malgré
1.09
trotz
1.09
رغم
0.95
Activations Density 0.175%