INDEX
Explanations
phrases that introduce contrasting points or contexts
New Auto-Interp
Negative Logits
olm
-0.55
despite
-0.50
obwohl
-0.48
despite
-0.45
nonostante
-0.45
日まで
-0.44
ammen
-0.44
sær
-0.43
zanim
-0.43
one
-0.42
POSITIVE LOGITS
Conversely
1.53
Conversely
1.41
conversely
1.39
Meanwhile
1.22
Meanwhile
1.19
Whereas
1.16
meanwhile
1.16
Sedangkan
1.15
natomiast
1.11
Likewise
1.10
Activations Density 0.368%