INDEX
Explanations
punctuation and transitional descriptors in text
New Auto-Interp
Negative Logits
takže
-0.62
sondern
-0.61
joten
-0.58
就知道
-0.56
بلکه
-0.55
piram
-0.54
utan
-0.54
והוא
-0.53
はじめに
-0.53
hésite
-0.53
POSITIVE LOGITS
However
1.76
however
1.75
However
1.56
however
1.44
entanto
1.14
Однако
1.08
Cependant
1.07
Однако
1.05
Cependant
1.02
tuttavia
1.02
Activations Density 0.250%