INDEX
Explanations
conjunctions indicating contrast or opposition
Negative Logits
Majefty       -0.66
comfy         -0.66
Shakspeare    -0.65
͡°            -0.59
grandkids     -0.58
Efq           -0.58
Продам        -0.57
שלנו          -0.55
לכם           -0.55
loving        -0.54
Positive Logits
although        0.84
tuttavia        0.81
however         0.77
findpost        0.77
toutefois       0.73
новниш          0.72
AddTagHelper    0.72
tačiau          0.71
However         0.68
تضيفلها         0.67
Activations Density: 0.254%