INDEX
Explanations
the word "unlike" and its variations to signal contrasts or differences
New Auto-Interp
Negative Logits
muſt
-0.70
becauſe
-0.61
papy
-0.57
subgoal
-0.56
zbęd
-0.56
урна
-0.56
presumption
-0.55
Annette
-0.55
corder
-0.54
alſo
-0.53
POSITIVE LOGITS
unlike
1.84
unlike
1.81
Unlike
1.69
Unlike
1.68
Contrary
0.91
Gegensatz
0.89
отличие
0.89
Like
0.82
diferencia
0.81
Contrary
0.81
Activations Density 0.096%