INDEX
Explanations
sentences containing the phrase "contrary to" followed by various statements
phrases that contrast popular beliefs or expectations
Negative Logits
estones     -0.94
onna        -0.77
esses       -0.75
oided       -0.74
ross        -0.73
olini       -0.73
leground    -0.70
hetto       -0.69
azz         -0.69
ITED        -0.67
Positive Logits
conventional    0.73
prevailing      0.72
orthodox        0.71
altogether      0.69
ptions          0.69
ãĥĪ             0.64
usual           0.64
Wink            0.63
intuitive       0.63
extant          0.62
Activations Density: 0.081%