INDEX
Explanations
contradictory statements or situations
phrases indicating opposition or contradiction
New Auto-Interp
Negative Logits
ilitating
-0.81
rod
-0.76
killer
-0.74
artney
-0.73
oleon
-0.72
anus
-0.71
allo
-0.70
dust
-0.68
arij
-0.67
oya
-0.66
POSITIVE LOGITS
notwithstanding
0.87
etheless
0.79
lihood
0.77
ly
0.76
guiActiveUn
0.76
SPONSORED
0.75
contrary
0.73
ptions
0.69
nces
0.67
clock
0.65
Activations Density 0.028%