INDEX
Explanations
phrases indicating contradiction or inconsistency
instances of the word "contradict" and its variations
New Auto-Interp
Negative Logits
emetery
-0.85
sale
-0.85
awar
-0.84
home
-0.75
Home
-0.75
RNA
-0.75
ttp
-0.74
outh
-0.73
aws
-0.72
rug
-0.72
POSITIVE LOGITS
contradict
1.42
contradicted
1.29
contradicts
1.26
contradictory
1.11
contradictions
1.02
substant
0.91
ĸļ
0.87
conflicting
0.85
corrobor
0.82
undermin
0.79
Activations Density 0.005%