INDEX
Explanations
instances of the word "contradict" and its variations
Negative Logits
gins         -0.80
enfranch     -0.75
fare         -0.73
NetMessage   -0.72
itizen       -0.70
itizens      -0.69
ixtape       -0.68
emetery      -0.67
ogo          -0.66
aii          -0.66
Positive Logits
substant        0.90
assertions      0.89
contradict      0.89
statements      0.86
contradictory   0.86
assumptions     0.79
contradicted    0.79
edly            0.79
Shack           0.77
everything      0.76
Activations Density: 0.011%