INDEX
Explanations
inconsistencies or contradictions in written text
New Auto-Interp
Negative Logits
Rumble
-0.77
ocene
-0.66
Stockholm
-0.61
bang
-0.61
Dota
-0.59
Ventura
-0.58
stay
-0.57
genuinely
-0.56
ELY
-0.54
GET
-0.54
POSITIVE LOGITS
forward
0.81
situated
0.80
inclined
0.79
quartered
0.77
minded
0.76
apy
0.74
etheless
0.73
tainment
0.72
leep
0.72
nown
0.70
Activations Density 0.024%