Explanations
instances where a statement or story contradicts initial reports or recorded views
the word "that" in various contexts
Negative Logits
atre      -0.67
stead     -0.67
Laughs    -0.65
å¸        -0.63
gur       -0.63
esi       -0.60
Twe       -0.60
DER       -0.59
kamp      -0.58
hare      -0.58
Positive Logits
accompanies   1.13
prevailed     1.02
resulted      1.00
contradicts   0.99
preceded      0.99
existed       0.98
governs       0.97
justifies     0.95
arose         0.95
occurred      0.93
Activations Density: 0.214%