Explanations
statements indicating a contradiction or clarification
the phrase "in fact."
Negative Logits
FTWARE            -0.85
iris              -0.77
soDeliveryDate    -0.73
76561             -0.71
SourceFile        -0.70
asks              -0.68
ð                 -0.68
kamp              -0.66
Alert             -0.66
20439             -0.65
Positive Logits
essence           1.27
fact              1.24
theory            1.21
effect            1.20
turn              1.12
principle         1.11
aggregate         1.02
retrospect        1.01
hindsight         0.97
reality           0.93
Activations Density: 0.071%