Explanations
passages in the text that require correction or clarification
Negative Logits
Token            Logit
atos             -0.90
NetMessage       -0.76
soDeliveryDate   -0.74
axy              -0.71
Hots             -0.70
esthetic         -0.66
ramid            -0.66
iland            -0.65
joining          -0.65
join             -0.63
Positive Logits
Token            Logit
inaccur          0.85
misinformation   0.78
hift             0.70
inaccurate       0.70
erroneous        0.70
endum            0.69
leveled          0.68
Corrections      0.67
Correction       0.66
errors           0.64
Activations Density: 0.026%