INDEX
Explanations
corrections or editorial notes within text
instances of the word "correction" and its variations
Negative Logits
atos     -0.94
bang     -0.76
axy      -0.72
ramid    -0.69
ulhu     -0.68
gha      -0.67
aden     -0.65
WAYS     -0.65
ahan     -0.64
enf      -0.64
Positive Logits
inaccur       0.77
leveled       0.76
Correction    0.74
Reviewed      0.71
Corrections   0.69
ettings       0.68
Correction    0.66
empt          0.66
Correct       0.66
inaccurate    0.65
Activations Density 0.027%