INDEX
Explanations
editor's notes within the text
editorial notes and corrections in text
NEGATIVE LOGITS
pload       -0.75
manif       -0.74
ŃĶ          -0.69
scrim       -0.65
ĪĴ          -0.65
decom       -0.63
forms       -0.63
sewage      -0.62
ãĥĩ         -0.62
impe        -0.62
POSITIVE LOGITS
:]          0.78
HuffPost    0.75
note        0.75
BOOK        0.73
Editors     0.73
:           0.73
Edited      0.71
Corrections 0.70
":"","      0.70
NOTE        0.69
Activations Density 0.025%