INDEX
Explanations
error corrections in news articles
corrections or clarifications to previously stated information
New Auto-Interp
Negative Logits
Mods
-0.76
Ń·
-0.69
mods
-0.66
SpaceEngineers
-0.64
abe
-0.63
¥µ
-0.62
Cthulhu
-0.61
Frameworks
-0.61
Plex
-0.61
soDeliveryDate
-0.61
POSITIVE LOGITS
spelling
0.86
incorrectly
0.85
corrected
0.85
typo
0.84
incorrect
0.82
mistakenly
0.80
Clar
0.78
erroneous
0.75
cler
0.74
headline
0.74
Activations Density 0.143%