Explanations
factual corrections in news articles
corrections
Negative Logits
kasarigan       -0.78
TintMode        -0.75
gameserver      -0.69
SwitchCompat    -0.64
ValueStyle      -0.63
RegressionTest  -0.59
CppCodeGen      -0.59
########.       -0.56
مرئيه           -0.53
vPvB            -0.52
Positive Logits
__':          0.71
...');        0.71
snippetHide   0.68
'):           0.67
>");          0.64
Efq           0.64
')):          0.63
]');          0.63
/>";          0.62
/>";          0.62
Activations Density 0.434%