INDEX
Explanations
verbs related to taking back or withdrawing something
terms related to retraction or correction of published content
New Auto-Interp
Negative Logits
amac
-0.89
pn
-0.84
places
-0.82
atown
-0.78
swick
-0.77
arding
-0.77
eers
-0.76
drivers
-0.73
fire
-0.73
eson
-0.71
POSITIVE LOGITS
retract
0.98
raction
0.87
ractions
0.86
retracted
0.83
guiActive
0.71
unsub
0.70
ŃĶ
0.69
confessions
0.69
iple
0.68
facult
0.66
Activations Density 0.020%