INDEX
Explanations
sentences related to criticism or disapproval
New Auto-Interp
Negative Logits
disadvantaged
-0.55
subcontract
-0.54
Reincarnated
-0.54
oppable
-0.53
..............
-0.52
Tycoon
-0.52
discriminated
-0.52
customs
-0.51
Fighting
-0.50
Transition
-0.50
POSITIVE LOGITS
reader
0.84
journalistic
0.80
ascript
0.76
rhetorical
0.74
reader
0.74
canon
0.72
cynicism
0.72
entious
0.69
editorial
0.68
readers
0.68
Activations Density 1.161%