INDEX
Explanations
phrases starting with "Not so" and providing contrasting information
statements related to consent and disagreement
Negative Logits
quartered       -0.78
matical         -0.75
catentry        -0.75
UNCLASSIFIED    -0.70
imon            -0.70
viol            -0.69
iban            -0.67
axis            -0.67
construct       -0.64
serving         -0.62
Positive Logits
Shutterstock    0.73
Nguyen          0.71
Conn            0.69
ONG             0.69
TOR             0.68
Redditor        0.66
Turns           0.66
Stranger        0.66
DER             0.65
Vaughan         0.65
Activations Density: 1.255%