INDEX
Explanations
phrases indicating contrasting or differing opinions
phrases indicating contrasting opinions or evaluations
New Auto-Interp
Negative Logits
ĸļ
-0.78
andise
-0.75
INS
-0.66
gross
-0.66
Attach
-0.61
Constructed
-0.60
ucc
-0.60
subscribing
-0.60
ensor
-0.60
ETS
-0.59
POSITIVE LOGITS
Others
1.33
others
1.20
Others
1.17
ones
0.78
some
0.69
mouth
0.68
latter
0.67
Ago
0.63
juveniles
0.61
mostly
0.61
Activations Density 0.422%