INDEX
Explanations
phrases or words indicating negation or denial
the phrase "ain't," indicating informal or colloquial expressions
New Auto-Interp
Negative Logits
INC
-0.70
Impl
-0.63
Agency
-0.63
ULT
-0.62
Promotion
-0.62
=-=-=-=-=-=-=-=-
-0.62
EV
-0.61
ersen
-0.61
Carbuncle
-0.61
IAN
-0.61
POSITIVE LOGITS
't
0.96
ain
0.92
gin
0.88
gonna
0.83
\\\\\\\\
0.83
strument
0.82
ãĥ³ãĤ¸
0.82
ny
0.77
ga
0.76
thouse
0.76
Activations Density 0.008%