INDEX
Explanations
phrases related to controversial topics and opinions
Negative Logits
ngth            -0.78
luaj            -0.73
opez            -0.71
ividual         -0.68
aneers          -0.67
ifferent        -0.62
perty           -0.61
ebted           -0.60
isively         -0.60
vertisements    -0.60
Positive Logits
happening             0.91
true                  0.82
understandable        0.81
quickShipAvailable    0.80
why                   0.79
compounded            0.79
untrue                0.76
blasphemy             0.74
reassuring            0.73
SPONSORED             0.72
Activations Density 3.905%