INDEX
Explanations
phrases with commatas
overall percentages related to public opinions or demographics
New Auto-Interp
Negative Logits
minster
-0.84
iously
-0.81
RL
-0.71
aciously
-0.70
MAX
-0.66
inar
-0.66
rou
-0.66
Laughs
-0.66
bor
-0.64
ãĥ¥
-0.64
POSITIVE LOGITS
however
1.26
meanwhile
1.00
moreover
0.93
though
0.90
according
0.86
although
0.83
excluding
0.79
therefore
0.77
suffice
0.75
analysts
0.74
Activations Density 0.411%