INDEX
Explanations
phrases indicating choice or alternative options
conditional phrases indicating alternative scenarios or possibilities
Negative Logits
/          -0.74
ETS        -0.69
ocracy     -0.66
©¶æ        -0.65
Pen        -0.65
kamp       -0.65
\":        -0.65
)|         -0.64
onomy      -0.64
\/         -0.64
Positive Logits
alternatively  1.31
simply         1.13
outright       1.07
else           1.05
merely         0.95
chard          0.91
maybe          0.89
downright      0.86
phans          0.86
worse          0.84
Activations Density 0.141%