INDEX
Explanations
phrases suggesting options or alternatives
phrases that suggest alternatives or conditional scenarios
New Auto-Interp
Negative Logits
IDENT
-0.69
emi
-0.68
owner
-0.67
estro
-0.64
onday
-0.64
ETS
-0.62
igr
-0.61
ires
-0.61
UD
-0.61
ursday
-0.60
POSITIVE LOGITS
chard
1.23
alternatively
1.17
Else
1.05
else
0.98
acular
0.94
nery
0.93
whatever
0.88
otherwise
0.88
maybe
0.85
worse
0.85
Activations Density 0.065%