Explanations
suggestions or recommendations
Negative Logits
Token     Logit
fare      -0.76
cler      -0.71
brance    -0.70
isol      -0.70
mania     -0.69
WB        -0.69
gie       -0.68
vin       -0.67
Ïī        -0.67
sup       -0.66
Positive Logits
Token           Logit
reconsider      0.84
alternatives    0.82
solutions       0.74
hypot           0.74
suggestions     0.73
explanations    0.72
alternative     0.71
aloud           0.71
remedies        0.69
alternate       0.69
Activations Density: 0.686%