INDEX
Explanations
phrases or sentences starting with "Let's"
New Auto-Interp
Negative Logits
ELD
-0.69
promoted
-0.67
eur
-0.61
supported
-0.60
Palest
-0.59
announced
-0.59
Eighth
-0.58
Guardian
-0.57
IER
-0.57
exhibited
-0.56
POSITIVE LOGITS
summarize
1.08
clarify
1.03
pretend
1.01
assume
1.01
examine
0.98
simplify
0.96
revisit
0.95
recap
0.95
analyse
0.94
proceed
0.94
Activations Density 0.013%