INDEX
Explanations
phrases indicating improvement or benefit
phrases indicating whether something is better or worse off
New Auto-Interp
Negative Logits
Gall
-0.74
FK
-0.72
forestation
-0.67
ãĤ±
-0.67
SPA
-0.66
MAC
-0.64
ART
-0.64
Ming
-0.64
Export
-0.63
ologic
-0.62
POSITIVE LOGITS
financially
0.77
looking
0.76
amia
0.75
course
0.75
eem
0.72
nings
0.71
usa
0.67
acers
0.65
seiz
0.65
ners
0.64
Activations Density 0.014%