INDEX
Explanations
words related to policy-making or problem-solving
phrases related to problem-solving and policy development
New Auto-Interp
Negative Logits
anamo
-0.84
ghazi
-0.79
pour
-0.76
minus
-0.76
hid
-0.73
oland
-0.73
sorry
-0.73
itus
-0.69
bsite
-0.69
thank
-0.68
POSITIVE LOGITS
stronger
1.27
alternatives
1.24
solutions
1.23
clearer
1.22
meaningful
1.21
smarter
1.19
adequate
1.19
better
1.18
suitable
1.15
safer
1.13
Activations Density 0.296%