INDEX
Explanations
expressions related to willingness and political intent
Negative Logits
็อก
-0.15
avel
-0.15
olley
-0.15
PLUGIN
-0.15
alue
-0.15
الرم
-0.15
ながら
-0.15
ìn
-0.15
uell
-0.14
bero
-0.14
Positive Logits
interest
0.30
willing
0.28
willingness
0.28
ready
0.28
readiness
0.26
Interest
0.24
appetite
0.23
interested
0.23
desire
0.23
motivation
0.23
Activations Density 0.241%