INDEX
Explanations
phrases indicating delegation or decision-making authority
phrases related to responsibility or decision-making
New Auto-Interp
Negative Logits
hello
-0.87
ãĤ¤ãĥĪ
-0.86
ãĥĥãĥĪ
-0.80
ä¹ĭ
-0.80
ãĥŃ
-0.78
iazep
-0.78
女
-0.78
isky
-0.77
artifacts
-0.76
rets
-0.73
POSITIVE LOGITS
whoever
1.10
Congress
0.99
discretion
0.99
individual
0.92
shoulders
0.92
us
0.91
policymakers
0.89
professionals
0.89
municipalities
0.87
bureaucrats
0.86
Activations Density 0.176%