INDEX
Explanations
phrases related to legal or political discussions
phrases that involve political authority and mandates
New Auto-Interp
Negative Logits
incorpor
-0.56
erenn
-0.52
Neb
-0.51
Gloss
-0.49
achus
-0.49
Cynthia
-0.48
Lumpur
-0.48
glim
-0.47
avia
-0.47
wagon
-0.46
POSITIVE LOGITS
)?
1.01
)).
0.78
"?
0.77
Ħ¢
0.75
?).
0.74
?",
0.74
?".
0.73
)),
0.73
'?
0.71
?ãĢį
0.70
Activations Density 2.595%