INDEX
Explanations
words related to proposals, recommendations, and promises regarding policy and legislation
New Auto-Interp
Negative Logits
uste
-0.17
367
-0.16
ressing
-0.15
ongan
-0.15
otas
-0.14
/to
-0.14
yna
-0.13
erse
-0.13
ubo
-0.13
uala
-0.13
POSITIVE LOGITS
rằng
0.21
bahwa
0.19
that
0.19
ionario
0.17
that
0.17
/request
0.17
että
0.16
ëĮĢë¡ľ
0.15
something
0.15
entially
0.15
Activations Density 0.158%