INDEX
Explanations
phrases indicating organizational processes and actions
Negative Logits
errer       -0.18
ork         -0.18
ーナ        -0.17
gan         -0.16
rine        -0.15
.req        -0.15
structure   -0.15
ρός         -0.15
rehabilit   -0.14
frau        -0.14
POSITIVE LOGITS
establishment   0.21
receipt         0.19
receipt         0.19
creation        0.19
its             0.18
Its             0.17
への            0.16
formation       0.16
passage         0.16
introduction    0.16
Activations Density 0.205%