INDEX
Explanations
patterns of ellipses and scores of 4 and above, indicating significance or emphasis in the text
New Auto-Interp
Negative Logits
ereal
-0.17
ployment
-0.16
avo
-0.15
474
-0.15
ubo
-0.14
ssue
-0.14
ayar
-0.14
ÑĢави
-0.14
allo
-0.14
aimassage
-0.14
POSITIVE LOGITS
iversit
0.15
urs
0.15
distance
0.15
ifer
0.14
stanov
0.14
ButtonModule
0.14
unes
0.13
št
0.13
Spatial
0.13
because
0.13
Activations Density 0.085%