INDEX
Explanations
references to various types of centers, specifically those involved in community services or support
New Auto-Interp
Negative Logits
arten
-0.18
vat
-0.18
soever
-0.17
änd
-0.16
erged
-0.16
/her
-0.16
sp
-0.15
-speaking
-0.15
omen
-0.15
ÑĮми
-0.14
POSITIVE LOGITS
pieces
0.31
ing
0.25
ial
0.23
lain
0.21
most
0.20
fold
0.19
-left
0.19
piece
0.18
prises
0.18
ally
0.17
Activations Density 0.050%