INDEX
Explanations
possessive pronouns and apostrophes indicating ownership
New Auto-Interp
Negative Logits
ognuno
-0.57
entrambi
-0.54
ciascuno
-0.53
'
-0.53
respectivamente
-0.52
femininas
-0.52
scoperta
-0.52
démocr
-0.51
avoient
-0.51
copertina
-0.51
POSITIVE LOGITS
“
1.42
’)
1.40
’”
1.38
own
1.27
’).
1.26
‘
1.21
.’”
1.21
’,
1.20
”),
1.17
,’”
1.14
Activations Density 0.215%