INDEX
Explanations
punctuation marks indicating the end of sentences or quotations
New Auto-Interp
Negative Logits
ness
-0.33
)
-0.33
visit
-0.31
pub
-0.28
jug
-0.28
pub
-0.26
unes
-0.24
ment
-0.24
ones
-0.24
),
-0.24
POSITIVE LOGITS
disambiguazione
0.98
AnchorStyles
0.93
verwijspagina
0.91
Personendaten
0.82
مشين
0.82
Administrativna
0.82
InstrumentedTest
0.81
Manbalar
0.81
majánló
0.77
للاسماء
0.77
Activations Density 0.953%