INDEX
Explanations
phrases that indicate conditions or actions that should or should not be undertaken, often related to respect or guidelines
following prepositions
general nouns and conditions
Negative Logits
esternos    -0.75
ligiloj    -0.69
queſta    -0.69
चीज़ों    -0.66
Wikimedijinoj    -0.65
хьтан    -0.64
ValueStyle    -0.64
itſelf    -0.63
препратки    -0.62
tartalomajánló    -0.60
Positive Logits
0.55
_
0.50
.
0.46
ge
0.46
verhältnisse
0.45
1
0.45
L
0.44
0.44
L
0.43
iv
0.42
Activations Density 5.681%