INDEX
Explanations
articles, both definite and indefinite, in the text
articles followed by nouns/adjectives
New Auto-Interp
Negative Logits
Попис
-0.52
nemlig
-0.50
iaitu
-0.45
étoit
-0.41
feroit
-0.40
seamnă
-0.36
yaitu
-0.36
auroit
-0.35
pouvoit
-0.34
namelijk
-0.33
POSITIVE LOGITS
uxxxx
0.64
defaultstate
0.63
lastly
0.57
その他
0.57
めでとう
0.56
الحياه
0.56
RTHOOK
0.55
etc
0.55
downright
0.54
subsequent
0.53
Activations Density 0.083%