INDEX
Explanations
negations and expressions of uncertainty
"t" followed by a word indicating thought or knowledge
don't know, don't believe, don't get
Negative Logits
linkovi      -0.37
出版年        -0.35
rokken       -0.33
saurait      -0.33
trouvez      -0.33
fallu        -0.32
conveniente  -0.32
appena       -0.32
Bronnen      -0.32
ucapnya      -0.32
Positive Logits
informée     0.60
LEncoder     0.53
argout       0.52
Ahnung       0.51
know         0.50
KNOW         0.50
KNOW         0.49
فريبيس       0.49
Rohy         0.49
trăm         0.49
Activations Density 0.331%
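
As a rough illustration of the density figure above, here is a minimal sketch that assumes "activation density" means the percentage of token positions on which the feature's activation is above zero. That definition is an assumption, as is the helper name `activation_density`; the sample activations are made up and are not taken from this feature.

```python
def activation_density(activations, threshold=0.0):
    """Return the percentage of token positions whose activation exceeds threshold.

    Assumed definition: density = 100 * (positions where the feature fires) / (all positions).
    """
    if not activations:
        return 0.0
    fired = sum(1 for a in activations if a > threshold)
    return 100.0 * fired / len(activations)


# Hypothetical sample: the feature fires on 3 of 1000 token positions.
sample = [0.0] * 997 + [1.2, 0.4, 3.1]
density = activation_density(sample)
print(f"{density:.3f}%")
```

Under this reading, a density of 0.331% would mean the feature fires on roughly 3 of every 1000 tokens, which fits a feature tied to a narrow pattern such as "don't know".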