Explanations
phrases indicating rationale or justification
Negative Logits
multer          -0.46
دانشنامهٔ       -0.44
Jîn             -0.42
dengan          -0.39
djangoproject   -0.39
Попис           -0.38
cotch           -0.37
ASKET           -0.36
-0.36
-0.35
Positive Logits
why             0.97
why             0.70
mengapa         0.68
warum           0.64
kenapa          0.63
Reason          0.62
WHY             0.62
behind          0.59
reason          0.58
waarom          0.56
Activations Density 0.240%
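
The density figure above is usually read as the share of token positions on which the feature fires at all. The sketch below illustrates that reading only; the function name `activation_density`, the `threshold` parameter, and the sample activations are hypothetical and are not taken from the dashboard or from any tool's API.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means (number of active tokens) / (total tokens),
    where a token counts as active if its activation is strictly above threshold.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 6 active positions out of 2500 tokens.
sample = [0.0] * 2494 + [1.2, 0.4, 3.1, 0.9, 2.2, 0.7]
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.240%
```

Under this reading, a density of 0.240% means the feature is active on roughly 24 of every 10,000 tokens, which fits a feature tied to a narrow family of words such as "why" and "reason".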