INDEX
Explanations
verbs in a language other than English related to clicking buttons, agreements, or postal services
foreign language
Negative Logits
AndEndTag        -0.81
مرئيه            -0.74
saraba           -0.68
nahilalakip      -0.65
parsedMessage    -0.64
InjectAttribute  -0.63
HtmlAttribute    -0.60
exitRule         -0.59
Setiap           -0.59
Sunda            -0.59
Positive Logits
σουμε            0.68
apunov           0.58
åt               0.57
ोंने             0.56
σετε             0.55
shadowOpacity    0.54
θούν             0.54
catalyzed        0.53
世界杯           0.53
définiti         0.52
Activations Density: 0.247%