INDEX
Explanations
technical or domain-specific terminology
Negative Logits
докум         -0.20
поба          -0.19
информа       -0.18
роничес       -0.17
ga            -0.16
cal           -0.15
è             -0.15
ово           -0.15
граду         -0.15
заболева      -0.15
Positive Logits
в             0.21
и             0.20
addCriterion  0.18
allas         0.17
Архів         0.17
же            0.17
$MESS         0.16
̆             0.16
№№            0.16
こそ          0.16
Activations Density 0.358%