INDEX
Explanations
phrases indicating contrast or comparisons
New Auto-Interp
Negative Logits
StatefulWidget
-0.57
astă
-0.56
Wikimedijinoj
-0.56
Népesség
-0.49
derabad
-0.48
iscope
-0.47
Such
-0.46
références
-0.45
poichè
-0.45
grine
-0.44
POSITIVE LOGITS
isso
1.16
eso
1.11
disso
0.85
vậy
0.85
itu
0.84
นั้น
0.83
ello
0.82
that
0.80
ça
0.77
cela
0.76
Activations Density 0.204%