INDEX
NEGATIVE LOGITS
renaming        0.41
دکھ             0.36
downloadable    0.34
দিয়েছে          0.34
classifies      0.34
있을            0.33
characterizes   0.33
Relating        0.33
Much            0.33
Terre           0.33
POSITIVE LOGITS
enforced        0.63
violated        0.60
exercised       0.57
fulfilled       0.54
invoked         0.54
established     0.52
activated       0.50
établie         0.50
соблю           0.50
applied         0.49
Activations Density: 0.022%