INDEX
Negative Logits
ฆ
0.42
遅
0.41
避
0.40
स्टार्ट
0.39
猶
0.38
埃
0.37
amenazas
0.36
iciary
0.36
кален
0.36
osy
0.36
POSITIVE LOGITS
effective
0.80
Effective
0.66
Effective
0.64
leaving
0.61
farewell
0.60
effective
0.59
after
0.58
amic
0.58
amicable
0.57
termination
0.56
Activations Density 0.009%