INDEX
Negative Logits
included
0.44
ட்கள்
0.41
iverso
0.39
OSA
0.39
alguno
0.38
缫
0.38
也好
0.38
liking
0.38
baked
0.38
disagreement
0.38
POSITIVE LOGITS
variable
0.55
unpredictable
0.53
Variable
0.51
変動
0.50
متغير
0.49
Variable
0.49
fluctuate
0.49
variable
0.48
variabile
0.46
fluctuating
0.46
Activations Density 0.000%