INDEX
Negative Logits
áh
-0.07
agit
-0.06
Gret
-0.06
theft
-0.06
universities
-0.06
row
-0.06
often
-0.06
divide
-0.06
agreement
-0.06
.Undef
-0.06
POSITIVE LOGITS
..\
0.08
0.07
nhắc
0.07
및
0.06
Reviewer
0.06
').'</
0.06
ebp
0.06
_beh
0.06
rl
0.06
barang
0.06
Activations Density 0.007%