INDEX
Negative Logits
binding
-0.07
TV
-0.07
DOMAIN
-0.07
treat
-0.07
FD
-0.06
aro
-0.06
unre
-0.06
рован
-0.06
television
-0.06
Inform
-0.06
POSITIVE LOGITS
lüğü
0.08
0.07
rewrite
0.06
ũng
0.06
scraps
0.06
consect
0.06
MJ
0.06
Evo
0.06
itr
0.06
0.06
Activations Density 0.036%