INDEX
Negative Logits
olic
-0.07
FAMILY
-0.07
ury
-0.06
orial
-0.06
lazy
-0.06
-Muslim
-0.06
ritz
-0.06
ния
-0.06
links
-0.06
ink
-0.06
POSITIVE LOGITS
most
0.11
Most
0.08
Most
0.08
最
0.08
иболее
0.07
ldkf
0.07
самый
0.07
가장
0.07
diyor
0.07
最
0.07
Activations Density 0.033%