INDEX
Negative Logits
مالك
-0.09
인은
-0.08
rapproche
-0.08
barrage
-0.08
Buen
-0.08
그리고
-0.08
narration
-0.07
_SECURITY
-0.07
peripheral
-0.07
اور
-0.07
POSITIVE LOGITS
-ish
0.08
visited
0.08
-like
0.08
principle
0.07
soort
0.07
-type
0.07
Principal
0.07
itive
0.07
danh
0.07
ustes
0.07
Activations Density 0.128%