INDEX
Negative Logits
EXAMPLE
-0.07
irim
-0.07
brigade
-0.07
Examples
-0.07
وس
-0.07
pw
-0.07
Remarks
-0.06
conservatives
-0.06
avit
-0.06
.pem
-0.06
POSITIVE LOGITS
IZED
0.07
developer
0.06
_es
0.06
.Alignment
0.06
TURN
0.06
_runner
0.06
formation
0.06
Jason
0.06
uncated
0.06
tipo
0.06
Activations Density 0.005%