INDEX
Negative Logits
esempio
0.44
us
0.42
im
0.42
RN
0.41
irm
0.41
ancest
0.40
}))$
0.40
rm
0.40
ARE
0.40
government
0.39
POSITIVE LOGITS
obfusc
0.47
همچنین
0.46
역할
0.44
plagiarism
0.43
gyroscope
0.43
planification
0.42
padding
0.42
adhesives
0.42
humanoid
0.41
stupidity
0.41
Activations Density 0.003%