INDEX
Negative Logits
하지
0.60
expressing
0.57
sounded
0.56
intang
0.55
Express
0.54
banging
0.53
bangs
0.52
admitting
0.52
رسمی
0.52
edly
0.51
POSITIVE LOGITS
answer
0.94
answers
0.93
回應
0.89
answered
0.89
ответы
0.86
beantwort
0.86
resposta
0.85
ответить
0.83
answer
0.83
Antworten
0.83
Activations Density 0.206%