INDEX
Negative Logits
ük
0.42
利用
0.42
الفر
0.40
majoring
0.39
sidelined
0.39
ब्रैकेट
0.39
Unfortunately
0.37
неболь
0.37
<sub>
0.36
dependent
0.36
POSITIVE LOGITS
siquiera
0.74
coher
0.67
coherent
0.63
explicação
0.55
consciously
0.54
表情
0.50
fysis
0.49
Worte
0.49
comprehend
0.48
woorden
0.48
Activations Density 0.041%