INDEX
Negative Logits
instead
-0.08
दिशा
-0.08
clarified
-0.08
recommendations
-0.08
convictions
-0.07
اص
-0.07
reminders
-0.07
_READY
-0.07
निर्देश
-0.07
commandments
-0.07
POSITIVE LOGITS
acknowledge
0.10
acknowledging
0.10
acknowledges
0.09
acknowledged
0.09
공
0.09
imperfections
0.09
Vir
0.08
oefenen
0.08
인정
0.08
admitir
0.08
Activations Density 0.038%