INDEX
NEGATIVE LOGITS
Token       Logit
�           -0.08
favor       -0.08
favor       -0.08
ROW         -0.07
pcb         -0.07
Sr          -0.07
ROWS        -0.07
Favor       -0.07
709         -0.07
rego        -0.07
POSITIVE LOGITS
Token        Logit
sentences    0.11
句           0.11
Sentence     0.11
_sentence    0.10
sentence     0.10
משפט         0.10
subordinate  0.10
Sentence     0.10
bağlant      0.09
বাক          0.09
Activations Density: 0.025%