INDEX
Negative Logits
izielle
0.36
лого
0.36
颔
0.35
şehir
0.35
閲
0.34
క్తి
0.34
enkele
0.34
园
0.33
荚
0.33
伍章
0.33
POSITIVE LOGITS
code
0.41
fundamental
0.38
code
0.37
principles
0.33
↵
0.33
্জ
0.32
offence
0.32
ribut
0.32
↵↵↵↵↵↵↵↵↵
0.31
offense
0.31
Activations Density 0.000%