INDEX
Negative Logits
praised
0.44
律师
0.42
सावधान
0.42
praising
0.40
মৃত্যুদ
0.40
numerals
0.40
unambiguously
0.38
പരിശ
0.38
ప్రేక్షకు
0.38
eliminating
0.38
POSITIVE LOGITS
theories
1.84
theory
1.71
Theories
1.64
teorías
1.63
теор
1.60
teoria
1.59
teorie
1.57
Theory
1.52
théorie
1.51
teoría
1.50
Activations Density 0.027%