INDEX
Explanations
high quality and good things
New Auto-Interp
Negative Logits
illiterate
0.45
existencia
0.42
sogenannte
0.42
powerless
0.42
cytoplasm
0.41
ignorance
0.41
dites
0.41
meaningless
0.40
existence
0.40
impair
0.40
POSITIVE LOGITS
excellent
0.82
excelente
0.74
बेहतरीन
0.70
മികച്ച
0.69
отлич
0.67
excellent
0.66
хороший
0.66
좋은
0.65
excelentes
0.65
excellente
0.64
Activations Density 0.313%