INDEX
Explanations
describing appearance or quality
New Auto-Interp
Negative Logits
Thông
0.83
За
0.80
Во
0.71
кри
0.71
Про
0.70
У
0.67
Вы
0.67
localização
0.65
Бе
0.65
Б
0.65
POSITIVE LOGITS
ish
0.89
esque
0.85
type
0.84
type
0.83
like
0.76
ish
0.74
এর
0.72
like
0.70
based
0.69
looking
0.67
Activations Density 0.056%