INDEX
Explanations
avoid sensitive or negative content
New Auto-Interp
Negative Logits
சில
0.59
biraz
0.55
alcune
0.50
രണ്ട്
0.50
다양한
0.50
zwei
0.50
কিছুটা
0.49
Biraz
0.49
alguns
0.48
alcuni
0.47
POSITIVE LOGITS
unless
0.91
veya
0.90
или
0.88
任何
0.83
或其他
0.79
किंवा
0.78
любых
0.75
Unless
0.74
하거나
0.73
ANYTHING
0.73
Activations Density 0.146%