INDEX
Negative Logits
십니까
0.69
zig
0.66
KC
0.66
borist
0.65
legitim
0.65
/"><
0.63
येलो
0.62
AKA
0.62
madad
0.61
ющее
0.60
POSITIVE LOGITS
else
1.02
else
0.95
Else
0.85
球
0.73
↵
0.68
ELSE
0.67
सार
0.65
↵↵
0.64
را
0.64
Стаўкі
0.64
Activations Density 0.322%