Explanations
Key improvements and explanations
Negative Logits
uen    0.65
NAB    0.64
Man    0.64
ناد    0.63
Nub    0.62
Man    0.62
Nab    0.61
नर    0.61
yna    0.60
nad    0.60
Positive Logits
ста    0.65
GRAND    0.60
্সর    0.59
शा    0.59
conti    0.59
करेंसी    0.57
ستی    0.57
skim    0.57
ске    0.57
डोज    0.57
Activations Density 0.082%