Negative Logits
orchestration    0.46
p                0.42
परे              0.41
dock             0.40
proxy            0.40
paraphrase       0.40
seda             0.40
solidaridad      0.40
orrow            0.39
undermine        0.39
Positive Logits
Samuel           0.47
العن             0.45
Cheese           0.44
இறை              0.44
മുഹമ്മ           0.43
Cheese           0.42
Muslim           0.42
Е                0.41
শিশু             0.41
क्षमता           0.40
Activations Density 0.011%