INDEX
Negative Logits
ParallelGroup
0.62
собенности
0.62
ہتھی
0.59
עי
0.58
aufen
0.57
matches
0.57
Doss
0.56
فيذ
0.56
PostMapping
0.55
Match
0.55
POSITIVE LOGITS
sorry
3.66
Sorry
3.61
apologies
3.56
Sorry
3.39
sorry
3.34
apologize
3.20
apology
3.12
apologise
2.91
apologized
2.87
apologizing
2.73
Activations Density 0.388%