INDEX
Negative Logits
Insurance
0.50
insurance
0.47
보험
0.47
insurance
0.46
保険
0.46
Insurance
0.45
INSURANCE
0.45
жили
0.45
Entities
0.45
GDPR
0.43
POSITIVE LOGITS
abnorm
0.44
此时
0.42
ausgew
0.41
পরাজিত
0.41
अत्य
0.40
cola
0.39
বাংলাদেশী
0.39
racist
0.39
Anger
0.38
Ruta
0.38
Activations Density 0.006%