INDEX
Negative Logits
violations    0.40
🧜            0.39
extremist     0.39
имущества     0.39
चर्चित        0.39
activations   0.38
bohemian      0.37
momenta       0.37
grounds       0.37
outdated      0.37
Positive Logits
refrigeration   0.72
electricity     0.71
penicillin      0.70
telephones      0.70
electricity     0.68
antibiotics     0.67
изобре          0.63
gunpowder       0.63
invention       0.62
Electricity     0.62
Activations Density 0.046%