INDEX
Explanations
terms related to punishment and its consequences
New Auto-Interp
Negative Logits
surla
-0.76
.*")]
-0.73
autorytatywna
-0.72
thren
-0.69
quiries
-0.68
httphttps
-0.67
Fle
-0.66
MigrationBuilder
-0.66
ׂ
-0.64
RegistryLite
-0.64
POSITIVE LOGITS
punish
1.58
punishment
1.51
punishments
1.47
penalties
1.46
punished
1.44
reward
1.44
penalty
1.40
Penalties
1.35
punishing
1.30
Penalty
1.27
Activations Density 0.212%