INDEX
Explanations
phrases that indicate harm or damage to objects or entities
New Auto-Interp
Negative Logits
referenties
-0.82
roxene
-0.65
Boyer
-0.64
hujan
-0.63
Underline
-0.61
edoc
-0.60
autorytatywna
-0.59
Fowler
-0.58
Finley
-0.57
Према
-0.57
POSITIVE LOGITS
damage
1.74
damages
1.71
DAMAGE
1.65
Damage
1.64
Damages
1.63
Damage
1.53
damage
1.53
DAMAGES
1.46
Damaged
1.38
damaged
1.37
Activations Density 0.087%