INDEX
Explanations
phrases related to causing harm or damage
phrases that indicate harm or damage to entities or systems
New Auto-Interp
Negative Logits
DragonMagazine
-0.72
soDeliveryDate
-0.69
iguous
-0.66
Edited
-0.66
wine
-0.65
lich
-0.63
reflect
-0.63
atonin
-0.62
fw
-0.62
isson
-0.62
POSITIVE LOGITS
entire
1.09
livelihood
0.99
credibility
0.97
sensibilities
0.88
incumb
0.85
reputation
0.85
whole
0.85
effectiveness
0.83
morale
0.83
psyche
0.83
Activations Density 0.307%