INDEX
Explanations
phrases related to physical destruction and injury
words associated with violence and destruction
New Auto-Interp
Negative Logits
eq
-0.68
href
-0.66
cest
-0.63
Flavoring
-0.63
tein
-0.63
rosso
-0.61
alone
-0.59
phr
-0.59
Sol
-0.58
argon
-0.58
POSITIVE LOGITS
iHUD
0.79
tered
0.67
anew
0.66
stretched
0.65
deteriorated
0.65
hinges
0.65
aeus
0.65
arie
0.62
Fargo
0.60
hement
0.59
Activations Density 0.715%