Explanations
instances where deletion or removal of something is mentioned
occurrences of the word "delete" and related phrases indicating removal or deletion of accounts and content
Negative Logits
annis          -0.81
ebus           -0.80
orsi           -0.73
heit           -0.72
negotiators    -0.71
asio           -0.68
Building       -0.67
verning        -0.66
croft          -0.65
acs            -0.65
Positive Logits
deleted        0.89
delet          0.88
delete         0.83
Delete         0.83
unnecessary    0.72
deleting       0.69
delete         0.68
scrolls        0.68
username       0.67
aneous         0.65
Activations Density 0.062%