INDEX
Explanations
references to administrative positions or roles
occurrences of the word "administrator."
New Auto-Interp
Negative Logits
çīĪ
-0.78
roads
-0.73
{"-0.73
PO
-0.72
eele
-0.71
False
-0.71
True
-0.70
DonaldTrump
-0.69
tra
-0.69
doors
-0.66
POSITIVE LOGITS
administrator
1.12
rators
0.91
admin
0.89
administ
0.88
intendent
0.85
stration
0.84
administrators
0.81
uthor
0.78
ially
0.76
iation
0.76
Activations Density 0.012%