INDEX
Explanations
references to investigations or examinations
instances of the word "scrutiny."
New Auto-Interp
Negative Logits
thus
-0.80
activated
-0.77
tre
-0.77
tein
-0.72
activate
-0.68
gran
-0.67
sell
-0.64
ird
-0.63
joining
-0.63
stead
-0.63
POSITIVE LOGITS
scrutiny
1.48
scrutin
0.88
é¾įå¥ij士
0.82
levied
0.77
oreAnd
0.76
Probe
0.76
wcsstore
0.76
dilig
0.75
examinations
0.75
terness
0.74
Activations Density 0.006%