INDEX
Explanations
instances where investigations or examinations are being conducted
occurrences of the word "probe" in various contexts
Negative Logits
Extrem        -0.68
Present       -0.63
compe         -0.62
Hurricanes    -0.62
awed          -0.62
reditary      -0.61
perm          -0.61
dec           -0.60
Production    -0.60
Dub           -0.59
Positive Logits
probe            1.36
probes           1.25
Probe            1.13
probing          1.12
inquiry          0.86
investigation    0.84
Inquiry          0.74
investigating    0.74
dred             0.70
igate            0.69
Activations Density: 0.010%